Article: A Gentle Introduction to 8-bit Matrix Multiplication for transformers at scale using transformers, accelerate and bitsandbytes • By ybelkada and 1 other • Aug 17, 2022 (quantization sketch below)
Article: Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA • By ybelkada and 4 others • May 24, 2023
Paper: Critique Fine-Tuning: Learning to Critique is More Effective than Learning to Imitate • arXiv:2501.17703 • Published Jan 29, 2025
Paper: Parameters vs FLOPs: Scaling Laws for Optimal Sparsity for Mixture-of-Experts Language Models • arXiv:2501.12370 • Published Jan 21, 2025
Article: Open-R1: a fully open reproduction of DeepSeek-R1 • By eliebak and 2 others • Jan 28, 2025
Article: Fine-tune ModernBERT for RAG with Synthetic Data • By sdiazlor and 2 others • Jan 20, 2025
Article: Preference Tuning LLMs with Direct Preference Optimization Methods • By kashif and 4 others • Jan 18, 2024 (DPO sketch below)
Collection: Preference Datasets for DPO • curated preference datasets for DPO fine-tuning for intent alignment of LLMs • 7 items • Updated Dec 11, 2024
Article: Train 400x faster Static Embedding Models with Sentence Transformers • By tomaarsen • Jan 15, 2025 (embedding sketch below)
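The two bitsandbytes articles at the top of the list cover 8-bit (LLM.int8()) and 4-bit (QLoRA/NF4) model loading through transformers. A minimal sketch of what that loading typically looks like, assuming a CUDA machine with transformers, accelerate, and bitsandbytes installed; the facebook/opt-350m model ID is only an illustrative choice, not one prescribed by the articles:

```python
import torch
from transformers import AutoModelForCausalLM, AutoTokenizer, BitsAndBytesConfig

model_id = "facebook/opt-350m"  # small example model; any causal LM on the Hub works

# 4-bit NF4 quantization as described in the QLoRA article; pass load_in_8bit=True
# instead for the LLM.int8() scheme from the 8-bit matmul article.
bnb_config = BitsAndBytesConfig(
    load_in_4bit=True,
    bnb_4bit_quant_type="nf4",
    bnb_4bit_compute_dtype=torch.bfloat16,
)

tokenizer = AutoTokenizer.from_pretrained(model_id)
model = AutoModelForCausalLM.from_pretrained(
    model_id,
    quantization_config=bnb_config,
    device_map="auto",  # let accelerate place the quantized weights
)

prompt = "Quantization lets large models fit on small GPUs because"
inputs = tokenizer(prompt, return_tensors="pt").to(model.device)
output = model.generate(**inputs, max_new_tokens=30)
print(tokenizer.decode(output[0], skip_special_tokens=True))
```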
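The DPO article and the preference-dataset collection pair naturally with TRL's DPOTrainer. A minimal sketch, assuming a recent TRL release (0.12+, where the tokenizer argument is named processing_class; older releases call it tokenizer); the model ID and the trl-lib/ultrafeedback_binarized dataset are illustrative stand-ins rather than items taken from the collection above:

```python
from datasets import load_dataset
from transformers import AutoModelForCausalLM, AutoTokenizer
from trl import DPOConfig, DPOTrainer

model_id = "Qwen/Qwen2.5-0.5B-Instruct"  # example base model, not prescribed by the article
# Example preference dataset with chosen/rejected pairs; swap in any set from the collection.
dataset = load_dataset("trl-lib/ultrafeedback_binarized", split="train")

model = AutoModelForCausalLM.from_pretrained(model_id)
tokenizer = AutoTokenizer.from_pretrained(model_id)

training_args = DPOConfig(
    output_dir="dpo-example",
    beta=0.1,  # strength of the KL-style penalty toward the reference model
    per_device_train_batch_size=2,
)
trainer = DPOTrainer(
    model=model,
    args=training_args,
    train_dataset=dataset,
    processing_class=tokenizer,
)
trainer.train()
```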
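The static-embedding article trains models that encode text with a plain token-embedding lookup and pooling instead of a transformer forward pass, which is where the speedup comes from. A minimal usage sketch; the model ID sentence-transformers/static-retrieval-mrl-en-v1 is, to my understanding, the English retrieval model released alongside that post, and model.similarity assumes sentence-transformers 3.x or newer:

```python
from sentence_transformers import SentenceTransformer

# Static embedding model: no attention layers, so it runs quickly even on CPU.
model = SentenceTransformer("sentence-transformers/static-retrieval-mrl-en-v1")

queries = ["how does 4-bit quantization work?"]
docs = [
    "QLoRA fine-tunes adapters on top of a 4-bit quantized base model.",
    "Static embedding models skip the transformer and average token vectors.",
]

query_embeddings = model.encode(queries)
doc_embeddings = model.encode(docs)
print(model.similarity(query_embeddings, doc_embeddings))  # similarity matrix, shape (1, 2)
```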