qinluo (qinluo)

upvoted a paper 9 months ago

O1 Replication Journey -- Part 2: Surpassing O1-preview through Simple Distillation, Big Progress or Bitter Lesson?

Paper • 2411.16489 • Published Nov 25, 2024 • 48

upvoted an article 9 months ago

Releasing the largest multilingual open pretraining dataset

Article • 3 authors • Nov 13, 2024 • 102

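upvoted 3 papers 10 months ago

TableGPT2: A Large Multimodal Model with Tabular Data Integration

OpenCoder: The Open Cookbook for Top-Tier Code Large Language Models

BitNet a4.8: 4-bit Activations for 1-bit LLMs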

upvoted 3 papers 11 months ago

General Preference Modeling with Preference Representations for Aligning Language Models

Paper • 2410.02197 • Published Oct 3, 2024 • 9

Let's Verify Step by Step

Paper • 2305.20050 • Published May 31, 2023 • 11

V-STaR: Training Verifiers for Self-Taught Reasoners

Paper • 2402.06457 • Published Feb 9, 2024 • 9

upvoted 2 papers 12 months ago

LongRecipe: Recipe for Efficient Long Context Generalization in Large Language Models

Paper • 2409.00509 • Published Aug 31, 2024 • 43

RegMix: Data Mixture as Regression for Language Model Pre-training

Paper • 2407.01492 • Published Jul 1, 2024 • 41

upvoted an article about 1 year ago

SmolLM - blazingly fast and remarkably powerful

Article • 3 authors • Jul 16, 2024 • 411

upvoted a paper about 1 year ago

The FineWeb Datasets: Decanting the Web for the Finest Text Data at Scale

Paper • 2406.17557 • Published Jun 25, 2024 • 98

upvoted a collection about 1 year ago

AnyTaskTune-Psychology

Collection

Elevating Domain-Specific Model Performance with Precision Task-Specific Fine-Tuning • 3 items • Updated Jun 7, 2024 • 3

upvoted a paper about 1 year ago

Xmodel-LM Technical Report

Paper • 2406.02856 • Published Jun 5, 2024 • 11

upvoted 4 papers over 1 year ago

FP6-LLM: Efficiently Serving Large Language Models Through FP6-Centric Algorithm-System Co-Design

Paper • 2401.14112 • Published Jan 25, 2024 • 21

Decoding Compressed Trust: Scrutinizing the Trustworthiness of Efficient LLMs Under Compression

Paper • 2403.15447 • Published Mar 18, 2024 • 16

WildChat: 1M ChatGPT Interaction Logs in the Wild

Paper • 2405.01470 • Published May 2, 2024 • 63

KAN: Kolmogorov-Arnold Networks

Paper • 2404.19756 • Published Apr 30, 2024 • 114

upvoted 2 articles over 1 year ago

⚗️ 🧑🏼‍🌾 Let's grow some Domain Specific Datasets together

Article • Apr 29, 2024 • 29

🦙⚗️ Using Llama3 and distilabel to build fine-tuning datasets

Article • Jun 4, 2024 • 79
