Jisoo Kim

kuotient

AI & ML interests

NLP

Recent Activity

upvoted a collection 10 days ago

Tool Use Reasoning

Collection

A collection of tool use reasoning dataset in Hermes format • 5 items • Updated about 1 month ago • 8

upvoted an article 11 days ago

ChatML vs Harmony: Understanding the new Format from OpenAI 🔍

Article • 13 days ago • 26

upvoted an article 13 days ago

Accelerate ND-Parallel: A Guide to Efficient Multi-GPU Training

Article • and 4 others • 15 days ago • 50

upvoted an article 5 months ago

Training Large Language Models with Interpreter Feedback using WebAssembly

Article • and 1 other • Apr 3 • 13

upvoted 2 papers 6 months ago

R1-Searcher: Incentivizing the Search Capability in LLMs via Reinforcement Learning

Paper • 2503.05592 • Published Mar 7 • 27

Kanana: Compute-efficient Bilingual Language Models

Paper • 2502.18934 • Published Feb 26 • 66

upvoted a paper 8 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 373

upvoted an article 9 months ago

The Beginners Guide to Cleaning a Dataset

Article • Nov 18, 2024 • 24

upvoted 2 papers 11 months ago

Training Language Models to Self-Correct via Reinforcement Learning

Paper • 2409.12917 • Published Sep 19, 2024 • 141

Qwen2-VL: Enhancing Vision-Language Model's Perception of the World at Any Resolution

Paper • 2409.12191 • Published Sep 18, 2024 • 78

upvoted an article 12 months ago

Efficient Deep Learning: A Comprehensive Overview of Optimization Techniques 👐 📚

Article • Aug 26, 2024 • 71

upvoted a paper 12 months ago

The Mamba in the Llama: Distilling and Accelerating Hybrid Models

Paper • 2408.15237 • Published Aug 27, 2024 • 43

upvoted 2 articles about 1 year ago

The Rise of Agentic Data Generation

Article • Jul 15, 2024 • 84

How NuminaMath Won the 1st AIMO Progress Prize

Article • and 7 others • Jul 11, 2024 • 122

upvoted a paper about 1 year ago

Qwen2 Technical Report

Paper • 2407.10671 • Published Jul 15, 2024 • 167

upvoted an article about 1 year ago

Putting RL back in RLHF

Article • and 1 other • Jun 12, 2024 • 100

upvoted a paper about 1 year ago

Instruction Pre-Training: Language Models are Supervised Multitask Learners

Paper • 2406.14491 • Published Jun 20, 2024 • 95

upvoted 2 collections about 1 year ago

Qwen2

Collection

Qwen2 language models, including pretrained and instruction-tuned models of 5 sizes, including 0.5B, 1.5B, 7B, 57B-A14B, and 72B. • 39 items • Updated Jul 21 • 368

Alpha Llama-3 collection

Collection

5 items • Updated Jan 15 • 2

upvoted an article over 1 year ago

Can We Train Chat Models with Raw Data?

Article • Apr 25, 2024 • 19
