Yuchen Cheng

rudeigerc

https://rudeigerc.dev

rudeigerc

AI & ML interests

MLSys

Recent Activity

liked a model 18 days ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

liked a model about 1 month ago

google/gemma-3-27b-it

liked a model about 1 month ago

microsoft/Phi-4-multimodal-instruct

View all activity

Organizations

None yet

rudeigerc's activity

liked a model 18 days ago

meta-llama/Llama-4-Scout-17B-16E-Instruct

Image-Text-to-Text • Updated 15 days ago • 753k • • 821

liked 4 models about 1 month ago

upvoted a paper about 2 months ago

Native Sparse Attention: Hardware-Aligned and Natively Trainable Sparse Attention

Paper • 2502.11089 • Published Feb 16 • 155

liked a Space 2 months ago

2.51k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

liked a model 7 months ago

Qwen/Qwen2.5-72B-Instruct

Text Generation • Updated Jan 12 • 175k • • 808

liked a dataset 7 months ago

openai/MMMLU

Viewer • Updated Oct 16, 2024 • 393k • 12.6k • 478

liked a model 7 months ago

jinaai/reader-lm-1.5b

Text Generation • Updated Jan 17 • 577 • 596

upvoted a paper 8 months ago

NanoFlow: Towards Optimal Large Language Model Serving Throughput

Paper • 2408.12757 • Published Aug 22, 2024 • 18

liked a model 8 months ago

microsoft/Phi-3.5-MoE-instruct

Text Generation • Updated Mar 7 • 34.6k • • 556

upvoted a paper 9 months ago

Transformer Explainer: Interactive Learning of Text-Generative Models

Paper • 2408.04619 • Published Aug 8, 2024 • 163

liked a model 9 months ago

google/gemma-2-2b-it

Text Generation • Updated Aug 27, 2024 • 355k • • 1.06k

upvoted a paper 9 months ago

The Llama 3 Herd of Models

Paper • 2407.21783 • Published Jul 31, 2024 • 115

liked 4 models 9 months ago

meta-llama/Llama-3.1-8B-Instruct

Text Generation • Updated Sep 25, 2024 • 6.07M • • 3.88k

sentence-transformers/all-MiniLM-L6-v2

mistralai/Mamba-Codestral-7B-v0.1

Updated Aug 23, 2024 • 5.45k • 583

mistralai/Mistral-Nemo-Instruct-2407

Text Generation • Updated Nov 6, 2024 • 122k • • 1.51k

upvoted a paper 9 months ago

Inference Performance Optimization for Large Language Models on CPUs

Paper • 2407.07304 • Published Jul 10, 2024 • 54