1 47 13

Le Huy Hoang

splendor1811

huyhoang18112k2

AI & ML interests

Computer Vision

Recent Activity

updated a dataset 3 days ago

splendor1811/deploy-chatbot-external

published a dataset 15 days ago

splendor1811/deploy-chatbot-external

updated a model 15 days ago

splendor1811/qwen3-30b-awq

View all activity

Organizations

None yet

updated a dataset 3 days ago

splendor1811/deploy-chatbot-external

Updated 15 days ago • 108

published a dataset 15 days ago

splendor1811/deploy-chatbot-external

Updated 15 days ago • 108

updated a model 15 days ago

splendor1811/qwen3-30b-awq

5B • Updated 15 days ago • 15

upvoted a collection 23 days ago

Qwen3

Collection

84 items • Updated 16 days ago • 1.12k

published a model 25 days ago

splendor1811/qwen3-30b-awq

5B • Updated 15 days ago • 15

upvoted 3 articles 25 days ago

Article

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

•

Feb 7

• 209

Article

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

•

Mar 17

• 328

Article

I trained a Language Model to schedule events with GRPO!

•

Apr 29

• 85

liked a dataset about 1 month ago

NousResearch/Hermes-3-Dataset

Viewer • Updated Jul 11 • 959k • 4.95k • 278

upvoted a paper about 2 months ago

MiniMax-M1: Scaling Test-Time Compute Efficiently with Lightning Attention

Paper • 2506.13585 • Published Jun 16 • 262

upvoted a collection 3 months ago

Qwen3-Embedding

Collection

6 items • Updated Jul 21 • 119

updated a model 3 months ago

splendor1811/BGE-base-banking-ONE-v0106

0.6B • Updated Jun 1 • 7

published a model 3 months ago

splendor1811/BGE-base-banking-ONE-v0106

0.6B • Updated Jun 1 • 7

upvoted an article 3 months ago

Article

Vision Language Models (Better, Faster, Stronger)

and 4 others •

May 12

• 510

updated a Space 5 months ago

AlfredAgent

📚

published a Space 5 months ago

AlfredAgent

📚

updated a model 6 months ago

splendor1811/gemma-2-2B-it-thinking_FC

Updated Mar 2

published a model 6 months ago

splendor1811/gemma-2-2B-it-thinking_FC

Updated Mar 2

liked a Space 6 months ago

3.1k

The Ultra-Scale Playbook

🌌

The ultimate guide to training LLM on large GPU Clusters

upvoted a paper 6 months ago

TransMLA: Multi-head Latent Attention Is All You Need

Paper • 2502.07864 • Published Feb 11 • 57

Le Huy Hoang

AI & ML interests

Recent Activity

Organizations

splendor1811's activity

DeepSeek-R1 Dissection: Understanding PPO & GRPO Without Any Prior Reinforcement Learning Knowledge

🦸🏻#14: What Is MCP, and Why Is Everyone – Suddenly!– Talking About It?

I trained a Language Model to schedule events with GRPO!

Vision Language Models (Better, Faster, Stronger)

AlfredAgent

AlfredAgent

The Ultra-Scale Playbook