ldwang's picture

ldwang

ldwang

·

ftgreat

AI & ML interests

LLM, MLLM, Infra

Recent Activity

upvoted a paper about 3 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

liked a model about 3 hours ago

agents-course/notebooks

liked a dataset 3 days ago

nvidia/Nemotron-Pretraining-SFT-v1

View all activity

Organizations

upvoted a paper about 3 hours ago

FineWeb2: One Pipeline to Scale Them All -- Adapting Pre-Training Data Processing to Every Language

Paper • 2506.20920 • Published Jun 26 • 69

upvoted a collection 14 days ago

MiscAgentic

3 items • Updated 14 days ago • 1

upvoted a paper 14 days ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 164

upvoted an article 15 days ago

Article

Introducing smolagents: simple agents that write actions in code.

By

and 2 others •

Dec 31, 2024

• 1.11k

upvoted a paper 18 days ago

Trainable Dynamic Mask Sparse Attention

Paper • 2508.02124 • Published 19 days ago • 15

upvoted a paper 29 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published about 1 month ago • 289

upvoted a paper about 1 month ago

Revisiting Reinforcement Learning for LLM Reasoning from A Cross-Domain Perspective

Paper • 2506.14965 • Published Jun 17 • 49

upvoted a collection about 1 month ago

Kimi-K2

Moonshot's MoE LLMs with 1 trillion parameters, exceptional on agentic intellegence • 2 items • Updated Jul 12 • 116

upvoted 2 papers about 1 month ago

GoT: Unleashing Reasoning Capability of Multimodal Large Language Model for Visual Generation and Editing

Paper • 2503.10639 • Published Mar 13 • 52

Rope to Nope and Back Again: A New Hybrid Attention Strategy

Paper • 2501.18795 • Published Jan 30 • 6

upvoted an article about 1 month ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

Jul 8

• 635

upvoted a paper about 1 month ago

RoboBrain 2.0 Technical Report

Paper • 2507.02029 • Published Jul 2 • 30

upvoted 2 collections about 2 months ago

MiscKernel

7 items • Updated Jul 8 • 1

MiscIndustry

1 item • Updated Jul 7 • 1

upvoted a paper about 2 months ago

OmniGen2: Exploration to Advanced Multimodal Generation

Paper • 2506.18871 • Published Jun 23 • 74

upvoted 5 papers 2 months ago

Essential-Web v1.0: 24T tokens of organized web data

Paper • 2506.14111 • Published Jun 17 • 43

Infinity Instruct: Scaling Instruction Selection and Synthesis to Enhance Language Models

Paper • 2506.11116 • Published Jun 9 • 5

MiniCPM4: Ultra-Efficient LLMs on End Devices

Paper • 2506.07900 • Published Jun 9 • 90

CCI4.0: A Bilingual Pretraining Dataset for Enhancing Reasoning in Large Language Models

Paper • 2506.07463 • Published Jun 9 • 10

The Common Pile v0.1: An 8TB Dataset of Public Domain and Openly Licensed Text

Paper • 2506.05209 • Published Jun 5 • 46