YangWang92's picture

YangWang92

yangwang92

·

AI & ML interests

None yet

Recent Activity

liked a model about 19 hours ago

microsoft/kosmos-2.5

liked a model about 22 hours ago

meta-llama/Llama-3.2-3B

liked a model 4 days ago

deepseek-ai/DeepSeek-V3.1-Base

View all activity

Organizations

upvoted a paper 5 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 149

upvoted a collection 5 days ago

Recurrent Models

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space. • 15 items • Updated May 21 • 10

upvoted a collection 8 days ago

Web-SSL

17 items • Updated Apr 23 • 19

upvoted a collection 24 days ago

Physics of Language Models: Part 4.2

16 items • Updated 24 days ago • 3

upvoted a collection 26 days ago

GLM-4.5

GLM-4.5: An open-source large language model designed for intelligent agents by Z.ai • 11 items • Updated 12 days ago • 218

upvoted a paper 29 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published 30 days ago • 289

upvoted 3 papers about 1 month ago

Step-Audio 2 Technical Report

Paper • 2507.16632 • Published Jul 22 • 60

Scaling Laws for Optimal Data Mixtures

Paper • 2507.09404 • Published Jul 12 • 34

Not All Correct Answers Are Equal: Why Your Distillation Source Matters

Paper • 2505.14464 • Published May 20 • 9

upvoted a collection about 1 month ago

AM-Distilled-Dataset

AM-Distilled-Dataset • 5 items • Updated Jun 5 • 3

upvoted a paper about 1 month ago

PyVision: Agentic Vision with Dynamic Tooling

Paper • 2507.07998 • Published Jul 10 • 31

upvoted an article about 1 month ago

Article

SmolLM3: smol, multilingual, long-context reasoner

By

and 22 others •

Jul 8

• 635

upvoted a paper about 2 months ago

Skywork-Reward-V2: Scaling Preference Data Curation via Human-AI Synergy

Paper • 2507.01352 • Published Jul 2 • 53

upvoted a collection about 2 months ago

Skywork-Reward-V2

Scaling preference data curation to the extreme • 9 items • Updated Jul 4 • 23

upvoted 2 papers about 2 months ago

Does Math Reasoning Improve General LLM Capabilities? Understanding Transferability of LLM Reasoning

Paper • 2507.00432 • Published Jul 1 • 73

Pangu Pro MoE: Mixture of Grouped Experts for Efficient Sparsity

Paper • 2505.21411 • Published May 27 • 17

upvoted an article 2 months ago

Article

Large-scale Near-deduplication Behind BigCode

By

•

May 16, 2023

• 32

upvoted 3 papers 2 months ago

DataDecide: How to Predict Best Pretraining Data with Small Experiments

Paper • 2504.11393 • Published Apr 15 • 18

Reinforcement Learning with Verifiable Rewards Implicitly Incentivizes Correct Reasoning in Base LLMs

Paper • 2506.14245 • Published Jun 17 • 40

SwS: Self-aware Weakness-driven Problem Synthesis in Reinforcement Learning for LLM Reasoning

Paper • 2506.08989 • Published Jun 10 • 15