9 117 248

YangWang92

yangwang92

AI & ML interests

None yet

Recent Activity

upvoted a paper 3 days ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

upvoted a collection 4 days ago

NVIDIA Nemotron

liked a model 4 days ago

inclusionAI/Rubicon-Preview

View all activity

Organizations

upvoted a paper 3 days ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published 6 days ago • 70

upvoted a collection 4 days ago

NVIDIA Nemotron

Collection

Open, Production-ready Enterprise Models. Nvidia Open Model license. • 3 items • Updated about 13 hours ago • 54

liked a model 4 days ago

inclusionAI/Rubicon-Preview

Text Generation • 31B • Updated 11 days ago • 89 • 16

liked a model 6 days ago

Motif-Technologies/optimizer

Updated 2 days ago • 39

liked 2 models 7 days ago

ByteDance-Seed/Seed-OSS-36B-Base

Text Generation • 36B • Updated 4 days ago • 1.24k • 47

ByteDance-Seed/Seed-OSS-36B-Instruct

Text Generation • 36B • Updated 4 days ago • 14.9k • 370

liked 2 models 8 days ago

microsoft/kosmos-2.5

Image-Text-to-Text • 1B • Updated 2 days ago • 1.89k • 220

meta-llama/Llama-3.2-3B

Text Generation • 3B • Updated Oct 24, 2024 • 559k • 624

liked 2 models 11 days ago

deepseek-ai/DeepSeek-V3.1-Base

Text Generation • 685B • Updated 4 days ago • 22.4k • 953

Qwen/Qwen3-1.7B

Text Generation • 2B • Updated Jul 26 • 1.12M • • 242

liked a dataset 11 days ago

ByteDance-Seed/mga-fineweb-edu

Viewer • Updated May 19 • 846M • 2.23k • 32

upvoted a paper 12 days ago

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 150

upvoted a collection 12 days ago

Recurrent Models

Collection

These are checkpoints for recurrent LLMs developed to scale test-time compute by recurring in latent space. • 15 items • Updated May 21 • 10

liked a dataset 12 days ago

tomg-group-umd/huginn-dataset

Viewer • Updated Jul 15 • 274M • 2.3k • 6

liked a model 13 days ago

Qwen/Qwen3-4B-Base

Text Generation • 4B • Updated Jul 26 • 4.88M • 52

upvoted a collection 15 days ago

Web-SSL

Collection

17 items • Updated Apr 23 • 19

liked a model 15 days ago

Dream-org/DreamOn-v0-7B

Text Generation • 8B • Updated Jul 15 • 839 • 15

liked a model 18 days ago

Qwen/Qwen3-8B-Base

Text Generation • 8B • Updated May 21 • 1.35M • • 52

liked 2 datasets 24 days ago

SWE-Swiss/SWESwiss-SFT-Unittest-1K

Viewer • Updated 25 days ago • 1.02k • 250 • 2

SWE-bench/SWE-smith

Viewer • Updated 5 days ago • 59.1k • 3.39k • 36

YangWang92

AI & ML interests

Recent Activity

Organizations

yangwang92's activity