Reasoning Transfer

classroom

AI & ML interests

None defined yet.

Recent Activity

aaabiao authored a paper 2 days ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Ibisbill updated a model 4 days ago

ReasoningTransferability/UniReason-Qwen3-14B-no-think-SFT

Ibisbill updated a model 4 days ago

ReasoningTransferability/UniReason-Qwen3-14B-think-SFT

View all activity

aaabiao

authored a paper 2 days ago

TreePO: Bridging the Gap of Policy Optimization and Efficacy and Inference Efficiency with Heuristic Tree-based Modeling

Paper • 2508.17445 • Published 5 days ago • 70

Ibisbill

updated 3 models 4 days ago

aaabiao

updated a dataset about 2 months ago

ReasoningTransferability/math_rl_48k

Viewer • Updated Jul 11 • 48.8k • 209

aaabiao

published a dataset about 2 months ago

ReasoningTransferability/math_rl_48k

Viewer • Updated Jul 11 • 48.8k • 209

aaabiao

authored a paper about 2 months ago

First Return, Entropy-Eliciting Explore

Paper • 2507.07017 • Published Jul 9 • 23

Ibisbill

updated a dataset about 2 months ago

ReasoningTransferability/math_sft_40K

Viewer • Updated Jul 8 • 39.9k • 156 • 2

Ibisbill

published a dataset about 2 months ago

ReasoningTransferability/math_sft_40K

Viewer • Updated Jul 8 • 39.9k • 156 • 2

Ibisbill

in ReasoningTransferability/UniReason-Qwen3-14B-RL about 2 months ago

Add `library_name` metadata and GitHub link to model card

#1 opened about 2 months ago by

nielsr

Ibisbill

in ReasoningTransferability/UniReason-Qwen3-14B-think-SFT about 2 months ago

Add library_name and prominent link to GitHub repository

#1 opened about 2 months ago by

nielsr

Ibisbill

in ReasoningTransferability/UniReason-Qwen3-14B-no-think-SFT about 2 months ago

Add library name and GitHub link to model card

#1 opened about 2 months ago by

nielsr

Ibisbill

published 3 models about 2 months ago

ReasoningTransferability/UniReason-Qwen3-14B-no-think-SFT

Text Generation • 15B • Updated 4 days ago • 68 • 1

ReasoningTransferability/UniReason-Qwen3-14B-think-SFT

Text Generation • 15B • Updated 4 days ago • 50

ReasoningTransferability/UniReason-Qwen3-14B-RL

Text Generation • 15B • Updated 4 days ago • 93 • 3

yuexiang96

authored 5 papers about 2 months ago

Small Models Struggle to Learn from Strong Reasoners

Paper • 2502.12143 • Published Feb 17 • 40

Evaluating Vision-Language Models as Evaluators in Path Planning

Paper • 2411.18711 • Published Nov 27, 2024

VisualWebInstruct: Scaling up Multimodal Instruction Data through Web Search

Paper • 2503.10582 • Published Mar 13 • 24

Scaling Evaluation-time Compute with Reasoning Models as Process Evaluators

Paper • 2503.19877 • Published Mar 25 • 1

VisualPuzzles: Decoupling Multimodal Reasoning Evaluation from Domain Knowledge

Paper • 2504.10342 • Published Apr 14 • 11

AI & ML interests

Recent Activity

Team members 4