M Saad Salman

MSS444

AI & ML interests

None yet

Organizations

None yet

Recent Activity

upvoted 2 papers about 24 hours ago

A Survey on Large Language Model Benchmarks

Paper • 2508.15361 • Published 5 days ago • 18

AetherCode: Evaluating LLMs' Ability to Win In Premier Programming Competitions

Paper • 2508.16402 • Published 4 days ago • 9

upvoted a paper 7 days ago

Speed Always Wins: A Survey on Efficient Architectures for Large Language Models

Paper • 2508.09834 • Published 13 days ago • 48

upvoted a paper 8 days ago

Pass@k Training for Adaptively Balancing Exploration and Exploitation of Large Reasoning Models

Paper • 2508.10751 • Published 12 days ago • 25

upvoted 4 papers 11 days ago

GeRe: Towards Efficient Anti-Forgetting in Continual Learning of LLM via General Samples Replay

Paper • 2508.04676 • Published 19 days ago • 4

MathReal: We Keep It Real! A Real Scene Benchmark for Evaluating Math Reasoning in Multimodal Large Language Models

Paper • 2508.06009 • Published 18 days ago • 15

Learning to Align, Aligning to Learn: A Unified Approach for Self-Optimized Alignment

Paper • 2508.07750 • Published 15 days ago • 19

Can LLM-Generated Textual Explanations Enhance Model Classification Performance? An Empirical Study

Paper • 2508.09776 • Published 13 days ago • 3

upvoted 6 papers 15 days ago

StepFun-Formalizer: Unlocking the Autoformalization Potential of LLMs through Knowledge-Reasoning Fusion

Paper • 2508.04440 • Published 20 days ago • 9

Training Long-Context, Multi-Turn Software Engineering Agents with Reinforcement Learning

Paper • 2508.03501 • Published 21 days ago • 53

On the Generalization of SFT: A Reinforcement Learning Perspective with Reward Rectification

Paper • 2508.05629 • Published 18 days ago • 159

upvoted a paper 19 days ago

Agent Lightning: Train ANY AI Agents with Reinforcement Learning

Paper • 2508.03680 • Published 20 days ago • 61

upvoted a paper 25 days ago

Seed-Prover: Deep and Broad Reasoning for Automated Theorem Proving

Paper • 2507.23726 • Published 25 days ago • 108

upvoted a paper 26 days ago

Repair-R1: Better Test Before Repair

Paper • 2507.22853 • Published 26 days ago • 8

commented on a paper 29 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 290

upvoted a paper 29 days ago

Group Sequence Policy Optimization

Paper • 2507.18071 • Published Jul 24 • 290

upvoted a paper about 1 month ago

Can One Domain Help Others? A Data-Centric Study on Multi-Domain Reasoning via Reinforcement Learning

Paper • 2507.17512 • Published Jul 23 • 36
