Monard Juliet's picture

3

Monard Juliet

Mzzzhu

Mzzzhu

AI & ML interests

None yet

Recent Activity

upvoted a paper about 2 months ago

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

upvoted a paper 6 months ago

Constraint Back-translation Improves Complex Instruction Following of Large Language Models

upvoted a paper 6 months ago

Pre-training Distillation for Large Language Models: A Design Space Exploration

View all activity

Organizations

None yet

Mzzzhu's activity

upvoted a paper about 2 months ago

Agentic Reward Modeling: Integrating Human Preferences with Verifiable Correctness Signals for Reliable Reward Systems

Paper • 2502.19328 • Published Feb 26 • 22

upvoted 2 papers 6 months ago

Constraint Back-translation Improves Complex Instruction Following of Large Language Models

Paper • 2410.24175 • Published Oct 31, 2024 • 18

Pre-training Distillation for Large Language Models: A Design Space Exploration

Paper • 2410.16215 • Published Oct 21, 2024 • 16