Light-R1: Curriculum SFT, DPO and RL for Long COT from Scratch and Beyond Paper • 2503.10460 • Published Mar 13 • 27
hub_datasets_3d Collection 3d related datasets from Hugging Face Hub • 8 items • Updated Dec 13, 2024 • 1
PERL: Parameter Efficient Reinforcement Learning from Human Feedback Paper • 2403.10704 • Published Mar 15, 2024 • 60
ViewDiff: 3D-Consistent Image Generation with Text-to-Image Models Paper • 2403.01807 • Published Mar 4, 2024 • 9
EasyQuant: An Efficient Data-free Quantization Algorithm for LLMs Paper • 2403.02775 • Published Mar 5, 2024 • 13
Feast Your Eyes: Mixture-of-Resolution Adaptation for Multimodal Large Language Models Paper • 2403.03003 • Published Mar 5, 2024 • 11