1 26 4

Youssef Hosni

YoussefHosni

AI & ML interests

None yet

Recent Activity

updated a model 3 days ago

YoussefHosni/gemma-3

published a model 3 days ago

YoussefHosni/gemma-3

liked a dataset 4 days ago

abuelkhair-corpus/arabic_billion_words

View all activity

Organizations

None yet

YoussefHosni's activity

upvoted 3 papers 14 days ago

Open-Reasoner-Zero: An Open Source Approach to Scaling Up Reinforcement Learning on the Base Model

Paper • 2503.24290 • Published 23 days ago • 62

Any2Caption:Interpreting Any Condition to Caption for Controllable Video Generation

Paper • 2503.24379 • Published 22 days ago • 75

Envisioning Beyond the Pixels: Benchmarking Reasoning-Informed Visual Editing

Paper • 2504.02826 • Published 19 days ago • 67

upvoted a paper 22 days ago

I Have Covered All the Bases Here: Interpreting Reasoning Features in Large Language Models via Sparse Autoencoders

Paper • 2503.18878 • Published 30 days ago • 117

upvoted a paper about 1 month ago

SmolDocling: An ultra-compact vision-language model for end-to-end multi-modal document conversion

Paper • 2503.11576 • Published Mar 14 • 97

upvoted an article about 1 month ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

• 398

upvoted 13 papers about 2 months ago

CODESYNC: Synchronizing Large Language Models with Dynamic Code Evolution at Scale

Paper • 2502.16645 • Published Feb 23 • 22

Self-rewarding correction for mathematical reasoning

Paper • 2502.19613 • Published Feb 26 • 84

Can Large Language Models Detect Errors in Long Chain-of-Thought Reasoning?

Paper • 2502.19361 • Published Feb 26 • 28

TheoremExplainAgent: Towards Multimodal Explanations for LLM Theorem Understanding

Paper • 2502.19400 • Published Feb 26 • 49

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25 • 56

SWE-RL: Advancing LLM Reasoning via Reinforcement Learning on Open Software Evolution

Paper • 2502.18449 • Published Feb 25 • 74

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 73

Multimodal Inconsistency Reasoning (MMIR): A New Benchmark for Multimodal Reasoning Models

Paper • 2502.16033 • Published Feb 22 • 18

CodeCriticBench: A Holistic Code Critique Benchmark for Large Language Models

Paper • 2502.16614 • Published Feb 23 • 27

Make LoRA Great Again: Boosting LoRA with Adaptive Singular Values and Mixture-of-Experts Optimization Alignment

Paper • 2502.16894 • Published Feb 24 • 29

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published Feb 19 • 69

Thus Spake Long-Context Large Language Model

Paper • 2502.17129 • Published Feb 24 • 73

KITAB-Bench: A Comprehensive Multi-Domain Benchmark for Arabic OCR and Document Understanding

Paper • 2502.14949 • Published Feb 20 • 8

upvoted a paper 9 months ago

Improving Text Embeddings for Smaller Language Models Using Contrastive Fine-tuning

Paper • 2408.00690 • Published Aug 1, 2024 • 26