5 412

Literate Goggles

literate-goggles

AI & ML interests

None yet

Recent Activity

upvoted a paper 1 day ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

upvoted a paper 6 days ago

D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation

upvoted a paper 7 days ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

View all activity

Organizations

None yet

literate-goggles's activity

upvoted a paper 1 day ago

Does Reinforcement Learning Really Incentivize Reasoning Capacity in LLMs Beyond the Base Model?

Paper • 2504.13837 • Published 4 days ago • 84

upvoted a paper 6 days ago

D^2iT: Dynamic Diffusion Transformer for Accurate Image Generation

Paper • 2504.09454 • Published 10 days ago • 11

upvoted a paper 7 days ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published 11 days ago • 47

upvoted a paper 12 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 15 days ago • 73

upvoted an article 13 days ago

Article

Hugging Face and Cloudflare Partner to Make Real-Time Speech and Video Seamless with FastRTC

14 days ago

• 21

upvoted an article 19 days ago

Article

The NLP Course is becoming the LLM Course!

20 days ago

• 81

upvoted a paper 20 days ago

MergeVQ: A Unified Framework for Visual Generation and Representation with Disentangled Token Merging and Quantization

Paper • 2504.00999 • Published 21 days ago • 83

upvoted a paper 26 days ago

Gemini Robotics: Bringing AI into the Physical World

Paper • 2503.20020 • Published 28 days ago • 24

upvoted a paper 29 days ago

Bridging Continuous and Discrete Tokens for Autoregressive Visual Generation

Paper • 2503.16430 • Published Mar 20 • 35

upvoted 4 papers about 1 month ago

upvoted a paper about 2 months ago

OmniAlign-V: Towards Enhanced Alignment of MLLMs with Human Preference

Paper • 2502.18411 • Published Feb 25 • 73

upvoted an article about 2 months ago

Article

SigLIP 2: A better multilingual vision language encoder

Feb 21

• 153

upvoted a paper about 2 months ago

MLGym: A New Framework and Benchmark for Advancing AI Research Agents

Paper • 2502.14499 • Published Feb 20 • 191

upvoted 4 papers 2 months ago

Meta Audiobox Aesthetics: Unified Automatic Quality Assessment for Speech, Music, and Sound

Paper • 2502.05139 • Published Feb 7 • 1

SongGen: A Single Stage Auto-regressive Transformer for Text-to-Song Generation

Paper • 2502.13128 • Published Feb 18 • 42

Soundwave: Less is More for Speech-Text Alignment in LLMs

Paper • 2502.12900 • Published Feb 18 • 85

Learning Getting-Up Policies for Real-World Humanoid Robots

Paper • 2502.12152 • Published Feb 17 • 42