SAMBIT CHAKRABORTY

sambitchakhf03

AI & ML interests

None yet

Recent Activity

upvoted a paper 5 days ago

Iterative Self-Training for Code Generation via Reinforced Re-Ranking

upvoted a paper 10 days ago

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

upvoted a paper 10 days ago

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

View all activity

Organizations

sambitchakhf03's activity

upvoted a paper 5 days ago

Iterative Self-Training for Code Generation via Reinforced Re-Ranking

Paper • 2504.09643 • Published 9 days ago • 34

upvoted 3 papers 10 days ago

VideoChat-R1: Enhancing Spatio-Temporal Perception via Reinforcement Fine-Tuning

Paper • 2504.06958 • Published 13 days ago • 10

Scaling Laws for Native Multimodal Models Scaling Laws for Native Multimodal Models

Paper • 2504.07951 • Published 12 days ago • 27

DeepSeek-R1 Thoughtology: Let's <think> about LLM Reasoning

Paper • 2504.07128 • Published 20 days ago • 81

upvoted a paper 12 days ago

DDT: Decoupled Diffusion Transformer

Paper • 2504.05741 • Published 14 days ago • 73

upvoted a paper 15 days ago

FreSca: Unveiling the Scaling Space in Diffusion Models

Paper • 2504.02154 • Published 20 days ago • 18

upvoted a paper 16 days ago

VideoScene: Distilling Video Diffusion Model to Generate 3D Scenes in One Step

Paper • 2504.01956 • Published 20 days ago • 40

upvoted 3 papers 17 days ago

GPT-ImgEval: A Comprehensive Benchmark for Diagnosing GPT4o in Image Generation

Paper • 2504.02782 • Published 19 days ago • 55

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 22 days ago • 252

ZClip: Adaptive Spike Mitigation for LLM Pre-Training

Paper • 2504.02507 • Published 19 days ago • 76

upvoted a paper 18 days ago

Smarter, Better, Faster, Longer: A Modern Bidirectional Encoder for Fast, Memory Efficient, and Long Context Finetuning and Inference

Paper • 2412.13663 • Published Dec 18, 2024 • 148

liked a model 18 days ago

sambitchakhf03/chatbox-llm-merged

Text Generation • Updated Aug 15, 2023 • 39 • 1

upvoted 2 papers about 1 month ago

Transformers without Normalization

Paper • 2503.10622 • Published Mar 13 • 158

Token-Efficient Long Video Understanding for Multimodal LLMs

Paper • 2503.04130 • Published Mar 6 • 94

upvoted 4 papers about 2 months ago

LLM-Microscope: Uncovering the Hidden Role of Punctuation in Context Memory of Transformers

Paper • 2502.15007 • Published Feb 20 • 175

SpargeAttn: Accurate Sparse Attention Accelerating Any Model Inference

Paper • 2502.18137 • Published Feb 25 • 56

Multilingual Machine Translation with Open Large Language Models at Practical Scale: An Empirical Study

Paper • 2502.02481 • Published Feb 4 • 13

Slamming: Training a Speech Language Model on One GPU in a Day

Paper • 2502.15814 • Published Feb 19 • 69

upvoted 2 papers 2 months ago

InfiniteHiP: Extending Language Model Context Up to 3 Million Tokens on a Single GPU

Paper • 2502.08910 • Published Feb 13 • 149

Scaling up Test-Time Compute with Latent Reasoning: A Recurrent Depth Approach

Paper • 2502.05171 • Published Feb 7 • 138