1 192 160

Mohammed Brıman

mohammedbriman

AI & ML interests

Machine Learning, Deep Learning, Natural Language Processing, Natural Language Generation, Computer Vision

Recent Activity

liked a model 4 days ago

ytu-ce-cosmos/previous-token-prediction-turkish-gpt2-large

updated a collection 6 days ago

To read... eventually

upvoted a paper 6 days ago

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

View all activity

Organizations

None yet

mohammedbriman's activity

upvoted a paper 6 days ago

M1: Towards Scalable Test-Time Compute with Mamba Reasoning Models

Paper • 2504.10449 • Published 6 days ago • 8

upvoted a paper 13 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 13 days ago • 163

upvoted a paper 19 days ago

Command A: An Enterprise-Ready Large Language Model

Paper • 2504.00698 • Published 20 days ago • 24

upvoted an article 25 days ago

Article

Training and Finetuning Reranker Models with Sentence Transformers v4

26 days ago

• 112

upvoted 2 papers 28 days ago

Reinforcement Learning for Reasoning in Small LLMs: What Works and What Doesn't

Paper • 2503.16219 • Published Mar 20 • 46

Survey on Evaluation of LLM-based Agents

Paper • 2503.16416 • Published Mar 20 • 88

upvoted an article about 1 month ago

Article

Welcome Gemma 3: Google's all new multimodal, multilingual, long context open LLM

Mar 12

• 392

upvoted 3 papers about 1 month ago

upvoted 7 papers 3 months ago

Streaming DiLoCo with overlapping communication: Towards a Distributed Free Lunch

Paper • 2501.18512 • Published Jan 30 • 30

DeepSeek-R1: Incentivizing Reasoning Capability in LLMs via Reinforcement Learning

Paper • 2501.12948 • Published Jan 22 • 383

FAST: Efficient Action Tokenization for Vision-Language-Action Models

Paper • 2501.09747 • Published Jan 16 • 23

Cut Your Losses in Large-Vocabulary Language Models

Paper • 2411.09009 • Published Nov 13, 2024 • 49

Titans: Learning to Memorize at Test Time

Paper • 2501.00663 • Published Dec 31, 2024 • 22

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 55

rStar-Math: Small LLMs Can Master Math Reasoning with Self-Evolved Deep Thinking

Paper • 2501.04519 • Published Jan 8 • 276

upvoted 2 papers 4 months ago

Monolith: Real Time Recommendation System With Collisionless Embedding Table

Paper • 2209.07663 • Published Sep 16, 2022 • 1

Human-Timescale Adaptation in an Open-Ended Task Space

Paper • 2301.07608 • Published Jan 18, 2023 • 1