Adriel Martins's picture

Adriel Martins

Martins6

·

https://github.com/Martins6

Martins6

AI & ML interests

Graph Neural Networks (GNN) & Robot Learning & Multimodal AI

Recent Activity

liked a model 6 days ago

ByteDance-Seed/UI-TARS-1.5-7B

liked a model 9 days ago

CAMB-AI/MARS5-TTS

liked a model 11 days ago

thomasgauthier/csm-1b-hf

View all activity

Organizations

None yet

Martins6's activity

upvoted a paper 16 days ago

Executable Code Actions Elicit Better LLM Agents

Paper • 2402.01030 • Published Feb 1, 2024 • 127

upvoted 3 collections about 2 months ago

Dinov2

5 items • Updated Jan 16, 2024 • 18

SmolVLM2 📺 Smallest video LM ever 🤏🏻

11 items • Updated Feb 25 • 82

SmolLM2

State-of-the-art compact LLMs for on-device applications: 1.7B, 360M, 135M • 16 items • Updated Feb 20 • 255

upvoted a paper about 2 months ago

SigLIP 2: Multilingual Vision-Language Encoders with Improved Semantic Understanding, Localization, and Dense Features

Paper • 2502.14786 • Published Feb 20 • 143

upvoted a collection about 2 months ago

SigLIP2

36 items • Updated 20 days ago • 67

upvoted an article 2 months ago

Article

π0 and π0-FAST: Vision-Language-Action Models for General Robot Control

Feb 4

• 144

upvoted a collection 2 months ago

Phi-3

Phi-3 family of small language and multi-modal models. Language models are available in short- and long-context lengths. • 26 items • Updated 6 days ago • 566

upvoted a paper 2 months ago

SmolLM2: When Smol Goes Big -- Data-Centric Training of a Small Language Model

Paper • 2502.02737 • Published Feb 4 • 226

upvoted 2 articles 3 months ago

Article

Open-source DeepResearch – Freeing our search agents

Feb 4

• 1.22k

Article

Open-R1: a fully open reproduction of DeepSeek-R1

Jan 28

• 845

upvoted a collection 6 months ago

Open LLM Leaderboard best models ❤️‍🔥

A daily uploaded list of models with best evaluations on the LLM leaderboard: • 65 items • Updated Mar 20 • 582

upvoted a collection 7 months ago

LLaVA-Video

Models focus on video understanding (previously known as LLaVA-NeXT-Video). • 8 items • Updated Feb 21 • 60

upvoted a paper 8 months ago

Building and better understanding vision-language models: insights and future directions

Paper • 2408.12637 • Published Aug 22, 2024 • 130

upvoted 2 papers over 1 year ago

DocLLM: A layout-aware generative language model for multimodal document understanding

Paper • 2401.00908 • Published Dec 31, 2023 • 184

Improving Text Embeddings with Large Language Models

Paper • 2401.00368 • Published Dec 31, 2023 • 81

upvoted a collection over 1 year ago

📦 3D creation workflow

Going from a text prompt to a nice 3D model • 3 items • Updated 13 days ago • 30

upvoted 2 papers over 1 year ago

VR-NeRF: High-Fidelity Virtualized Walkable Spaces

Paper • 2311.02542 • Published Nov 5, 2023 • 19

Connecting Large Language Models with Evolutionary Algorithms Yields Powerful Prompt Optimizers

Paper • 2309.08532 • Published Sep 15, 2023 • 53