Victor Mustar's picture

Victor Mustar PRO

victor

·

victormustar

AI & ML interests

Building the UX of this website

Recent Activity

upvoted a paper about 14 hours ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

liked a model about 18 hours ago

lmstudio-community/granite-3.3-2b-instruct-GGUF

updated a Space about 18 hours ago

victor/spaces-trending

View all activity

Organizations

victor's activity

upvoted a paper about 14 hours ago

ReTool: Reinforcement Learning for Strategic Tool Use in LLMs

Paper • 2504.11536 • Published 3 days ago • 33

upvoted 2 papers 1 day ago

xVerify: Efficient Answer Verifier for Reasoning Model Evaluations

Paper • 2504.10481 • Published 4 days ago • 73

RealHarm: A Collection of Real-World Language Model Application Failures

Paper • 2504.10277 • Published 4 days ago • 10

upvoted a paper 3 days ago

PRIMA.CPP: Speeding Up 70B-Scale LLM Inference on Low-Resource Everyday Home Clusters

Paper • 2504.08791 • Published 11 days ago • 110

upvoted a collection 3 days ago

GLM-4-0414

GLM-4-0414 series model • 8 items • Updated 3 days ago • 98

upvoted a paper 4 days ago

Seaweed-7B: Cost-Effective Training of Video Generation Foundation Model

Paper • 2504.08685 • Published 7 days ago • 113

upvoted a paper 8 days ago

FantasyTalking: Realistic Talking Portrait Generation via Coherent Motion Synthesis

Paper • 2504.04842 • Published 11 days ago • 30

upvoted a paper 9 days ago

OmniSVG: A Unified Scalable Vector Graphics Generation Model

Paper • 2504.06263 • Published 10 days ago • 143

upvoted a collection 10 days ago

Cogito v1 Preview

5 items • Updated 10 days ago • 98

upvoted a paper 10 days ago

Multi-SWE-bench: A Multilingual Benchmark for Issue Resolving

Paper • 2504.02605 • Published 15 days ago • 43

upvoted a paper 11 days ago

Advances and Challenges in Foundation Agents: From Brain-Inspired Intelligence to Evolutionary, Collaborative, and Safe Systems

Paper • 2504.01990 • Published 18 days ago • 242

upvoted 2 collections 11 days ago

OneSQL-v0.1-Qwen

Text-to-SQL model • 15 items • Updated 14 days ago • 4

SANA-Sprint

🏃SANA-Sprint: One-Step Diffusion with Continuous-Time Consistency Distillation • 6 items • Updated 1 day ago • 35

upvoted a collection 15 days ago

Gemma 3 QAT

Quantization Aware Trained (QAT) Gemma 3 checkpoints. The model preserves similar quality as half precision while using 3x less memory • 8 items • Updated 15 days ago • 117

upvoted 2 papers 17 days ago

TextCrafter: Accurately Rendering Multiple Texts in Complex Visual Scenes

Paper • 2503.23461 • Published 19 days ago • 93

Transformers Use Causal World Models in Maze-Solving Tasks

Paper • 2412.11867 • Published Dec 16, 2024 • 1

upvoted 3 papers 21 days ago

Dita: Scaling Diffusion Transformer for Generalist Vision-Language-Action Policy

Paper • 2503.19757 • Published 24 days ago • 50

Qwen2.5-Omni Technical Report

Paper • 2503.20215 • Published 23 days ago • 138

Cube: A Roblox View of 3D Intelligence

Paper • 2503.15475 • Published 30 days ago • 28