9 130 156

Emanuele Vivoli

emanuelevivoli

https://emanuelevivoli.github.io

AI & ML interests

I work on Comics/Manga :)

Recent Activity

upvoted a paper 6 days ago

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

upvoted a paper 6 days ago

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

upvoted a paper 9 days ago

Kimi-VL Technical Report

View all activity

Organizations

emanuelevivoli's activity

upvoted 2 papers 6 days ago

AnimeGamer: Infinite Anime Life Simulation with Next Game State Prediction

Paper • 2504.01014 • Published 21 days ago • 64

GigaTok: Scaling Visual Tokenizers to 3 Billion Parameters for Autoregressive Image Generation

Paper • 2504.08736 • Published 11 days ago • 47

upvoted a paper 9 days ago

Kimi-VL Technical Report

Paper • 2504.07491 • Published 13 days ago • 118

upvoted a paper 15 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 15 days ago • 168

liked a model 20 days ago

andreagemelli/Phi-3.5-mini-thinking-function_calling-V0

Updated Feb 22 • 1

updated a dataset 22 days ago

VLR-CVC/ComicsPAP

Viewer • Updated 16 days ago • 80.6k • 1.31k • 13

upvoted a paper 22 days ago

Your ViT is Secretly an Image Segmentation Model

Paper • 2503.19108 • Published 29 days ago • 21

New activity in ragavsachdeva/magi 22 days ago

Add HungarianMatcher

#2 opened 22 days ago by

emanuelevivoli

upvoted a paper 25 days ago

Video-R1: Reinforcing Video Reasoning in MLLMs

Paper • 2503.21776 • Published 26 days ago • 78

upvoted a paper 26 days ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published 29 days ago • 47

upvoted a paper about 1 month ago

Vamba: Understanding Hour-Long Videos with Hybrid Mamba-Transformers

Paper • 2503.11579 • Published Mar 14 • 19

upvoted a collection about 1 month ago

Comics Pick-A-Panel

Collection

Dataset, Models and Paper from ComicsPAP: understanding comic strips by picking the correct panel • 4 items • Updated Mar 14 • 3

authored 2 papers about 1 month ago

HoloMine: A Synthetic Dataset for Buried Landmines Recognition using Microwave Holographic Imaging

Paper • 2502.21054 • Published Feb 28

ComicsPAP: understanding comic strips by picking the correct panel

Paper • 2503.08561 • Published Mar 11 • 2

upvoted 3 papers about 1 month ago

R1-Omni: Explainable Omni-Multimodal Emotion Recognition with Reinforcing Learning

Paper • 2503.05379 • Published Mar 7 • 36

R1-Zero's "Aha Moment" in Visual Reasoning on a 2B Non-SFT Model

Paper • 2503.05132 • Published Mar 7 • 57

Vision-R1: Incentivizing Reasoning Capability in Multimodal Large Language Models

Paper • 2503.06749 • Published Mar 9 • 29

liked 2 models about 2 months ago

HuggingFaceTB/SmolVLM2-2.2B-Instruct

Image-Text-to-Text • Updated 15 days ago • 47.7k • 162

microsoft/Phi-4-multimodal-instruct

Automatic Speech Recognition • Updated 14 days ago • 619k • 1.32k

upvoted a paper about 2 months ago

Phi-4-Mini Technical Report: Compact yet Powerful Multimodal Language Models via Mixture-of-LoRAs

Paper • 2503.01743 • Published Mar 3 • 84