Ashish Mishra

ashbuilds

ashbuilds

AI & ML interests

None yet

Recent Activity

liked a model 17 days ago

openai/gpt-oss-120b

liked a model 22 days ago

CohereLabs/command-a-vision-07-2025

liked a model 22 days ago

rednote-hilab/dots.ocr

View all activity

Organizations

None yet

upvoted an article about 1 month ago

Article

Creating custom kernels for the AMD MI300

and 1 other •

Jul 9

• 43

upvoted a collection 3 months ago

Holo1

Collection

Vision-Language Action Model for use in Surfer-H web navigation agent • 6 items • Updated Jun 10 • 48

upvoted an article 3 months ago

Article

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H

and 1 other •

Jun 3

• 70

upvoted a collection 4 months ago

Qwen2.5-VL

Collection

Vision-language model series based on Qwen2.5 • 11 items • Updated Jul 21 • 528

upvoted a paper 4 months ago

CoRAG: Collaborative Retrieval-Augmented Generation

Paper • 2504.01883 • Published Apr 2 • 10

upvoted 2 papers 6 months ago

LLMVoX: Autoregressive Streaming Text-to-Speech Model for Any LLM

Paper • 2503.04724 • Published Mar 6 • 72

InterFeedback: Unveiling Interactive Intelligence of Large Multimodal Models via Human Feedback

Paper • 2502.15027 • Published Feb 20 • 7

upvoted 2 papers 7 months ago

Transformer^2: Self-adaptive LLMs

Paper • 2501.06252 • Published Jan 9 • 55

MinMo: A Multimodal Large Language Model for Seamless Voice Interaction

Paper • 2501.06282 • Published Jan 10 • 53

upvoted a collection 8 months ago

DeepSeek-V3

Collection

4 items • Updated Mar 25 • 278

upvoted a paper 8 months ago

DiffSensei: Bridging Multi-Modal LLMs and Diffusion Models for Customized Manga Generation

Paper • 2412.07589 • Published Dec 10, 2024 • 49

Ashish Mishra

AI & ML interests

Recent Activity

Organizations

ashbuilds's activity

Creating custom kernels for the AMD MI300

Holo1: New family of GUI automation VLMs powering GUI agent Surfer-H