DJ Sri Vigneshwar's picture

DJ Sri Vigneshwar PRO

Sri-Vigneshwar-DJ

·

https://hawky.ai/

AI & ML interests

Co-Founder, CTO @Hawky.ai - Creative Intelligence for Performance Marketing

Recent Activity

liked a model 11 days ago

logasja/FaceNet512

liked a model 11 days ago

zai-org/GLM-4.5V

upvoted a paper 12 days ago

OpenVLA: An Open-Source Vision-Language-Action Model

View all activity

Organizations

upvoted a paper 12 days ago

OpenVLA: An Open-Source Vision-Language-Action Model

Paper • 2406.09246 • Published Jun 13, 2024 • 42

upvoted an article about 1 month ago

Article

Arc Virtual Cell Challenge: A Primer

By

and 1 other •

Jul 18

• 54

upvoted an article about 2 months ago

Article

Reachy Mini - The Open-Source Robot for Today's and Tomorrow's AI Builders

By

and 1 other •

Jul 9

• 654

upvoted an article 2 months ago

Article

🤔👀🎬🖥️📖 Kimi-VL-A3B-Thinking-2506: A Quick Navigation

By

and 1 other •

Jun 21

• 67

upvoted an article 5 months ago

Article

SmolVLM2: Bringing Video Understanding to Every Device

By

and 6 others •

Feb 20

• 295

upvoted a paper 8 months ago

LSceneLLM: Enhancing Large 3D Scene Understanding Using Adaptive Visual Preferences

Paper • 2412.01292 • Published Dec 2, 2024 • 13

upvoted a collection 9 months ago

🖼️ MLLMs

39 items • Updated 27 days ago • 12

upvoted an article 9 months ago

Article

Releasing the largest multilingual open pretraining dataset

By

and 2 others •

Nov 13, 2024

• 102

upvoted a paper 11 months ago

SaRA: High-Efficient Diffusion Model Fine-tuning with Progressive Sparse Low-Rank Adaptation

Paper • 2409.06633 • Published Sep 10, 2024 • 15

upvoted 7 papers 12 months ago

Meta Flow Matching: Integrating Vector Fields on the Wasserstein Manifold

Paper • 2408.14608 • Published Aug 26, 2024 • 8

3D Reconstruction with Spatial Memory

Paper • 2408.16061 • Published Aug 28, 2024 • 15

SAM2Point: Segment Any 3D as Videos in Zero-shot and Promptable Manners

Paper • 2408.16768 • Published Aug 29, 2024 • 29

ReconX: Reconstruct Any Scene from Sparse Views with Video Diffusion Model

Paper • 2408.16767 • Published Aug 29, 2024 • 33

WavTokenizer: an Efficient Acoustic Discrete Codec Tokenizer for Audio Language Modeling

Paper • 2408.16532 • Published Aug 29, 2024 • 52

Law of Vision Representation in MLLMs

Paper • 2408.16357 • Published Aug 29, 2024 • 96

Scalable High-Resolution Pixel-Space Image Synthesis with Hourglass Diffusion Transformers

Paper • 2401.11605 • Published Jan 21, 2024 • 23

upvoted 2 articles about 1 year ago

Article

Making LLMs even more accessible with bitsandbytes, 4-bit quantization and QLoRA

By

and 4 others •

May 24, 2023

• 162

Article

Google releases Gemma 2 2B, ShieldGemma and Gemma Scope

By

and 3 others •

Jul 31, 2024

• 60