HugGAN Community

non-profit

Activity Feed Request to join this org

AI & ML interests

GANs!

Recent Activity

DrishtiSharma authored a paper 2 days ago

Robust and Fine-Grained Detection of AI Generated Texts

gigant authored a paper 3 days ago

BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

gigant authored a paper 3 days ago

Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure

View all activity

huggan's activity

DrishtiSharma

authored a paper 2 days ago

Robust and Fine-Grained Detection of AI Generated Texts

Paper • 2504.11952 • Published 3 days ago • 9

clem

posted an update 2 days ago

Post

1259

You can now bill your inference costs from all our inference partners (together, fireworks, fal, sambanova, cerebras, hyperbolic,...) to your Hugging Face organization.

Useful to drive more company-wide usage of AI without the billing headaches!

1 reply

gigant

authored 2 papers 3 days ago

BigBIO: A Framework for Data-Centric Biomedical Natural Language Processing

Paper • 2206.15076 • Published Jun 30, 2022 • 4

Summarization of Multimodal Presentations with Vision-Language Models: Study of the Effect of Modalities and Structure

Paper • 2504.10049 • Published 5 days ago • 2

merve

posted an update 5 days ago

Post

3920

sooo many open AI releases past week, let's summarize! 🤗
merve/april-11-releases-67fcd78be33d241c0977b9d2

multimodal
> Moonshot AI released Kimi VL Thinking, first working open-source multimodal reasoning model and Kimi VL Instruct, both 16B MoEs with 3B active params (OS)
> InternVL3 released based on Qwen2.5VL, 7 ckpts with various sizes (1B to 78B)

LLMs
> NVIDIA released Llama-3_1-Nemotron-Ultra-253B-v1 an LLM built on Llama 405B for reasoning, chat and tool use
> Agentica released DeepCoder-14B-Preview, fine-tuned version of DeepSeek-R1-Distilled-Qwen-14B on problem-test pairs, along with the compiled dataset
> Zyphra/ZR1-1.5B is a new small reasoning LLM built on R1-Distill-1.5B (OS)
> Skywork-OR1-32B-Preview is a new reasoning model by Skywork

Image Generation
> HiDream releases three new models, HiDream I1 Dev, I1 Full, and I1 fast for image generation (OS)

*OS ones have Apache 2.0 or MIT licenses

4 replies

nouamanetazi

authored a paper 11 days ago

SmolVLM: Redefining small and efficient multimodal models

Paper • 2504.05299 • Published 12 days ago • 161

clem

posted an update 13 days ago

Post

2630

Llama 4 is in transformers!

Fun example using the instruction-tuned Maverick model responding about two images, using tensor parallel for maximum speed.

From https://huggingface.co/blog/llama4-release

1 reply

clem

posted an update 16 days ago

Post

1938

Llama models (arguably the most successful open AI models of all times) just represented 3% of total model downloads on Hugging Face in March.

People and media like stories of winner takes all & one model/company to rule them all but the reality is much more nuanced than this!

Kudos to all the small AI builders out there!

2 replies

clem

posted an update 17 days ago

Post

1331

Now in Enterprise Hub organizations, you can centralize your billing not only for HF usage but also inference through our inference partners.

Will prevent some headaches for your finance & accounting teams haha (so feel free to share that with them).

3 replies

clem

posted an update 18 days ago

Post

3972

Before 2020, most of the AI field was open and collaborative. For me, that was the key factor that accelerated scientific progress and made the impossible possible—just look at the “T” in ChatGPT, which comes from the Transformer architecture openly shared by Google.

Then came the myth that AI was too dangerous to share, and companies started optimizing for short-term revenue. That led many major AI labs and researchers to stop sharing and collaborating.

With OAI and sama now saying they're willing to share open weights again, we have a real chance to return to a golden age of AI progress and democratization—powered by openness and collaboration, in the US and around the world.

This is incredibly exciting. Let’s go, open science and open-source AI!

5 replies

clem

posted an update 22 days ago

Post

2395

What's this cool purple banner haha 😶😶😶

4 replies

osanseviero

authored a paper 22 days ago

Gemma 3 Technical Report

Paper • 2503.19786 • Published 25 days ago • 46

clem

posted an update 23 days ago

Post

2239

Very interesting security section by @yjernite @lvwerra @reach-vb @dvilasuero & the team replicating R1. Broadly applicable to most open-source models & some to APIs (but APIs have a lot more additional risks because you're not in control of the underlying system):

https://huggingface.co/blog/open-r1/update-4#is-it-safe

1 reply

clem

posted an update 24 days ago

Post

1572

A repository is created every ~15 secs on Hugging Face so @kramp added a "Getting Started" to make it easier & a model release checklist: https://huggingface.co/docs/hub/model-release-checklist

What are you uploading today?

1 reply

emre

authored a paper 26 days ago

BLOOM: A 176B-Parameter Open-Access Multilingual Language Model

Paper • 2211.05100 • Published Nov 9, 2022 • 31

emre

posted an update 26 days ago

Post

3320

having trouble with auto train
hello there this is the first time i am testing auto train with a 1.8k SFT dataset. Howevery i am not quite sure the training is going smooth. Logs seem quite confusing, token did not match can not auth, generates confusing train splits, do you know how i can check my running job properly?
what is being used for training as data?
any ideas?

1 reply

merve

posted an update 28 days ago

Post

4055

So many open releases at Hugging Face past week 🤯 recapping all here ⤵️ merve/march-21-releases-67dbe10e185f199e656140ae

👀 Multimodal
> Mistral AI released a 24B vision LM, both base and instruction FT versions, sota 🔥 (OS)
> with IBM we released SmolDocling, a sota 256M document parser with Apache 2.0 license (OS)
> SpatialLM is a new vision LM that outputs 3D bounding boxes, comes with 0.5B (QwenVL based) and 1B (Llama based) variants
> SkyWork released SkyWork-R1V-38B, new vision reasoning model (OS)

💬 LLMs
> NVIDIA released new Nemotron models in 49B and 8B with their post-training dataset
> LG released EXAONE, new reasoning models in 2.4B, 7.8B and 32B
> Dataset: Glaive AI released a new reasoning dataset of 22M+ examples
> Dataset: NVIDIA released new helpfulness dataset HelpSteer3
> Dataset: OpenManusRL is a new agent dataset based on ReAct framework (OS)
> Open-R1 team released OlympicCoder, new competitive coder model in 7B and 32B
> Dataset: GeneralThought-430K is a new reasoning dataset (OS)

🖼️ Image Generation/Computer Vision
> Roboflow released RF-DETR, new real-time sota object detector (OS) 🔥
> YOLOE is a new real-time zero-shot object detector with text and visual prompts 🥹
> Stability AI released Stable Virtual Camera, a new novel view synthesis model
> Tencent released Hunyuan3D-2mini, new small and fast 3D asset generation model
> ByteDance released InfiniteYou, new realistic photo generation model
> StarVector is a new 8B model that generates svg from images
> FlexWorld is a new model that expands 3D views (OS)

🎤 Audio
> Sesame released CSM-1B new speech generation model (OS)

🤖 Robotics
> NVIDIA released GR00T, new robotics model for generalized reasoning and skills, along with the dataset

*OS ones have Apache 2.0 or MIT license

clem

posted an update 30 days ago

Post

3716

Should we assemble affordable open-source robots at Hugging Face for the community. Would you buy them? At what price?

8 replies

clem

posted an update about 1 month ago

Post

2591

Nice new space to see how fast your personal or organization followers are growing on HF:
julien-c/follow-history

As you can see, I still have more followers than @julien-c even if he's trying to change this by building such cool spaces 😝😝😝

gigant

in huggan/wikiart about 1 month ago

Request: DOI

#4 opened about 1 month ago by

jianhuoyan

AI & ML interests

Recent Activity

Team members 96

huggan's activity

Request: DOI