CountGD

A Multi-Modal Open-World Counting Model for counting objects in an image with text and image prompts. For more details, please check out the following links

Sample prediction

Architecture

CountGD Architecture

Citation

@article{AminiNaieni24,
    author       = "Amini-Naieni, N. and Han, T. and Zisserman, A.",
    title        = "CountGD: Multi-Modal Open-World Counting",
    booktitle    = "arxiv",
    year         = "2024",
}
Downloads last month
14
Safetensors
Model size
234M params
Tensor type
I64
·
F32
·
Inference Providers NEW
This model isn't deployed by any Inference Provider. 🙋 Ask for provider support