5 12 46

Jason Corkill

jasoncorkill

https://rapidata.ai

AI & ML interests

Human data annotation

Recent Activity

liked a dataset 9 days ago

Rapidata/2k-ranked-images-open-image-preferences-v1

reacted to their post with ❤️ 9 days ago

🚀 We tried something new! We just published a dataset using a new (for us) preference modality: direct ranking based on aesthetic preference. We ranked a couple of thousand images from most to least preferred, all sampled from the Open Image Preferences v1 dataset by the amazing @data-is-better-together team. 📊 Check it out here: https://huggingface.co/datasets/Rapidata/2k-ranked-images-open-image-preferences-v1 We're really curious to hear your thoughts! Is this kind of ranking interesting or useful to you? Let us know! 💬 If it is, please consider leaving a ❤️ and if we hit 30 ❤️s, we’ll go ahead and rank the full 17k image dataset!

reacted to their post with 🔥 9 days ago

View all activity

Organizations

jasoncorkill's activity

liked a dataset 9 days ago

Rapidata/2k-ranked-images-open-image-preferences-v1

Viewer • Updated 10 days ago • 2k • 152 • 18

reacted to their post with ❤️🔥🚀 9 days ago

Post

3212

🚀 We tried something new!

We just published a dataset using a new (for us) preference modality: direct ranking based on aesthetic preference. We ranked a couple of thousand images from most to least preferred, all sampled from the Open Image Preferences v1 dataset by the amazing @data-is-better-together team.

📊 Check it out here:
Rapidata/2k-ranked-images-open-image-preferences-v1

We're really curious to hear your thoughts!
Is this kind of ranking interesting or useful to you? Let us know! 💬

If it is, please consider leaving a ❤️ and if we hit 30 ❤️s, we’ll go ahead and rank the full 17k image dataset!

5 replies

replied to their post 9 days ago

@davanstrien might be interesting for you 🚀

posted an update 9 days ago

Post

3212

🚀 We tried something new!

We just published a dataset using a new (for us) preference modality: direct ranking based on aesthetic preference. We ranked a couple of thousand images from most to least preferred, all sampled from the Open Image Preferences v1 dataset by the amazing @data-is-better-together team.

📊 Check it out here:
Rapidata/2k-ranked-images-open-image-preferences-v1

We're really curious to hear your thoughts!
Is this kind of ranking interesting or useful to you? Let us know! 💬

If it is, please consider leaving a ❤️ and if we hit 30 ❤️s, we’ll go ahead and rank the full 17k image dataset!

5 replies

updated a dataset 10 days ago

Rapidata/2k-ranked-images-open-image-preferences-v1

Viewer • Updated 10 days ago • 2k • 152 • 18

reacted to their post with 🔥👀🚀 11 days ago

Post

3026

🔥 Yesterday was a fire day!
We dropped two brand-new datasets capturing Human Preferences for text-to-video and text-to-image generations powered by our own crowdsourcing tool!

Whether you're working on model evaluation, alignment, or fine-tuning, this is for you.

1. Text-to-Video Dataset (Pika 2.2 model):
Rapidata/text-2-video-human-preferences-pika2.2

2. Text-to-Image Dataset (Reve-AI Halfmoon):
Rapidata/Reve-AI-Halfmoon_t2i_human_preference

Let’s train AI on AI-generated content with humans in the loop.
Let’s make generative models that actually get us.

posted an update 11 days ago

Post

3026

liked 2 datasets 12 days ago

Rapidata/Reve-AI-Halfmoon_t2i_human_preference

Viewer • Updated 12 days ago • 13k • 183 • 7

Rapidata/text-2-video-human-preferences-pika2.2

Viewer • Updated 12 days ago • 1.68k • 281 • 8

reacted to their post with 👍🚀 17 days ago

Post

2728

We benchmarked @xai-org 's Aurora model, as far as we know the first public evaluation of the model at scale.

We collected 401k human annotations in over the past ~2 days for this, we have uploaded all of the annotation data here on huggingface with a fully permissive license
Rapidata/xAI_Aurora_t2i_human_preferences

1 reply

reacted to their post with 🚀🧠👀❤️ 17 days ago

Post

4656

Runway Gen-3 Alpha: The Style and Coherence Champion

Runway's latest video generation model, Gen-3 Alpha, is something special. It ranks #3 overall on our text-to-video human preference benchmark, but in terms of style and coherence, it outperforms even OpenAI Sora.

However, it struggles with alignment, making it less predictable for controlled outputs.

We've released a new dataset with human evaluations of Runway Gen-3 Alpha: Rapidata's text-2-video human preferences dataset. If you're working on video generation and want to see how your model compares to the biggest players, we can benchmark it for you.

🚀 DM us if you’re interested!

Dataset: Rapidata/text-2-video-human-preferences-runway-alpha

1 reply

reacted to their post with 🚀 17 days ago

Post

2556

This dataset was collected in roughly 4 hours using the Rapidata Python API, showcasing how quickly large-scale annotations can be performed with the right tooling!

All that at less than the cost of a single hour of a typical ML engineer in Zurich!

The new dataset of ~22,000 human annotations evaluating AI-generated videos based on different dimensions, such as Prompt-Video Alignment, Word for Word Prompt Alignment, Style, Speed of Time flow and Quality of Physics.

Rapidata/text-2-video-Rich-Human-Feedback

1 reply