Datasets and models associated with the paper "Large-Scale Data Selection for Instruction Tuning" (https://arxiv.org/abs/2503.01807)
Hamish Ivison
hamishivi
AI & ML interests
NLP :)
Recent Activity
updated
a dataset
1 day ago
hamishivi/tulu_3_thinker_test
published
a dataset
1 day ago
hamishivi/tulu_3_thinker_test
updated
a dataset
2 days ago
ai2-adapt-dev/eurus2_ground_truth_with_random_bucketed_length
Organizations
models
34

hamishivi/s1k_seq_orig_hyper__42__1740446762
Updated

hamishivi/tulu_3_long_finetune_qwen_7b_reg_system_prompt
Updated

hamishivi/tulu-2-wildchat-326k-sft
Updated
•
3

hamishivi/tulu-2-arena-hard-326k-sft
Updated
•
2

hamishivi/llama-3.1-tulu-3-arena-hard-939k-sft
Updated
•
10

hamishivi/llama-3.1-tulu-3-multitask-rrmax-939k-sft
Updated
•
2

hamishivi/tulu-2-multitask-rrmax-326k-sft
Updated
•
2

hamishivi/qwen2_math_tokenizer_tweaked
Updated

hamishivi/0224_jupiter_hamish_grpo_tulu3_s1k_orz_30350
Updated
•
1

hamishivi/0224_jupiter_hamish_grpo_s1k_only_orz_24021
Updated
•
1
datasets
51
hamishivi/tulu_3_thinker_test
Viewer
•
Updated
•
909
•
13
hamishivi/SimpleQA-RLVR
Viewer
•
Updated
•
4.33k
•
161
hamishivi/2wiki_rlvr
Viewer
•
Updated
•
15.3k
•
32
hamishivi/tqa_rlvr
Viewer
•
Updated
•
156k
•
31
hamishivi/nq_rlvr
Viewer
•
Updated
•
91.5k
•
36
hamishivi/hotpotqa_rlvr
Viewer
•
Updated
•
97.9k
•
34
hamishivi/SimpleQA-RLVR-noprompt
Viewer
•
Updated
•
4.33k
•
51
hamishivi/simpleqa_5_actions_llama3.3_70b_it
Viewer
•
Updated
•
4.33k
•
30
hamishivi/simpleqa_10_actions_llama3.3_70b_it
Viewer
•
Updated
•
1.03k
•
30
hamishivi/GeneralThought-430K-filtered-thinker
Viewer
•
Updated
•
296k
•
89