Data, embedding, and index of MassiveDS by "Scaling Retrieval-Based Language Models with a Trillion-Token Datastore"
Rulin Shao
rulins
AI & ML interests
None yet
Recent Activity
updated
a dataset
about 20 hours ago
rulins/gsm_symbolic_p2_fewshot_eval_data
published
a dataset
about 20 hours ago
rulins/gsm_symbolic_p2_fewshot_eval_data
updated
a dataset
about 21 hours ago
rulins/gsm_symbolic_p2_eval_data
Organizations
Collections
1
models
3
datasets
49
rulins/gsm_symbolic_p2_fewshot_eval_data
Viewer
•
Updated
•
50
•
5
rulins/gsm_symbolic_p2_eval_data
Viewer
•
Updated
•
50
•
6
rulins/gsm_symbolic_p2_train_data
Updated
•
23
rulins/hybrid_factualqa_5_actions_sft_0418_v2
Viewer
•
Updated
•
5
•
8
rulins/hybrid_factualqa_5_actions_sft_0418
Viewer
•
Updated
•
2.75k
•
7
rulins/hotpotqa_8_actions_qwen2.5_32b_100_samples_filtered
Viewer
•
Updated
•
82
•
12
rulins/simpleqa_10_actions_llama3.3_79b_it
Viewer
•
Updated
•
4.33k
•
19
rulins/reasoning-v1-1m_rl_no_prompt
Viewer
•
Updated
•
1M
•
48
rulins/open_scholar_rl_no_prompt
Viewer
•
Updated
•
60.2k
•
38
rulins/aime24_train_data
Updated
•
109