LAUNCH Lab

Enterprise

university

https://launch.eecs.umich.edu/

launchnlp

launchnlp

AI & ML interests

Factuality, reasoning, alignment, LLM applications

Recent Activity

JieRuan new activity about 22 hours ago

launch/ExpertLongBench:Change ordering and remove columns from T3

zkjzou updated a dataset 1 day ago

launch/ManyICLBench

JieRuan new activity 3 days ago

launch/ExpertLongBench:Add task category and relevant tags

View all activity

Collections 1

spaces 4

ExpertLongBench

Leaderboard for ExpertLongBench

FactRBench

View and analyze long-form factuality leaderboard

MLRC-BENCH

Display model performance metrics

Factbench

Display a leaderboard for evaluating language model factuality

models 4

launch/ThinkPRM-7B

Text Generation • Updated 28 days ago • 111

launch/ThinkPRM-14B

Text Generation • Updated May 12 • 93 • 3

launch/ThinkPRM-1.5B

Text Generation • Updated May 12 • 1.31k • 2

launch/POLITICS

Fill-Mask • Updated Apr 13 • 171 • 13

datasets 11

launch/ExpertLongBench

Preview • Updated about 22 hours ago • 257 • 6

launch/ManyICLBench

Viewer • Updated 1 day ago • 9.6k • 44

launch/FactRBench

Viewer • Updated 6 days ago • 1.06k • 52

launch/FactBench

Viewer • Updated 6 days ago • 1k • 56 • 3

launch/thinkprm-1K-verification-cots

Viewer • Updated Apr 26 • 1k • 168 • 5

launch/CLASH

Viewer • Updated Apr 16 • 345 • 89

launch/gov_report

Viewer • Updated Nov 9, 2022 • 58.4k • 377 • 7

launch/gov_report_qs

Viewer • Updated Nov 9, 2022 • 7.87k • 167 • 2

launch/open_question_type

Viewer • Updated Nov 9, 2022 • 4.96k • 149 • 4

launch/reddit_qg

Viewer • Updated Nov 9, 2022 • 720k • 123