IPBench: Benchmarking the Knowledge of Large Language Models in Intellectual Property Paper • 2504.15524 • Published 1 day ago • 3
IV-Bench: A Benchmark for Image-Grounded Video Perception and Reasoning in Multimodal LLMs Paper • 2504.15415 • Published 2 days ago • 15
SuperGPQA: Scaling LLM Evaluation across 285 Graduate Disciplines Paper • 2502.14739 • Published Feb 20 • 103
LLMs for Patent Collection Researches (Topic: LLMs4Patent) collection of Qiyao Wang. • 8 items • Updated about 13 hours ago • 1
Long Context is Not Long at All: A Prospector of Long-Dependency Data for Large Language Models Paper • 2405.17915 • Published May 28, 2024 • 2
Leave No Document Behind: Benchmarking Long-Context LLMs with Extended Multi-Doc QA Paper • 2406.17419 • Published Jun 25, 2024 • 17
MMEvol: Empowering Multimodal Large Language Models with Evol-Instruct Paper • 2409.05840 • Published Sep 9, 2024 • 49
Ruler: A Model-Agnostic Method to Control Generated Length for Large Language Models Paper • 2409.18943 • Published Sep 27, 2024 • 30
Next Token Prediction Towards Multimodal Intelligence: A Comprehensive Survey Paper • 2412.18619 • Published Dec 16, 2024 • 58
AutoPatent: A Multi-Agent Framework for Automatic Patent Generation Paper • 2412.09796 • Published Dec 13, 2024 • 1
IPEval: A Bilingual Intellectual Property Agency Consultation Evaluation Benchmark for Large Language Models Paper • 2406.12386 • Published Jun 18, 2024 • 1