Daily Papers

by AK and the research community

Apr 24

Submitted by

GenuineWWD

VisuLogic: A Benchmark for Evaluating Visual Reasoning in Multi-modal Large Language Models

·
13 authors

1

Submitted by

Alon77777

DreamID: High-Fidelity and Fast diffusion-based Face Swapping via Triplet ID Group Learning

·
8 authors

7

Submitted by

scottsuk0306

Trillion 7B Technical Report

·
8 authors

1

Submitted by

mhamilton723

I-Con: A Unifying Framework for Representation Learning

·
5 authors

1

Submitted by

upup-ashton-wang

Tina: Tiny Reasoning Models via LoRA

·
6 authors

3

Submitted by

StarThomas1002

PHYBench: Holistic Evaluation of Physical Perception and Reasoning in Large Language Models

·
52 authors

Submitted by

Swtheking

Pre-DPO: Improving Data Utilization in Direct Preference Optimization Using a Guiding Reference Model

·
5 authors

1

Submitted by

Kaichengalex

Decoupled Global-Local Alignment for Improving Compositional Understanding

·
6 authors

1

Submitted by

yanze

DreamO: A Unified Framework for Image Customization

·
15 authors

1

Submitted by

igitman

AIMO-2 Winning Solution: Building State-of-the-Art Mathematical Reasoning Models with OpenMathReasoning dataset

·
8 authors

Submitted by

Ningyu

A Comprehensive Survey in LLM(-Agent) Full Stack Safety: Data, Training and Deployment

·
82 authors

1

Submitted by

USTCYu

Rethinking the Generation of High-Quality CoT Data from the Perspective of LLM-Adaptive Question Difficulty Grading

·
10 authors

2

Submitted by

YanNeu

RePOPE: Impact of Annotation Errors on the POPE Benchmark

·
2 authors

1

Submitted by

mturski

Unchecked and Overlooked: Addressing the Checkbox Blind Spot in Large Language Models with CheckboxQA

·
3 authors

1

Submitted by

anirudhkhatry

CRUST-Bench: A Comprehensive Benchmark for C-to-safe-Rust Transpilation

·
7 authors

1

Submitted by

jcwang0602

Progressive Language-guided Visual Learning for Multi-Task Visual Grounding

·
6 authors

1

Submitted by

WenyiWU0111

Causal-Copilot: An Autonomous Causal Analysis Agent

·
13 authors

1