1 7 1

Chao Feng

chfeng

AI & ML interests

None yet

Recent Activity

authored a paper 7 days ago

This&That: Language-Gesture Controlled Video Generation for Robot Planning

authored a paper 7 days ago

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

upvoted a paper 9 days ago

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

View all activity

Organizations

chfeng's activity

authored 2 papers 7 days ago

This&That: Language-Gesture Controlled Video Generation for Robot Planning

Paper • 2407.05530 • Published Jul 8, 2024 • 4

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published 10 days ago • 16

upvoted a paper 9 days ago

SoTA with Less: MCTS-Guided Sample Selection for Data-Efficient Visual Reasoning Self-Improvement

Paper • 2504.07934 • Published 10 days ago • 16

upvoted 2 papers about 1 month ago

ReCamMaster: Camera-Controlled Generative Rendering from A Single Video

Paper • 2503.11647 • Published Mar 14 • 134

TransPixar: Advancing Text-to-Video Generation with Transparency

Paper • 2501.03006 • Published Jan 6 • 27

updated a model 2 months ago

chfeng/Touch-LLM

Updated Feb 7 • 1

published a model 2 months ago

chfeng/Touch-LLM

Updated Feb 7 • 1

upvoted a paper 2 months ago

OmniHuman-1: Rethinking the Scaling-Up of One-Stage Conditioned Human Animation Models

Paper • 2502.01061 • Published Feb 3 • 213

authored a paper 3 months ago

GPS as a Control Signal for Image Generation

Paper • 2501.12390 • Published Jan 21 • 13

commented a paper 3 months ago

GPS as a Control Signal for Image Generation

Paper • 2501.12390 • Published Jan 21 • 13 •

upvoted a paper 3 months ago

GPS as a Control Signal for Image Generation

Paper • 2501.12390 • Published Jan 21 • 13

upvoted a paper 4 months ago

Motion Prompting: Controlling Video Generation with Motion Trajectories

Paper • 2412.02700 • Published Dec 3, 2024 • 15

liked a dataset 9 months ago

HuggingFaceM4/howto100m

Updated May 18, 2022 • 76 • 6

authored 5 papers about 1 year ago

AVA-AVD: Audio-Visual Speaker Diarization in the Wild

Paper • 2111.14448 • Published Nov 29, 2021

Self-Supervised Video Forensics by Audio-Visual Anomaly Detection

Paper • 2301.01767 • Published Jan 4, 2023

Knowledge Solver: Teaching LLMs to Search for Domain Knowledge from Knowledge Graphs

Paper • 2309.03118 • Published Sep 6, 2023 • 2

Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning

Paper • 2402.11690 • Published Feb 18, 2024 • 10

Binding Touch to Everything: Learning Unified Multimodal Tactile Representations

Paper • 2401.18084 • Published Jan 31, 2024

upvoted a paper about 1 year ago

Vision-Flan: Scaling Human-Labeled Tasks in Visual Instruction Tuning

Paper • 2402.11690 • Published Feb 18, 2024 • 10