Chinese University of Hong Kong, Shenzhen

university

https://www.cuhk.edu.cn/

AI & ML interests

NLP, CV

Recent Activity

guangyil authored a paper 4 months ago

Voila: Voice-Language Foundation Models for Real-Time Autonomous Interaction and Voice Role-Play

guangyil authored a paper 4 months ago

LLM-Powered GUI Agents in Phone Automation: Surveying Progress and Prospects

HarryHe

authored 2 papers 9 days ago

Overview of the Amphion Toolkit (v0.2)

Paper • 2501.15442 • Published Jan 26 • 3

Fact2Fiction: Targeted Poisoning Attack to Agentic Fact-checking System

Paper • 2508.06059 • Published 16 days ago • 4

cppppppc

authored a paper about 2 months ago

ShareGPT-4o-Image: Aligning Multimodal Models with GPT-4o-Level Image Generation

Paper • 2506.18095 • Published Jun 22 • 65

IranQin

authored 7 papers 5 months ago

MP5: A Multi-modal Open-ended Embodied System in Minecraft via Active Perception

Paper • 2312.07472 • Published Dec 12, 2023 • 2

SupFusion: Supervised LiDAR-Camera Fusion for 3D Object Detection

Paper • 2309.07084 • Published Sep 13, 2023 • 1

MineDreamer: Learning to Follow Instructions via Chain-of-Imagination for Simulated-World Control

Paper • 2403.12037 • Published Mar 18, 2024 • 1

WorldSimBench: Towards Video Generation Models as World Simulators

Paper • 2410.18072 • Published Oct 23, 2024 • 20

GameFactory: Creating New Games with Generative Interactive Videos

Paper • 2501.08325 • Published Jan 14 • 68

T2ISafety: Benchmark for Assessing Fairness, Toxicity, and Privacy in Image Generation

Paper • 2501.12612 • Published Jan 22

RoboFactory: Exploring Embodied Agent Collaboration with Compositional Constraints

Paper • 2503.16408 • Published Mar 20 • 41

SP4595

authored 2 papers 6 months ago

CLEA: Closed-Loop Embodied Agent for Enhancing Task Execution in Dynamic Environments

Paper • 2503.00729 • Published Mar 2 • 3

STMA: A Spatio-Temporal Memory Agent for Long-Horizon Embodied Task Planning

Paper • 2502.10177 • Published Feb 14 • 6

TobyYang7

authored a paper 7 months ago

TwinMarket: A Scalable Behavioral and Social Simulation for Financial Markets

Paper • 2502.01506 • Published Feb 3 • 39

HarryHe

authored 3 papers 7 months ago

Emilia: A Large-Scale, Extensive, Multilingual, and Diverse Dataset for Speech Generation

Paper • 2501.15907 • Published Jan 27 • 17

Emilia: An Extensive, Multilingual, and Diverse Speech Dataset for Large-Scale Speech Generation

Paper • 2407.05361 • Published Jul 7, 2024 • 2

SpMis: An Investigation of Synthetic Spoken Misinformation Detection

Paper • 2409.11308 • Published Sep 17, 2024

SP4595

authored a paper 10 months ago

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Paper • 2410.14059 • Published Oct 17, 2024 • 62

TobyYang7

authored a paper 10 months ago

UCFE: A User-Centric Financial Expertise Benchmark for Large Language Models

Paper • 2410.14059 • Published Oct 17, 2024 • 62

TobyYang7

authored a paper 12 months ago

Open-FinLLMs: Open Multimodal Large Language Models for Financial Applications

Paper • 2408.11878 • Published Aug 20, 2024 • 64

chongjie

authored a paper about 1 year ago

StableNormal: Reducing Diffusion Variance for Stable and Sharp Normal

Paper • 2406.16864 • Published Jun 24, 2024 • 3