5 2 7

duk guo

dukkkk

AI & ML interests

None yet

Recent Activity

liked a model 28 days ago

Qwen/Qwen2.5-Omni-7B

liked a Space 28 days ago

Qwen/Qwen2.5-Omni-7B-Demo

upvoted a collection 28 days ago

Qwen2.5-Omni

View all activity

Organizations

dukkkk's activity

liked a model 28 days ago

Qwen/Qwen2.5-Omni-7B

Any-to-Any • Updated 8 days ago • 202k • 1.46k

liked a Space 28 days ago

266

Qwen2.5 Omni 7B Demo

🏆

Generate text and speech responses from text, images, or audio input

upvoted a collection 28 days ago

Qwen2.5-Omni

Collection

End-to-End Omni (text, audio, image, video, and natural speech interaction) model based Qwen2.5 • 3 items • Updated 27 days ago • 90

upvoted a paper 4 months ago

Qwen2.5 Technical Report

Paper • 2412.15115 • Published Dec 19, 2024 • 365

liked a dataset 5 months ago

mythicinfinity/Libriheavy-HQ

Viewer • Updated Jul 13, 2024 • 123k • 156 • 5

liked a dataset 7 months ago

zqning/RapBank

Viewer • Updated Sep 13, 2024 • 94.2k • 183 • 7

updated a dataset 9 months ago

Wenetspeech4TTS/WenetSpeech4TTS

Updated Jul 25, 2024 • 1.41k • 73

authored 4 papers 9 months ago

Text-aware and Context-aware Expressive Audiobook Speech Synthesis

Paper • 2406.05672 • Published Jun 9, 2024

WenetSpeech4TTS: A 12,800-hour Mandarin TTS Corpus for Large Speech Generation Model Benchmark

Paper • 2406.05763 • Published Jun 9, 2024

HiGNN-TTS: Hierarchical Prosody Modeling with Graph Neural Networks for Expressive Long-form TTS

Paper • 2309.13907 • Published Sep 25, 2023

Automatic channel selection and spatial feature integration for multi-channel speech recognition across various array topologies

Paper • 2312.09746 • Published Dec 15, 2023

New activity in Wenetspeech4TTS/Audiodec-Valle-Wenetspeech4TTS 10 months ago

Add model card

#1 opened 10 months ago by

AdinaY

updated a model 10 months ago

Wenetspeech4TTS/Audiodec-Valle-Wenetspeech4TTS

Updated Jun 20, 2024 • 8

liked 2 models 11 months ago

Wenetspeech4TTS/Audiodec-Valle-Wenetspeech4TTS

Updated Jun 20, 2024 • 8

laion/larger_clap_music_and_speech

Feature Extraction • Updated Oct 31, 2023 • 25.9k • 27

New activity in Wenetspeech4TTS/WenetSpeech4TTS 12 months ago

Can we get the speaker id from audio file's name

#3 opened 12 months ago by

yunfengwang

liked a dataset 12 months ago

Wenetspeech4TTS/WenetSpeech4TTS

Updated Jul 25, 2024 • 1.41k • 73

New activity in Wenetspeech4TTS/WenetSpeech4TTS 12 months ago

Create WenetSpeech4TTS.py

#2 opened 12 months ago by

Bakerbunker