MrDragonFox PRO

AI & ML interests

llm + audio i/o, (un)alignment

Recent Activity

updated a dataset about 10 hours ago
SynthoCraft/baddy_rawer_then_raw
published a dataset about 12 hours ago
SynthoCraft/baddy_rawer_then_raw

Organizations

DeepGHS, Blog-explorers, SynthoCraft Ai, FoxEngineAi, Social Post Explorers, Mistral AI Game Jam

Posts 3

Post
as a few of you know, i am working on a more elaborate tts that can produce more interesting sounds in the context of rp

an early sneak peek is here:

MrDragonFox/mOrpheus_3B-1Base_early_preview-v1-25000

it's based on orpheus, but really the model is irrelevant as i focus mostly on data augmentation / prep / pipelining - it's just a way to show progress

it should be able to express well even in a sfw context

this is probably the last release for a few weeks as i go back to the data pipeline and improve things there

in the meantime, please do test and report problems or enjoyable generations you find - we have a growing discord community and i'd love to see what you get out of this early release!

(a small colab is provided on the model page if you don't have the gpu to run it yourself)
Post
yet another audio dataset, pre-classified for events + audio aesthetics

this time for german - 680h sampled from emilia yodas

timestamps for asr training or other fancier things are available under an nc license in the raw repo

MrDragonFox/DE_Emilia_Yodas_680h

cc by 4.0, same as emilia yodas

raw events / transcriptions are cc by-nc 4.0

MrDragonFox/DE_Emilia_Yodas_680h_raw_timestamps

in the coming days i should push about 600h of english + some japanese too, in the same format
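
for anyone who wants to poke at the two german repos from code, here is a minimal sketch. the repo ids and licenses are the ones stated in the post; the helper names (`allows_commercial_use`, `peek_features`) are hypothetical, and the post does not document the column layout, so the sketch only asks the hub for whatever feature schema the repo declares instead of assuming field names. `peek_features` needs the `datasets` package and network access.

```python
# Repo ids and licenses exactly as stated in the post.
REPOS = {
    "MrDragonFox/DE_Emilia_Yodas_680h": "CC BY 4.0",
    "MrDragonFox/DE_Emilia_Yodas_680h_raw_timestamps": "CC BY-NC 4.0",
}


def allows_commercial_use(repo_id: str) -> bool:
    """Hypothetical helper: True if the stated license has no NonCommercial clause."""
    return "NC" not in REPOS[repo_id]


def peek_features(repo_id: str):
    """Hypothetical helper: fetch the declared feature schema without downloading audio.

    Requires `pip install datasets` and network access. The result may be None
    if the repo does not declare its features; the column names are not
    documented in the post, so nothing here assumes them.
    """
    from datasets import load_dataset_builder  # imported lazily on purpose

    return load_dataset_builder(repo_id).info.features


# Example usage (needs network, so it is left commented out):
# for repo_id, license_name in REPOS.items():
#     print(repo_id, license_name, allows_commercial_use(repo_id))
#     print(peek_features(repo_id))
```

checking the license split in code before mixing the raw timestamps into a training set is cheap, and the lazy import keeps the license check usable even where `datasets` is not installed.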