Is it possible to prompt the model with instructions, like Unmute?

#1
by deathknight0 - opened

Thank you for your release. Just wondering if this repo supports prompting the model natively - like what you implemented in Unmute.

Kyutai org

Hi, what would that mean for a speech-to-text? It is possible to feed in a prefix of audio + transcript which would condition the model to transcribe words in certain ways. For that you'd want to hack around in the Pytorch implementation, see here, we don't provide a high-level interface for that atm. But if you mean something like "transcribe every proper name in upper case", that is not possible.

Sign up or log in to comment