Tanel's picture
Update README.md
2390afa unverified
|
raw
history blame
506 Bytes
metadata
license: apache-2.0
language:
  - et
  - en
  - ru
pipeline_tag: automatic-speech-recognition

The is a Whisper large-v3 model finetuned to do Estonian-English and Estonian-Russian bidirectional speech translation.

You have to use the "transcribe" task and specify the target language ("et", "en" or "ru"). Source language doesn't have to be specified.

The model is trained on synthetic data (ASR data with machine translated transcripts) as well as some data scraped from the web (audio + subititles).