kotoba-tech/kotoba-whisper-v1.0-ggml

Automatic Speech Recognition


asahi417 committed on May 7, 2024

Commit bfb854c (verified) · 1 Parent(s): cb58db6

Update README.md

Files changed (1)

README.md +6 -2


@@ -31,10 +31,15 @@ wget https://huggingface.co/kotoba-tech/kotoba-whisper-v1.0-ggml/resolve/main/gg
 3. Run inference using the provided sample audio:
 ```bash
-wget
 make -j && ./main -m models/ggml-kotoba-whisper-v1.0.bin -f sample_ja_speech.wav
 ```
 ### Quantized Model
 To use the quantized model, download the quantized GGML weights:
@@ -44,7 +49,6 @@ wget https://huggingface.co/kotoba-tech/kotoba-whisper-v1.0-ggml/resolve/main/gg
 Run inference on the sample audio:
 ```bash
-wget
 make -j && ./main -m models/ggml-kotoba-whisper-v1.0-q5_0.bin -f sample_ja_speech.wav
 ```

 3. Run inference using the provided sample audio:
 ```bash
+wget https://huggingface.co/kotoba-tech/kotoba-whisper-v1.0-ggml/resolve/main/sample_ja_speech.wav
 make -j && ./main -m models/ggml-kotoba-whisper-v1.0.bin -f sample_ja_speech.wav
 ```
+Note that it runs only with 16-bit WAV files, so make sure to convert your input before running the tool. For example, you can use ffmpeg like this:
+```
+ffmpeg -i input.mp3 -ar 16000 -ac 1 -c:a pcm_s16le output.wav
+```
 ### Quantized Model
 To use the quantized model, download the quantized GGML weights:
 Run inference on the sample audio:
 ```bash
 make -j && ./main -m models/ggml-kotoba-whisper-v1.0-q5_0.bin -f sample_ja_speech.wav
 ```
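
The note added in this commit asks for 16-bit WAV input and converts with `-ar 16000 -ac 1 -c:a pcm_s16le`. A minimal sketch of how a file could be checked against those parameters before running inference is shown below. The function names `wav_info` and `is_whisper_ready` are hypothetical and not part of the README or of whisper.cpp. The sketch assumes a little-endian host and that the `fmt ` chunk directly follows the RIFF header, which is the usual layout but is not guaranteed for every WAV file.

```bash
# Print "sample_rate channels bits_per_sample" read from a WAV header.
# Assumption: canonical layout, with the "fmt " chunk starting at byte 12,
# so channels sit at offset 22, sample rate at 24, bits per sample at 34.
# Assumption: little-endian host, since od decodes in host byte order.
wav_info() (
  f="$1"
  channels=$(od -An -tu2 -j22 -N2 "$f" | tr -d ' ')
  rate=$(od -An -tu4 -j24 -N4 "$f" | tr -d ' ')
  bits=$(od -An -tu2 -j34 -N2 "$f" | tr -d ' ')
  echo "$rate $channels $bits"
)

# Succeed only if the file matches the ffmpeg settings from the note:
# 16 kHz sample rate, mono, 16 bits per sample.
is_whisper_ready() {
  [ "$(wav_info "$1")" = "16000 1 16" ]
}
```

For example, `is_whisper_ready output.wav && ./main -m models/ggml-kotoba-whisper-v1.0.bin -f output.wav` would run inference only when the header matches. The check reads header fields only; it does not verify that the sample encoding is PCM or that the audio data is intact.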