Commit
·
350dbcc
1
Parent(s):
13956bc
Update README.md
Browse files
README.md
CHANGED
@@ -15,7 +15,9 @@ metrics:
|
|
15 |
|
16 |
This model trained with CommonVoice V8 dataset by increase data from CommonVoice V7 dataset that It was use in [airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th). It was finetune [wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53).
|
17 |
|
18 |
-
|
|
|
|
|
19 |
|
20 |
## Datasets
|
21 |
|
@@ -31,24 +33,32 @@ This model was finetune [wav2vec2-large-xlsr-53](https://huggingface.co/facebook
|
|
31 |
|
32 |
**Test with CommonVoice V8 Testset**
|
33 |
|
34 |
-
| Model | WER by newmm (%) | WER by deepcut (%) | CER |
|
35 |
-
|
36 |
-
|
|
37 |
-
| wav2vec2 with
|
38 |
-
|
|
|
|
|
|
39 |
|
40 |
**Test with CommonVoice V7 Testset (same test by CV V7)**
|
41 |
|
42 |
-
| Model | WER by newmm (%) | WER by deepcut (%) | CER |
|
43 |
-
|
44 |
-
|
|
45 |
-
| wav2vec2 with
|
46 |
-
|
|
|
|
|
|
47 |
|
48 |
This is use same testset from [https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th).
|
49 |
|
50 |
-
source code benchmark: https://github.com/wannaphong/thai-asr-benchmark/tree/main/commonvoice
|
51 |
-
|
52 |
**Links:**
|
53 |
- GitHub Dataset: [https://github.com/wannaphong/thai_commonvoice_dataset](https://github.com/wannaphong/thai_commonvoice_dataset)
|
54 |
-
-
|
|
|
|
|
|
|
|
|
|
|
|
|
|
15 |
|
16 |
This model trained with CommonVoice V8 dataset by increase data from CommonVoice V7 dataset that It was use in [airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th). It was finetune [wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53).
|
17 |
|
18 |
+
|
19 |
+
## Model description
|
20 |
+
- Paper: [Thai Wav2Vec2.0 with CommonVoice V8](https://arxiv.org/abs/2208.04799)
|
21 |
|
22 |
## Datasets
|
23 |
|
|
|
33 |
|
34 |
**Test with CommonVoice V8 Testset**
|
35 |
|
36 |
+
| Model | WER by newmm (%) | WER by deepcut (%) | CER |
|
37 |
+
|-----------------------|------------------|--------------------|----------|
|
38 |
+
| AIResearch.in.th and PyThaiNLP | 17.414503 | 11.923089 | 3.854153 |
|
39 |
+
| wav2vec2 with deepcut | 16.354521 | 11.424476 | 3.684060 |
|
40 |
+
| wav2vec2 with newmm | 16.698299 | 11.436941 | 3.737407 |
|
41 |
+
| **wav2vec2 with deepcut + language model** | 12.630260 | 9.613886 | 3.292073 |
|
42 |
+
| wav2vec2 with newmm + language model | 12.583706 | 9.598305 | 3.276610 |
|
43 |
|
44 |
**Test with CommonVoice V7 Testset (same test by CV V7)**
|
45 |
|
46 |
+
| Model | WER by newmm (%) | WER by deepcut (%) | CER |
|
47 |
+
|-----------------------|------------------|--------------------|----------|
|
48 |
+
| AIResearch.in.th and PyThaiNLP | 13.936698 | 9.347462 | 2.804787 |
|
49 |
+
| wav2vec2 with deepcut | 12.776381 | 8.773006 | 2.628882 |
|
50 |
+
| wav2vec2 with newmm | 12.750596 | 8.672616 | 2.623341 |
|
51 |
+
| **wav2vec2 with deepcut + language model** | 9.940050 | 7.423313 | 2.344940 |
|
52 |
+
| wav2vec2 with newmm + language model | 9.559724 | 7.339654 | 2.277071 |
|
53 |
|
54 |
This is use same testset from [https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th).
|
55 |
|
|
|
|
|
56 |
**Links:**
|
57 |
- GitHub Dataset: [https://github.com/wannaphong/thai_commonvoice_dataset](https://github.com/wannaphong/thai_commonvoice_dataset)
|
58 |
+
- Paper: [Thai Wav2Vec2.0 with CommonVoice V8](https://arxiv.org/abs/2208.04799)
|
59 |
+
|
60 |
+
## BibTeX entry and citation info
|
61 |
+
|
62 |
+
```
|
63 |
+
```
|
64 |
+
|