wannaphong committed on
Commit 350dbcc · 1 Parent(s): 13956bc

Update README.md

Files changed (1)
  1. README.md +24 -14
README.md CHANGED
@@ -15,7 +15,9 @@ metrics:

  This model was trained on the CommonVoice V8 dataset, which adds data to the CommonVoice V7 dataset used by [airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th). It was fine-tuned from [wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53).

- GitHub: [https://github.com/wannaphong/th-cv-v8-wav2vev2-deepcut](https://github.com/wannaphong/th-cv-v8-wav2vev2-deepcut)
+
+ ## Model description
+ - Paper: [Thai Wav2Vec2.0 with CommonVoice V8](https://arxiv.org/abs/2208.04799)

  ## Datasets

@@ -31,24 +33,32 @@ This model was finetune [wav2vec2-large-xlsr-53](https://huggingface.co/facebook

  **Test with CommonVoice V8 Testset**

- | Model | WER by newmm (%) | WER by deepcut (%) | CER | URL |
- |-----------------------|------------------|--------------------|----------|-------------------------------------------------------------|
- | wav2vec2 with deepcut | 16.354521 | 11.424476 | 3.684060 | https://github.com/wannaphong/th-cv-v8-wav2vev2-deepcut |
- | wav2vec2 with newmm | 16.698299 | 11.436941 | 3.737407 | https://github.com/wannaphong/thai-wav2vec2-cv-v8 |
- | CV v7 | 17.414503 | 11.923089 | 3.854153 | https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th |
+ | Model | WER by newmm (%) | WER by deepcut (%) | CER |
+ |-----------------------|------------------|--------------------|----------|
+ | AIResearch.in.th and PyThaiNLP | 17.414503 | 11.923089 | 3.854153 |
+ | wav2vec2 with deepcut | 16.354521 | 11.424476 | 3.684060 |
+ | wav2vec2 with newmm | 16.698299 | 11.436941 | 3.737407 |
+ | **wav2vec2 with deepcut + language model** | 12.630260 | 9.613886 | 3.292073 |
+ | wav2vec2 with newmm + language model | 12.583706 | 9.598305 | 3.276610 |

  **Test with CommonVoice V7 Testset (same test set as the CV V7 model)**

- | Model | WER by newmm (%) | WER by deepcut (%) | CER | URL |
- |-----------------------|------------------|--------------------|----------|-------------------------------------------------------------|
- | wav2vec2 with deepcut | 12.776381 | 8.773006 | 2.628882 | https://github.com/wannaphong/th-cv-v8-wav2vev2-deepcut |
- | wav2vec2 with newmm | 12.750596 | 8.672616 | 2.623341 | https://github.com/wannaphong/thai-wav2vec2-cv-v8 |
- | CV v7 | 13.936698 | 9.347462 | 2.804787 | https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th |
+ | Model | WER by newmm (%) | WER by deepcut (%) | CER |
+ |-----------------------|------------------|--------------------|----------|
+ | AIResearch.in.th and PyThaiNLP | 13.936698 | 9.347462 | 2.804787 |
+ | wav2vec2 with deepcut | 12.776381 | 8.773006 | 2.628882 |
+ | wav2vec2 with newmm | 12.750596 | 8.672616 | 2.623341 |
+ | **wav2vec2 with deepcut + language model** | 9.940050 | 7.423313 | 2.344940 |
+ | wav2vec2 with newmm + language model | 9.559724 | 7.339654 | 2.277071 |

  This is the same test set used by [https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th).

- source code benchmark: https://github.com/wannaphong/thai-asr-benchmark/tree/main/commonvoice
-
  **Links:**
  - GitHub Dataset: [https://github.com/wannaphong/thai_commonvoice_dataset](https://github.com/wannaphong/thai_commonvoice_dataset)
- - Deepcut: [https://github.com/rkcosmos/deepcut](https://github.com/rkcosmos/deepcut)
+ - Paper: [Thai Wav2Vec2.0 with CommonVoice V8](https://arxiv.org/abs/2208.04799)
+
+ ## BibTeX entry and citation info
+
+ ```
+ ```
+
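
Note on the metrics in the tables: "WER by newmm" and "WER by deepcut" are word error rates computed after segmenting the Thai text with two different word tokenizers, while CER is the same kind of error rate computed over characters. The following is a minimal sketch of that idea, not the benchmark code referenced by this README. The function names, the corpus-level aggregation, and the choice to drop spaces for CER are assumptions made for illustration. A whitespace tokenizer stands in for the Thai tokenizers so the sketch has no dependencies; in practice the `tokenize` argument would presumably be something like PyThaiNLP's `word_tokenize` with `engine="newmm"`, or `deepcut.tokenize`.

```python
from typing import Callable, List, Sequence


def edit_distance(ref: Sequence[str], hyp: Sequence[str]) -> int:
    """Levenshtein distance: minimum substitutions, insertions, and deletions."""
    prev = list(range(len(hyp) + 1))
    for i, r in enumerate(ref, start=1):
        curr = [i]
        for j, h in enumerate(hyp, start=1):
            cost = 0 if r == h else 1
            curr.append(min(prev[j] + 1,          # deletion
                            curr[j - 1] + 1,      # insertion
                            prev[j - 1] + cost))  # substitution or match
        prev = curr
    return prev[-1]


def error_rate(refs: List[str], hyps: List[str],
               tokenize: Callable[[str], List[str]]) -> float:
    """Corpus-level error rate in percent (assumption: errors are summed over all utterances)."""
    total_errors = 0
    total_ref_tokens = 0
    for ref, hyp in zip(refs, hyps):
        ref_tokens = tokenize(ref)
        hyp_tokens = tokenize(hyp)
        total_errors += edit_distance(ref_tokens, hyp_tokens)
        total_ref_tokens += len(ref_tokens)
    if total_ref_tokens == 0:
        raise ValueError("references contain no tokens")
    return 100.0 * total_errors / total_ref_tokens


def whitespace_tokenize(text: str) -> List[str]:
    """Hypothetical stand-in for a Thai word tokenizer such as newmm or deepcut."""
    return text.split()


def char_tokenize(text: str) -> List[str]:
    """Character units for CER (assumption: spaces are ignored)."""
    return list(text.replace(" ", ""))


refs = ["the cat sat"]
hyps = ["the cat sit"]
wer = error_rate(refs, hyps, whitespace_tokenize)  # one wrong word out of three
cer = error_rate(refs, hyps, char_tokenize)        # one wrong character out of nine
print(f"WER: {wer:.2f}%  CER: {cer:.2f}%")
```

Because the word tokenizer decides where word boundaries fall, the same transcript can yield different WER values under newmm and deepcut, which is why the tables report both.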