wannaphong
/

wav2vec2-large-xlsr-53-th-cv8-deepcut

@@ -15,7 +15,9 @@ metrics:
 This model trained with CommonVoice V8 dataset by increase data from CommonVoice V7 dataset that It was use in [airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th). It was finetune [wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53).
-GitHub: [https://github.com/wannaphong/th-cv-v8-wav2vev2-deepcut](https://github.com/wannaphong/th-cv-v8-wav2vev2-deepcut)
 ## Datasets
@@ -31,24 +33,32 @@ This model was finetune [wav2vec2-large-xlsr-53](https://huggingface.co/facebook
 **Test with CommonVoice V8 Testset**
-| Model                 | WER by newmm (%) | WER by deepcut (%) | CER      | URL                                                         |
-|-----------------------|------------------|--------------------|----------|-------------------------------------------------------------|
-| wav2vec2 with deepcut | 16.354521        | 11.424476          | 3.684060 | https://github.com/wannaphong/th-cv-v8-wav2vev2-deepcut     |
-| wav2vec2 with newmm   | 16.698299        | 11.436941          | 3.737407 | https://github.com/wannaphong/thai-wav2vec2-cv-v8           |
-| CV v7                 | 17.414503        | 11.923089          | 3.854153 | https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th |
 **Test with CommonVoice V7 Testset (same test by CV V7)**
-| Model                 | WER by newmm (%) | WER by deepcut (%) | CER      | URL                                                         |
-|-----------------------|------------------|--------------------|----------|-------------------------------------------------------------|
-| wav2vec2 with deepcut | 12.776381        | 8.773006           | 2.628882 | https://github.com/wannaphong/th-cv-v8-wav2vev2-deepcut     |
-| wav2vec2 with newmm   | 12.750596        | 8.672616           | 2.623341 | https://github.com/wannaphong/thai-wav2vec2-cv-v8           |
-| CV v7                 | 13.936698        | 9.347462           | 2.804787 | https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th |
 This is use same testset from [https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th).
-source code benchmark: https://github.com/wannaphong/thai-asr-benchmark/tree/main/commonvoice
 **Links:**
 - GitHub Dataset: [https://github.com/wannaphong/thai_commonvoice_dataset](https://github.com/wannaphong/thai_commonvoice_dataset)
-- Deepcut: [https://github.com/rkcosmos/deepcut](https://github.com/rkcosmos/deepcut)

 This model trained with CommonVoice V8 dataset by increase data from CommonVoice V7 dataset that It was use in [airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th). It was finetune [wav2vec2-large-xlsr-53](https://huggingface.co/facebook/wav2vec2-large-xlsr-53).
+## Model description
+- Paper: [Thai Wav2Vec2.0 with CommonVoice V8](https://arxiv.org/abs/2208.04799)
 ## Datasets
 **Test with CommonVoice V8 Testset**
+| Model                 | WER by newmm (%) | WER by deepcut (%) | CER      |
+|-----------------------|------------------|--------------------|----------|
+| AIResearch.in.th and PyThaiNLP                  | 17.414503        | 11.923089          | 3.854153 |
+| wav2vec2 with deepcut | 16.354521        | 11.424476          | 3.684060 |
+| wav2vec2 with newmm   | 16.698299        | 11.436941          | 3.737407 |
+| **wav2vec2 with deepcut + language model** | 12.630260        | 9.613886           | 3.292073 |
+| wav2vec2 with newmm + language model  | 12.583706        | 9.598305          | 3.276610 |
 **Test with CommonVoice V7 Testset (same test by CV V7)**
+| Model                 | WER by newmm (%) | WER by deepcut (%) | CER      |
+|-----------------------|------------------|--------------------|----------|
+| AIResearch.in.th and PyThaiNLP                  | 13.936698        | 9.347462           | 2.804787 |
+| wav2vec2 with deepcut | 12.776381        | 8.773006           | 2.628882 |
+| wav2vec2 with newmm   | 12.750596        | 8.672616           | 2.623341 |
+| **wav2vec2 with deepcut + language model** | 9.940050        | 7.423313           | 2.344940 |
+| wav2vec2 with newmm + language model   | 9.559724        | 7.339654          | 2.277071 |
 This is use same testset from [https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th](https://huggingface.co/airesearch/wav2vec2-large-xlsr-53-th).
 **Links:**
 - GitHub Dataset: [https://github.com/wannaphong/thai_commonvoice_dataset](https://github.com/wannaphong/thai_commonvoice_dataset)
+- Paper: [Thai Wav2Vec2.0 with CommonVoice V8](https://arxiv.org/abs/2208.04799)
+## BibTeX entry and citation info
+```
+```