AMR-KELEG commited on
Commit
65b3d25
·
verified ·
1 Parent(s): 3ab43b3

Update README.md

Browse files
Files changed (1) hide show
  1. README.md +42 -0
README.md CHANGED
@@ -88,4 +88,46 @@ print(s1, s1_pred)
88
  s2 = "خليلي في مساج بريفي كيفاش الاتصال"
89
  s2_pred = predict_top_p(s2) # ['Algeria', 'Tunisia']
90
  print(s2, s2_pred)
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
91
  ```
 
88
  s2 = "خليلي في مساج بريفي كيفاش الاتصال"
89
  s2_pred = predict_top_p(s2) # ['Algeria', 'Tunisia']
90
  print(s2, s2_pred)
91
+ ```
92
+
93
+ ### Citation
94
+
95
+ If you find the model useful, please cite the following [respective paper](https://aclanthology.org/2024.arabicnlp-1.79/):
96
+ ```
97
+ @inproceedings{abdul-mageed-etal-2024-nadi,
98
+ title = "{NADI} 2024: The Fifth Nuanced {A}rabic Dialect Identification Shared Task",
99
+ author = "Abdul-Mageed, Muhammad and
100
+ Keleg, Amr and
101
+ Elmadany, AbdelRahim and
102
+ Zhang, Chiyu and
103
+ Hamed, Injy and
104
+ Magdy, Walid and
105
+ Bouamor, Houda and
106
+ Habash, Nizar",
107
+ editor = "Habash, Nizar and
108
+ Bouamor, Houda and
109
+ Eskander, Ramy and
110
+ Tomeh, Nadi and
111
+ Abu Farha, Ibrahim and
112
+ Abdelali, Ahmed and
113
+ Touileb, Samia and
114
+ Hamed, Injy and
115
+ Onaizan, Yaser and
116
+ Alhafni, Bashar and
117
+ Antoun, Wissam and
118
+ Khalifa, Salam and
119
+ Haddad, Hatem and
120
+ Zitouni, Imed and
121
+ AlKhamissi, Badr and
122
+ Almatham, Rawan and
123
+ Mrini, Khalil",
124
+ booktitle = "Proceedings of The Second Arabic Natural Language Processing Conference",
125
+ month = aug,
126
+ year = "2024",
127
+ address = "Bangkok, Thailand",
128
+ publisher = "Association for Computational Linguistics",
129
+ url = "https://aclanthology.org/2024.arabicnlp-1.79",
130
+ pages = "709--728",
131
+ abstract = "We describe the findings of the fifth Nuanced Arabic Dialect Identification Shared Task (NADI 2024). NADI{'}s objective is to help advance SoTA Arabic NLP by providing guidance, datasets, modeling opportunities, and standardized evaluation conditions that allow researchers to collaboratively compete on prespecified tasks. NADI 2024 targeted both dialect identification cast as a multi-label task (Subtask 1), identification of the Arabic level of dialectness (Subtask 2), and dialect-to-MSA machine translation (Subtask 3). A total of 51 unique teams registered for the shared task, of whom 12 teams have participated (with 76 valid submissions during the test phase). Among these, three teams participated in Subtask 1, three in Subtask 2, and eight in Subtask 3. The winning teams achieved 50.57 F1 on Subtask 1, 0.1403 RMSE for Subtask 2, and 20.44 BLEU in Subtask 3, respectively. Results show that Arabic dialect processing tasks such as dialect identification and machine translation remain challenging. We describe the methods employed by the participating teams and briefly offer an outlook for NADI.",
132
+ }
133
  ```