4 research outputs found

AfriMTE and AfriCOMET: Empowering COMET to Embrace Under-resourced African Languages

Author: Abdullahi Saheed S.
Abolade Daud
Adelani David Ifeoluwa
Adewumi Tosin
Afolabi Abeeb
Agrawal Sweta
Ajao Simbiat
Akinjobi Zainab
Al-Azzawi Sana
Alkhaled Lama
Anigri Salma El
Aremu Anuoluwapo
Awoyomi Oluwabusayo Olufunke
Bourhim Sofia
Briakou Eleftheria
Brian Sam
Bukula Andiswa
Carpuat Marine
Chukwuneke Chiamaka
Etori Naome A.
Hassan Ayinde
He Xuanli
Hourrane Oumaima
Iro Ruqayya Nasir
Kimotho Wangari
Kimotho Wangui
Macharm Ricky
Mangwana Thabiso
Masiak Marek
Mbonu Chinedu Emmanuel
Mohamed Muhidin
Mohamed Shafie Abdi
Mokayed Hamam
Momo Lyse Naomi Wamba
Moore Stephen E.
Muchiri Eric
Muhammad Shamsuddeen Hassan
Mwase Christine
Ndolela Lolwethu
Njoroge Samuel
Obiefuna Nnaemeka
Ochieng Millicent
Ogayo Perez
Ogbu Onyekachi Raphael
Ojo Jessica
Olatoye Temitayo
Omotayo Abdul-Hakeem
Opoku Bernard
Osei Salomey
Otiende Verrah Akinyi
Rei Ricardo
Sari Sakayo Toadoum
Shode Iyanuoluwa
Siro Clemencia
Stenetorp Pontus
Wang Jiayi
Yuehgoh Foutse
Publication venue: Center for Open Science
Publication date: 16/11/2023

Despite the progress we have recorded in scaling multilingual machine translation (MT) models and evaluation data to several under-resourced African languages, it is difficult to accurately measure that progress because evaluation is often performed with n-gram matching metrics like BLEU, which often correlate worse with human judgments. Embedding-based metrics such as COMET correlate better; however, the lack of evaluation data with human ratings for under-resourced languages, the complexity of annotation guidelines like Multidimensional Quality Metrics (MQM), and the limited language coverage of multilingual encoders have hampered their applicability to African languages. In this paper, we address these challenges by creating high-quality human evaluation data with a simplified MQM guideline for error-span annotation and direct assessment (DA) scoring for 13 typologically diverse African languages. Furthermore, we develop AfriCOMET, a COMET evaluation metric for African languages, by leveraging DA training data from high-resource languages and an African-centric multilingual encoder (AfroXLM-Roberta) to create the state-of-the-art evaluation metric for African-language MT with respect to Spearman-rank correlation with human judgments (+0.406).

Lancaster E-Prints

MasakhaNEWS: News Topic Classification for African languages

Author: Abdullahi Saheed Salahudeen
Abdulmumin Idris
Abeeb Afolabi
Adeeko Adetola
Adelani David Ifeoluwa
Adelani Tolulope Anu
Ajayi Tunde Oluwaseyi
Al-Azzawi Sana Sabah
Alabi Jesujoba Oluwadara
Aremu Anuoluwapo
Awosan Oyinkansola F.
Awoyomi Oluwabusayo Olufunke
Azime Israel Abebe
Bame Mahlet Taye
Chukwuneke Chiamaka I.
David Davis
Diko Thina
Dossou Bonaventure F. P.
Emezue Chris Chinenye
Fanijo Samuel
Gebre Sinodos
Guge Tadesse Kebede
Gwadabe Tajuddeen
Hassan Fuad Mire
Johar Abdulmejid Tuni
Kailani Habiba Abdulganiy
Kimanuka Ussen
Kimotho Wangari
Masiak Marek
Mbonu Chinedu E.
Mehamed Moges Ahmed
Mohamed Muhidin
Mohamed Shafie Abdi
Muhammad Shamsuddeen Hassan
Mukiibi Jonathan
Mwase Christine
Ndolela Lolwethu
Ngabire Evrard
Ngoli Tatiana Moteu
Nixdorf Doreen
Nxakama Siyanda
Nyatsine Pamela
Obiefuna Nnaemeka C.
Odhiambo Brian
Oduwole Mardiyyah
Ogbu Onyekachi Raphael
Ogundepo Odunayo
Ojo Jessica
Oladipo Akintunde
Omotayo Abdul-Hakeem
Owodunni Abraham Toluwase
Samuel Olanrewaju
Sari Sakayo Toadoum
Shode Iyanuoluwa
Sibanda Blessing K.
Sidume Freedmore
Siro Clemencia
Stenetorp Pontus
Tonja Atnafu Lambebo
Tshinu Kanda Patrick
Yigezu Mesay Gemeda
Yousuf Oreen
Publication date: 19/04/2023

African languages are severely under-represented in NLP research due to a lack of datasets covering several NLP tasks. While there are individual language-specific datasets that are being expanded to different tasks, only a handful of NLP tasks (e.g. named entity recognition and machine translation) have standardized benchmark datasets covering several geographically and typologically diverse African languages. In this paper, we develop MasakhaNEWS, a new benchmark dataset for news topic classification covering 16 languages widely spoken in Africa. We provide an evaluation of baseline models by training classical machine learning models and fine-tuning several language models. Furthermore, we explore several alternatives to full fine-tuning of language models that are better suited for zero-shot and few-shot learning, such as cross-lingual parameter-efficient fine-tuning (like MAD-X), pattern exploiting training (PET), prompting language models (like ChatGPT), and prompt-free sentence transformer fine-tuning (SetFit and Cohere Embedding API). Our evaluation in the zero-shot setting shows the potential of prompting ChatGPT for news topic classification in low-resource African languages, achieving an average performance of 70 F1 points without leveraging additional supervision like MAD-X. In the few-shot setting, we show that with as few as 10 examples per label, we achieved more than 90% (i.e. 86.0 F1 points) of the performance of full supervised training (92.6 F1 points) by leveraging the PET approach.

Lancaster E-Prints

AfriMTE and AfriCOMET: Enhancing COMET to Embrace Under-resourced African Languages

Author: Abdullahi Saheed S.
Abolade Daud
Adelani David Ifeoluwa
Adewumi Tosin
Afolabi Abeeb
Agrawal Sweta
Ajao Simbiat
Akinjobi Zainab
Al-Azzawi Sana
Alkhaled Lama
Anigri Salma El
Aremu Anuoluwapo
Awoyomi Oluwabusayo Olufunke
Bourhim Sofia
Briakou Eleftheria
Brian Sam
Bukula Andiswa
Carpuat Marine
Chukwuneke Chiamaka
Etori Naome A.
Hassan Ayinde
He Xuanli
Hourrane Oumaima
Iro Ruqayya Nasir
Kimotho Wangari
Kimotho Wangui
Lu Yao
Macharm Ricky
Mangwana Thabiso
Masiak Marek
Mbonu Chinedu Emmanuel
Mohamed Muhidin
Mohamed Shafie Abdi
Mokayed Hamam
Momo Lyse Naomi Wamba
Moore Stephen E.
Muchiri Eric
Muhammad Shamsuddeen Hassan
Mwase Christine
Ndolela Lolwethu
Njoroge Samuel
Obiefuna Nnaemeka
Ochieng Millicent
Ogayo Perez
Ogbu Onyekachi Raphael
Ojo Jessica
Olatoye Temitayo
Omotayo Abdul-Hakeem
Opoku Bernard
Osei Salomey
Otiende Verrah Akinyi
Rei Ricardo
Sari Sakayo Toadoum
Shode Iyanuoluwa
Siro Clemencia
Stenetorp Pontus
Wang Jiayi
Yuehgoh Foutse
Publication venue: arXiv.org
Publication date: 16/11/2023

Despite the recent progress on scaling multilingual machine translation (MT) to several under-resourced African languages, accurately measuring this progress remains challenging, since evaluation is often performed with n-gram matching metrics such as BLEU, which typically show a weaker correlation with human judgments. Learned metrics such as COMET have higher correlation; however, the lack of evaluation data with human ratings for under-resourced languages, the complexity of annotation guidelines like Multidimensional Quality Metrics (MQM), and the limited language coverage of multilingual encoders have hampered their applicability to African languages. In this paper, we address these challenges by creating high-quality human evaluation data with simplified MQM guidelines for error detection and direct assessment (DA) scoring for 13 typologically diverse African languages. Furthermore, we develop AfriCOMET: COMET evaluation metrics for African languages, built by leveraging DA data from well-resourced languages and an African-centric multilingual encoder (AfroXLM-R), to create the state-of-the-art MT evaluation metrics for African languages with respect to Spearman-rank correlation with human judgments (0.441).

Aston Publications Explorer

MasakhaNEWS: News Topic Classification for African languages

Author: Abdullahi Saheed Salahudeen
Abdulmumin Idris
Abeeb Afolabi
Adeeko Adetola
Adelani David Ifeoluwa
Adelani Tolulope Anu
Ajayi Tunde Oluwaseyi
Al-Azzawi Sana Sabah
Alabi Jesujoba Oluwadara
Aremu Anuoluwapo
Awosan Oyinkansola F.
Awoyomi Oluwabusayo Olufunke
Azime Israel Abebe
Bame Mahlet Taye
Chukwuneke Chiamaka I.
David Davis
Diko Thina
Dossou Bonaventure F. P.
Emezue Chris Chinenye
Fanijo Samuel
Gebre Sinodos
Guge Tadesse Kebede
Gwadabe Tajuddeen
Hassan Fuad Mire
Johar Abdulmejid Tuni
Kailani Habiba Abdulganiy
Kimanuka Ussen
Kimotho Wangari
Masiak Marek
Mbonu Chinedu E.
Mehamed Moges Ahmed
Mohamed Muhidin
Mohamed Shafie Abdi
Muhammad Shamsuddeen Hassan
Mukiibi Jonathan
Mwase Christine
Ndolela Lolwethu
Ngabire Evrard
Ngoli Tatiana Moteu
Nixdorf Doreen
Nxakama Siyanda
Nyatsine Pamela
Obiefuna Nnaemeka C.
Odhiambo Brian
Oduwole Mardiyyah
Ogbu Onyekachi Raphael
Ogundepo Odunayo
Ojo Jessica
Oladipo Akintunde
Omotayo Abdul-Hakeem
Owodunni Abraham Toluwase
Samuel Olanrewaju
Sari Sakayo Toadoum
Shode Iyanuoluwa
Sibanda Blessing K.
Sidume Freedmore
Siro Clemencia
Stenetorp Pontus
Tonja Atnafu Lambebo
Tshinu Kanda Patrick
Yigezu Mesay Gemeda
Yousuf Oreen
Publication venue: arXiv.org
Publication date: 19/04/2023

Aston Publications Explorer