publications | Shamsuddeen Hassan Muhammad

An up-to-date list is available on Google Scholar.

2026

FAccT

Going PLACES: Participatory Localized Red Teaming for Text-to-Image Safety in the Global South

C. Rastogi, M. Bhutani, M. Kahng, and 13 more authors

In Proceedings of the 2026 ACM Conference on Fairness, Accountability, and Transparency (FAccT 2026), 2026

arXiv
TACL

Beyond Majority Voting: Agreement-Based Clustering to Model Annotator Perspectives in Subjective NLP Tasks

T. D. Belay, I. S. Ahmad, I. Abdulmumin, and 6 more authors

Transactions of the Association for Computational Linguistics (TACL), 2026

arXiv
ACL

DimABSA: Building Multilingual and Multidomain Datasets for Dimensional Aspect-Based Sentiment Analysis

S. H. Muhammad, and others

In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), 2026
ACL

CommonLID: Re-evaluating State-of-the-Art Language Identification Performance on Web Data

S. H. Muhammad, and others

In Proceedings of the 64th Annual Meeting of the Association for Computational Linguistics (ACL 2026), 2026
arXiv

AfriSUD: A Dependency Treebank Collection for Evaluating Models on African Languages

S. H. Muhammad, and others

arXiv preprint, 2026
arXiv

AfriScience-MT: Towards Decolonizing Science in Africa through Text Translation

I. Abdulmumin, T. Gwadabe, S. H. Muhammad, and 1 more author

arXiv preprint, 2026

2025

ACL

BRIGHTER: Bridging the Gap in Human‑Annotated Textual Emotion Recognition Datasets for 28 Languages

S. H. Muhammad, N. Ousidhoum, I. Abdulmumin, and 1 more author

In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025), 2025

ACL 2025 Best Resource Paper Award
SemEval

SemEval‑2025 Task 11: Bridging the Gap in Text‑Based Emotion Detection

S. H. Muhammad, N. Ousidhoum, I. Abdulmumin, and 1 more author

In Proceedings of the 19th International Workshop on Semantic Evaluation (SemEval‑2025), 2025

Best Task Paper Award
NAACL

IrokoBench: A New Benchmark for African Languages in the Age of Large Language Models

D. I. Adelani, J. Ojo, I. A. Azime, and 24 more authors

In Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2025, Long Papers), 2025

Outstanding Paper Award
arXiv

The State of Large Language Models for African Languages: Progress and Challenges

K. Y. Hussen, W. T. Sewunetie, A. A. Ayele, and 3 more authors

arXiv preprint, 2025

Best Paper Award at Deep Learning Indaba 2025
NAACL

AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages

S. H. Muhammad, I. Abdulmumin, A. A. Ayele, and 3 more authors

In Proceedings of the 2025 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2025, Long Papers), 2025
arXiv

HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing

S. H. Muhammad, I. S. Ahmad, I. Abdulmumin, and 8 more authors

arXiv preprint, 2025
AfricaNLP

Automatic Speech Recognition for African Low‑Resource Languages: Challenges and Future Directions

S. H. Imam, B. Sani, D. K. Gete, and 7 more authors

In Proceedings of the AfricaNLP 2025 Workshop at ACL 2025, 2025
AfricaNLP

Who Wrote This? Identifying Machine vs Human‑Generated Text in Hausa

B. Sani, A. Soy, S. H. Imam, and 6 more authors

In Proceedings of the AfricaNLP 2025 Workshop at ACL 2025, 2025
ACL

INJONGO: A Multicultural Intent Detection and Slot‑filling Dataset for 16 African Languages

H. Yu, J. O. Alabi, A. Bukula, and 15 more authors

In Proceedings of the 63rd Annual Meeting of the Association for Computational Linguistics (ACL 2025, Long Papers), 2025
arXiv

POLAR: A Benchmark for Multilingual, Multicultural, and Multi‑Event Online Polarization

U. Naseem, J. Ren, S. Anwar, and 9 more authors

arXiv preprint, 2025
EMNLP

AfroXLMR‑Social: Adapting Pre‑trained Language Models for African Languages Social Media Text

T. D. Belay, I. A. Azime, I. S. Ahmad, and 5 more authors

In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025), 2025
EMNLP

AfriDoc‑MT: Document‑level MT Corpus for African Languages

J. O. Alabi, I. A. Azime, M. Zhang, and 9 more authors

In Proceedings of the 2025 Conference on Empirical Methods in Natural Language Processing (EMNLP 2025), 2025

2024

NeurIPS

BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages

J. Myung, N. Lee, Y. Zhou, and 19 more authors

In Advances in Neural Information Processing Systems (NeurIPS 2024, Datasets and Benchmarks Track), 2024

Best Non‑archival Paper Award at C3NLP Workshop
SemEval

SemEval Task 1: Semantic Textual Relatedness for African and Asian Languages

N. Ousidhoum, S. H. Muhammad, and others

In Proceedings of the 18th International Workshop on Semantic Evaluation (SemEval‑2024), 2024

Honourable Mention, Best Task Paper Award
Findings

SemRel2024: A Collection of Semantic Textual Relatedness Datasets for 13 Languages

N. Ousidhoum, S. H. Muhammad, and others

In Findings of the Association for Computational Linguistics: ACL 2024, 2024
arXiv

Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low‑resource African Languages

E. Bayes, I. A. Azime, J. O. Alabi, and 8 more authors

In arXiv preprint, 2024
NAACL

AfriMTE and AfriCOMET: Enhancing COMET to Embrace Under‑resourced African Languages

J. Wang, D. I. Adelani, S. Agrawal, and 18 more authors

In Proceedings of the 2024 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL 2024, Long Papers), 2024
WMT

Correcting FLORES Evaluation Dataset for Four African Languages

I. Abdulmumin, S. Mkhwanazi, M. Mbooi, and 3 more authors

In Proceedings of the Ninth Conference on Machine Translation (WMT 2024), 2024
WOAH

HausaHate: An Expert Annotated Corpus for Hausa Hate Speech Detection

F. A. Vargas, S. Guimarães, S. H. Muhammad, and 3 more authors

In Proceedings of the 8th Workshop on Online Abuse and Harms (WOAH 2024), 2024
WMT

Findings of WMT2024 English‑to‑Low Resource Multimodal Translation Task

S. Parida, O. Bojar, I. Abdulmumin, and 3 more authors

In Proceedings of the Ninth Conference on Machine Translation (WMT 2024), 2024

2023

EMNLP

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages

S. H. Muhammad, I. Abdulmumin, A. A. Ayele, and 2 more authors

In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023, Main), 2023

Best Non‑archival Paper Award at AfricaNLP 2023 Workshop
Findings

HaVQA: A Dataset for Visual Question Answering and Multimodal Research in Hausa Language

S. Parida, I. Abdulmumin, S. H. Muhammad, and 4 more authors

In Findings of the Association for Computational Linguistics: ACL 2023, 2023
NAACL

AfriMTE and AfriCOMET: Empowering COMET to Embrace Under‑resourced African Languages

J. Wang, D. I. Adelani, S. Agrawal, and 4 more authors

In Proceedings of the 2023 Conference of the North American Chapter of the Association for Computational Linguistics (NAACL 2023), 2023
EMNLP

AfriWOZ: Corpus for Exploiting Cross‑Lingual Transfer for Dialogue Generation in Low‑Resource African Languages

T. P. Adewumi, M. Adeyemi, A. Anuoluwapo, and 4 more authors

In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023
Chapter

Combining Symbolic and Deep Learning Approaches for Sentiment Analysis

S. H. Muhammad, P. Brazdil, and A. M. Jorge

In Compendium of Neurosymbolic Artificial Intelligence, 2023
ACL

MasakhaPOS: Part‑of‑Speech Tagging for Typologically Diverse African Languages

C. M. B. Dione, D. I. Adelani, P. Nabende, and 4 more authors

In Proceedings of the 2023 Annual Meeting of the Association for Computational Linguistics (ACL 2023), 2023
EMNLP

AfriSenti: A Twitter Sentiment Analysis Benchmark for African Languages

S. H. Muhammad, I. Abdulmumin, I. S. Ahmad, and 2 more authors

In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023
arXiv

HausaNLP at SemEval‑2023 Task 10: Transfer Learning, Synthetic Data and Side‑information for Multi‑level Sexism Classification

S. M. Aliyu, I. Abdulmumin, S. H. Muhammad, and 3 more authors

arXiv preprint, 2023
EMNLP

AfriQA: Cross‑lingual Open‑Retrieval Question Answering for African Languages

N. Ousidhoum, S. H. Muhammad, M. Abdalla, and 3 more authors

In Proceedings of the 2023 Conference on Empirical Methods in Natural Language Processing (EMNLP 2023), 2023
arXiv

The African Stopwords Project: Curating Stopwords for African Languages

C. C. Emezue, H. H. Nigatu, C. Thinwa, and 3 more authors

arXiv preprint, 2023

2022

EMNLP

MasakhaNER 2.0: Africa‑centric Transfer Learning for Named Entity Recognition

D. I. Adelani, G. Neubig, S. Ruder, and 4 more authors

In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), 2022

Service: IJCNLP–AACL 2023 Area Chair Award
LREC

NaijaSenti: A Nigerian Twitter Sentiment Corpus for Multilingual Sentiment Analysis

S. H. Muhammad, D. I. Adelani, S. Ruder, and 3 more authors

In Proceedings of the Thirteenth Language Resources and Evaluation Conference (LREC 2022), 2022
arXiv

Separating Grains from the Chaff: Using Data Filtering to Improve Multilingual Translation for Low‑Resourced African Languages

I. Abdulmumin, M. Beukman, J. O. Alabi, and 3 more authors

arXiv preprint, 2022
EMNLP

MasakhaNER 2.0: Africa‑centric Transfer Learning for Named Entity Recognition

D. I. Adelani, G. Neubig, S. Ruder, and 4 more authors

In Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing (EMNLP 2022), 2022
arXiv

BibleTTS: A Large, High‑Fidelity, Multilingual, and Uniquely African Speech Corpus

J. Meyer, D. I. Adelani, E. Casanova, and 3 more authors

In arXiv preprint, 2022
EPIA

Symbolic Versus Deep Learning Techniques for Explainable Sentiment Analysis

S. H. Muhammad, P. Brazdil, and A. M. Jorge

In Portuguese Conference on Artificial Intelligence, 2022
LREC

Hausa Visual Genome: A Dataset for Multi‑Modal English to Hausa Machine Translation

I. Abdulmumin, S. R. Dash, M. A. Dawud, and 3 more authors

In Proceedings of the 2022 International Conference on Language Resources and Evaluation (LREC 2022), 2022
arXiv

HERDPhobia: A Dataset for Hate Speech Against Fulani in Nigeria

S. M. Aliyu, G. M. Wajiga, M. T. Murtala, and 2 more authors

arXiv preprint, 2022

2021

TACL

Quality at a Glance: An Audit of Web‑Crawled Multilingual Datasets

I. Caswell, J. Kreutzer, L. Wang, and 4 more authors

Transactions of the Association for Computational Linguistics, 2021

2020

Findings

Participatory Research for Low‑resourced Machine Translation: A Case Study in African Languages

W. Nekoto, V. Marivate, T. Matsila, and 5 more authors

In Findings of the Association for Computational Linguistics: EMNLP 2020, 2020

Wikimedia Research Award
arXiv

Participatory Research for Low‑resourced Machine Translation: A Case Study in African Languages

W. O. Nekoto, V. Marivate, T. Matsila, and 4 more authors

arXiv preprint, 2020
ECIR

Incremental Approach for Automatic Generation of Domain‑Specific Sentiment Lexicon

S. H. Muhammad, P. Brazdil, and A. M. Jorge

In Advances in Information Retrieval (ECIR 2020), 2020
SN CS

A Survey on Machine Learning Techniques in Movie Revenue Prediction

I. S. Ahmad, A. Abu Bakar, M. R. Yaakub, and 1 more author

SN Computer Science, 2020

2019

arXiv

An Overview of Sentiment Analysis Approaches

S. H. Muhammad

arXiv preprint, 2019

2017

OcRI

Massive Open Online Courses: Awareness, Adoption, Benefits and Challenges in Sub‑Saharan Africa

S. H. Muhammad, A. Mustapha, and K. Haruna

In Proceedings of OcRI, 2017
OcRI

A Framework for Implementation of E‑Classroom System

K. Haruna, M. Baffa, S. H. Muhammad, and 1 more author

In Proceedings of OcRI, 2017

2016

OcRI

Massive Open Online Courses: A Success of Cloud Computing in Education

A. Mustapha, S. H. Muhammad, and S. A. Salahudeen

In Proceedings of OcRI, 2016