Shamsuddeen Hassan Muhammad
Imperial College London. Google DeepMind Academic Fellow. Co-founder of HausaNLP. Founder of Arewa Data Science Academy.

I am an Advanced Research Fellow and a Google DeepMind Academic Fellow at Imperial College London. I received my PhD from the University of Porto, Portugal, under the supervision of Professor Pavel Brazdil and Professor Alipio Jorge. Prior to that, I earned an MS in Computer Science from the University of Manchester, UK, and a BSc in Computer Science from Bayero University, Kano, Nigeria. I also serve as a faculty member at the Faculty of Computing, Bayero University, Kano-Nigeria.
I have co-authored over 60 research papers published in top venues such as ICLR, NeurIPS, ACL, EMNLP, NAACL, and LREC. My work has received wide recognition, contributing to a significant number of citations and 6 best paper awards. I have served the research community in various leadership roles, including Area Chair for ACL, NAACL, and EMNLP.
I am deeply passionate about diversity and inclusion. To further this cause, I co-founded the HausaNLP research group, which aims to advance research and development in Hausa language, one of the most widely spoken languages in Africa. I also founded the Arewa Data Science Academy, which aims to democratize data science and AI education by providing free data science and machine learning training to underserved students in Nigeria.
news
Jun 07, 2025 | I will be giving a keynote speech at the York St John University on 19th June 2025. |
---|---|
Jun 04, 2025 | I will be teaching Natural Language Processing (NLP) at the African Institute for Mathematical Sciences (AIMS) Camerron from Febuary 23 to March 14, 2025. |
Jun 02, 2025 | I will be teaching Natural Language Processing (NLP) in the AI for Science Master’s program at the African Institute for Mathematical Sciences (AIMS) in South Africa from November 24 to December 12, 2025. |
May 05, 2025 | Two papers accepted at ACL2024 |
May 05, 2025 | Four Papers Accepted at AfricaNLP 2025 |
latest posts
selected publications
- SemEval-2025 task 11: Bridging the gap in text-based emotion detectionarXiv preprint arXiv:2503.07269, 2025
- INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African LanguagesarXiv preprint arXiv:2502.09814, 2025
- The State of Large Language Models for African Languages: Progress and ChallengesIn , 2025
- POLAR: A Benchmark for Multilingual, Multicultural, and Multi-Event Online PolarizationIn , 2025
- HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language ProcessingIn , 2025
- HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language ProcessingIn , 2025
- Exploring Cultural Nuances in Emotion Perception Across 15 African LanguagesArXiv, 2025
- AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media TextArXiv, 2025
- Whispering in Amharic: Fine-tuning Whisper for Low-resource LanguageArXiv, 2025
- Who Wrote This? Identifying Machine vs Human-Generated Text in HausaArXiv, 2025
- Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African LanguagesArXiv, 2024
- Correcting FLORES Evaluation Dataset for Four African LanguagesIn Conference on Machine Translation, 2024
- Mitigating Translationese in Low-resource Languages: The Storyboard ApproachArXiv, 2024
- BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and LanguagesArXiv, 2024