Shamsuddeen Hassan Muhammad

Imperial College London. Google DeepMind Academic Fellow. Co-founder of HausaNLP. Founder of Arewa Data Science Academy.

I am an Advanced Research Fellow and a Google DeepMind Academic Fellow at Imperial College London. I received my PhD from the University of Porto, Portugal, under the supervision of Professor Pavel Brazdil and Professor Alipio Jorge. Prior to that, I earned an MS in Computer Science from the University of Manchester, UK, and a BSc in Computer Science from Bayero University, Kano, Nigeria. I also serve as a faculty member at the Faculty of Computing, Bayero University, Kano, Nigeria.

I have co-authored over 60 research papers published in top venues such as ICLR, NeurIPS, ACL, EMNLP, NAACL, and LREC. My work has been widely cited (see Google Scholar) and has received 6 best paper awards. I have served the research community in various leadership roles, including Area Chair for ACL, NAACL, and EMNLP.

I am deeply passionate about diversity and inclusion. To further this cause, I co-founded the HausaNLP research group, which aims to advance research and development in the Hausa language, one of the most widely spoken languages in Africa. I also founded the Arewa Data Science Academy, which aims to democratize data science and AI education by providing free data science and machine learning training to underserved students in Nigeria.

news

Jun 07, 2025 I will be giving a keynote speech at York St John University on 19th June 2025.
Jun 04, 2025 I will be teaching Natural Language Processing (NLP) at the African Institute for Mathematical Sciences (AIMS) Cameroon from February 23 to March 14, 2025.
Jun 02, 2025 I will be teaching Natural Language Processing (NLP) in the AI for Science Master’s program at the African Institute for Mathematical Sciences (AIMS) in South Africa from November 24 to December 12, 2025.
May 05, 2025 Two papers accepted at ACL2024
May 05, 2025 Four Papers Accepted at AfricaNLP 2025

selected publications

  1. BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages
    Shamsuddeen Hassan Muhammad, Nedjma Ousidhoum, Idris Abdulmumin, and 8 more authors
    arXiv preprint arXiv:2502.11926, 2025
  2. SemEval-2025 task 11: Bridging the gap in text-based emotion detection
    Shamsuddeen Hassan Muhammad, Nedjma Ousidhoum, Idris Abdulmumin, and 8 more authors
    arXiv preprint arXiv:2503.07269, 2025
  3. AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages
    Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Abinew Ali Ayele, and 24 more authors
    2025
  4. INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages
    Hao Yu, Jesujoba O Alabi, Andiswa Bukula, and 8 more authors
    arXiv preprint arXiv:2502.09814, 2025
  5. The State of Large Language Models for African Languages: Progress and Challenges
    Kedir Yassin Hussen, Walelign Tewabe Sewunetie, Abinew Ali Ayele, and 3 more authors
    2025
  6. POLAR: A Benchmark for Multilingual, Multicultural, and Multi-Event Online Polarization
    Usman Naseem, Juan Ren, Saba Anwar, and 14 more authors
    2025
  7. HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing
    Shamsuddeen Hassan Muhammad, Ibrahim Said Ahmad, Idris Abdulmumin, and 8 more authors
    2025
  8. Exploring Cultural Nuances in Emotion Perception Across 15 African Languages
    Ibrahim Said Ahmad, Shiran Dudy, Tadesse Destaw Belay, and 4 more authors
    ArXiv, 2025
  9. AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text
    Tadesse Destaw Belay, Israel Abebe Azime, Ibrahim Said Ahmad, and 4 more authors
    ArXiv, 2025
  10. Whispering in Amharic: Fine-tuning Whisper for Low-resource Language
    Dawit Ketema Gete, Bedru Yimam Ahamed, Tadesse Destaw Belay, and 11 more authors
    ArXiv, 2025
  11. Who Wrote This? Identifying Machine vs Human-Generated Text in Hausa
    Babangida Sani, Aakansha Soy, Sukairaj Hafiz Imam, and 5 more authors
    ArXiv, 2025
  12. Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages
    Edward Bayes, Israel Abebe Azime, Jesujoba Oluwadara Alabi, and 11 more authors
    ArXiv, 2024
  13. Correcting FLORES Evaluation Dataset for Four African Languages
    Idris Abdulmumin, Sthembiso Mkhwanazi, Mahlatse S Mbooi, and 7 more authors
    In Conference on Machine Translation, 2024
  14. Mitigating Translationese in Low-resource Languages: The Storyboard Approach
    Garry Kuwanto, Eno-Abasi Urua, Priscilla Amuok, and 21 more authors
    ArXiv, 2024
  15. BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages
    Junho Myung, Nayeon Lee, Yi Zhou, and 19 more authors
    ArXiv, 2024