Shamsuddeen Hassan Muhammad

Imperial College London. Google DeepMind Academic Fellow. Co-founder of HausaNLP. Founder of Arewa Data Science Academy.


I am a Google DeepMind Academic Fellow and an Advanced Research Fellow at Imperial College London. I also serve as an Assistant Professor (Senior Lecturer) at the Faculty of Computing, Bayero University, Kano, Nigeria, and as a Visiting Professor at the African Institute for Mathematical Sciences in Cameroon and South Africa, as well as at York St John University.

I received my PhD from the University of Porto, Portugal, under the supervision of Professor Pavel Brazdil and Professor Alipio Jorge. Prior to that, I earned an MS in Computer Science from the University of Manchester, UK, and a BSc in Computer Science from Bayero University, Kano, Nigeria.

My research focuses on Natural Language Processing (NLP) for low-resource African languages and aims to close the gap in equitable language technologies for underserved communities. I have published in top venues such as ACL, EMNLP, NAACL, ICLR, and NeurIPS. My work has received wide recognition and several best paper awards. I have served the research community in various leadership roles, including as Area Chair for ACL, NAACL, and EMNLP.

I am deeply passionate about diversity and inclusion. To further this cause, I co-founded the HausaNLP research group, which aims to advance research and development in the Hausa language, one of the most widely spoken languages in Africa. I also founded the Arewa Data Science Academy, which seeks to democratize data science and AI education by providing free data science and machine learning training to underserved students.

If you have any questions about my research or want to collaborate, feel free to contact me anytime.

news

Jun 15, 2025 I will be giving a keynote speech at the Empowering India Through Inclusive Generative AI workshop on July 5, 2025.
Jun 14, 2025 I will be giving a keynote speech at the Workshop on Multilingual and Equitable Language Technologies (MELT) on October 10, 2025 at the Palais des Congrès, Montreal, Canada.
Jun 07, 2025 I will be giving a keynote speech at York St John University on June 19, 2025.
Jun 04, 2025 I will be teaching Natural Language Processing (NLP) at the African Institute for Mathematical Sciences (AIMS) Cameroon from February 23 to March 14, 2025.
Jun 02, 2025 I will be teaching Natural Language Processing (NLP) in the AI for Science Master’s program at the African Institute for Mathematical Sciences (AIMS) in South Africa from November 24 to December 12, 2025.
May 05, 2025 Two papers accepted at ACL 2025
May 05, 2025 Four papers accepted at AfricaNLP 2025
Mar 25, 2025 Our paper IrokoBench received the Outstanding Paper Award at NAACL 2025 in Albuquerque, New Mexico (April 30–May 7, 2025).
Mar 22, 2025 I will be attending NAACL 2025 in Albuquerque, New Mexico (April 30–May 7, 2025) to present the AfriHate paper.
Feb 01, 2025 Two papers accepted at NAACL 2025
Jun 01, 2024 Co-Organizing SemEval-2025 Shared Task on Emotion Detection
Jan 18, 2024 Joining Imperial College London as a Google DeepMind Academic Fellow

latest posts

selected publications (see Google Scholar for an up-to-date list)

  1. BRIGHTER: BRIdging the Gap in Human-Annotated Textual Emotion Recognition Datasets for 28 Languages
    Shamsuddeen Hassan Muhammad, Nedjma Ousidhoum, Idris Abdulmumin, and 8 more authors
    2025
  2. SemEval-2025 Task 11: Bridging the Gap in Text-Based Emotion Detection
    Shamsuddeen Hassan Muhammad, Nedjma Ousidhoum, Idris Abdulmumin, and 8 more authors
    2025
  3. AfriHate: A Multilingual Collection of Hate Speech and Abusive Language Datasets for African Languages
    Shamsuddeen Hassan Muhammad, Idris Abdulmumin, Abinew Ali Ayele, and 24 more authors
    2025
  4. HausaNLP: Current Status, Challenges and Future Directions for Hausa Natural Language Processing
    Shamsuddeen Hassan Muhammad, Ibrahim Said Ahmad, Idris Abdulmumin, and 8 more authors
    2025
  5. INJONGO: A Multicultural Intent Detection and Slot-filling Dataset for 16 African Languages
    Hao Yu, Jesujoba O Alabi, Andiswa Bukula, and 8 more authors
    ACL 2025, 2025
  6. The State of Large Language Models for African Languages: Progress and Challenges
    Kedir Yassin Hussen, Walelign Tewabe Sewunetie, Abinew Ali Ayele, and 3 more authors
    2025
  7. POLAR: A Benchmark for Multilingual, Multicultural, and Multi-Event Online Polarization
    Usman Naseem, Juan Ren, Saba Anwar, and 14 more authors
    2025
  8. AfroXLMR-Social: Adapting Pre-trained Language Models for African Languages Social Media Text
    Tadesse Destaw Belay, Israel Abebe Azime, Ibrahim Said Ahmad, and 4 more authors
    ArXiv, 2025
  9. Uhura: A Benchmark for Evaluating Scientific Question Answering and Truthfulness in Low-Resource African Languages
    Edward Bayes, Israel Abebe Azime, Jesujoba Oluwadara Alabi, and 11 more authors
    ArXiv, 2024
  10. Correcting FLORES Evaluation Dataset for Four African Languages
    Idris Abdulmumin, Sthembiso Mkhwanazi, Mahlatse S Mbooi, and 7 more authors
    In Conference on Machine Translation, 2024
  11. BLEnD: A Benchmark for LLMs on Everyday Knowledge in Diverse Cultures and Languages
    Junho Myung, Nayeon Lee, Yi Zhou, and 19 more authors
    ArXiv, 2024