
Advancing Somali-language AI and Natural Language Processing research through pioneering work in datasets, models, and innovation.
Jamhuriya University of Science and Technology (JUST) established the Somali-language AI and Innovation Lab (SAIL) based on a strong belief in AI's transformative potential to enhance healthcare, agriculture, operational efficiency, and service accessibility.
The lab advances Somali-language AI technologies and innovation while laying a strong foundation for Somali Natural Language Processing (SomaliNLP) research, building on years of work that brought Somali-language technology to regional and international research venues.
More than 22 million people are estimated to speak Somali across Somalia, Djibouti, Kenya, Ethiopia, and diaspora communities. Despite this wide usage, Somali remains severely under-resourced in AI research because datasets, annotated corpora, and language models are scarce.

Foundational Text AI
Somali-Dialect ASR
JUST Digital Transformation
Building data, tools, models, and practical NLP solutions that empower the Somali language in the digital and AI age.
Speech-to-Text systems and voice assistants capable of understanding complex Somali phonetics.
Smart learning platforms and AI-powered tutors to personalize the Somali learning experience.
Curating high-quality datasets and building predictive models for socio-economic forecasting.
Transforming research into AI mobile apps and Smart Government solutions for efficiency.
Somali-language NLP / AI
Research Outputs / Papers
Research Funding Applications
Successful Funding Applications
Funding Success
Research Funding Success Rate
Research paper 1
International reviewer
“Peer reviewers emphasised that our work on fake news and toxicity detection makes a significant contribution to combating misinformation and toxicity on Somali social media, addressing critical gaps in online safety for low-resource African languages.”
International reviewer
“Reviewers noted that the SomaBERTa model and its accompanying datasets establish an important benchmark for Somali-language AI, enabling researchers and practitioners to build more advanced models and expand NLP applications in the region.”
International reviewer
“Multiple reviewers described the research as a strong and impactful contribution to African NLP, with well-supported results demonstrating clear improvements over existing multilingual and African language models.”
Research paper 2
International reviewer
“Reviewers also recognised our research on Somali lemmatization corpus development as a foundational contribution to Somali-language NLP, providing the first large-scale, linguistically grounded lemmatization lexicon and annotation platform that fills a critical resource gap for low-resource African languages.”
International reviewer
“Peer reviewers highlighted that the expert-validated lexicon and web-based annotation platform create a sustainable and expandable infrastructure for Somali and other morphologically rich African languages, supporting long-term inclusive AI development.”
International reviewer
“Reviewers emphasised that this resource will strengthen downstream NLP tasks, from POS tagging to information retrieval and hybrid neural-symbolic systems, positioning Somali as a well-supported language in the next generation of African AI technologies.”

The Somali-language AI and Innovation Lab (SAIL), together with the Jamhuriya Center for Graduate Studies (CGS) and CEALT at the University of Djibouti, successfully concluded the research project titled “Somali-Written Fake News on Social Media Using NLP and Deep Learning.”
A formal launch and dissemination ceremony, organized by Jamhuriya University of Science and Technology, to introduce and showcase the SAIL initiative. The event will highlight key achievements, research outcomes, and innovation-driven activities aligned with the university’s mission of advancing education, research, and community impact.




