Synapse - Africa’s 4IR Trade & Innovation Magazine - 2nd Quarter 2021 Issue 12 by AI Media Group

Synapse - Africa’s 4IR Trade & Innovation Magazine - 2nd Quarter 2021 Issue 12

INVESTMENT

LACUNA FUND INVESTS $1M

in Datasets for Low Resource Languages in Sub-Saharan Africa Lacuna Fund — the world’s first collaborative effort to provide data scientists, researchers and social enterprises in low- and middle-income contexts globally with the resources they need to produce training datasets that address urgent problems in their communities — has invested $1-million in six projects which are creating openly accessible text and speech datasets that will fuel natural language processing (NLP) technologies in 29 languages across Africa.

he fund pointed out in a statement in late April that the supported projects will produce text and speech datasets for NLP technologies that will have significant downstream impacts on education, financial inclusion, healthcare, agriculture, communication, and disaster response in Sub-Saharan Africa. Lacuna Fund explained that the funding recipients will produce training datasets in Eastern, Western, and Southern Africa that will support a range of needs for low resource languages, including machine translation, speech recognition, named entity recognition and part of speech tagging, sentiment analysis, and multimodal datasets. All datasets produced will be locally developed and owned, and will be openly accessible to the international data community. “With over 50 impressive applications from, or in partnership with, organisations

SYNAPSE | 2ND QUARTER 2021

across Africa, there are many more initiatives poised for impact. This movement towards locally developed and owned datasets has only just begun, and with the right support and funding these initiatives will unlock the power of AI to deliver new social sector solutions and increase the presence of African countries on the international data map,” Lacuna Fund stated. Also commenting in the same statement, ABSA Chair of Data Science at University of Pretoria Vukosi Marivate drew attention to how the South African government has been using chatbots to provide daily updates on COVID. “Right now, translating those updates to Latin languages is really easy, but the datasets necessary to translate those updates to a range of African languages don’t exist, which means that the government isn’t currently able to communicate with many of its people in their native languages. That is one of the many examples of why we need this work now,” explained Marivate.

Meet the recipients Building an Annotated Spoken Corpus for Igbo NLP Tasks: This project addresses the gap in the availability of an Igbo spoken corpus for NLP tasks. Existing corpora—such as the Igbo web Corpus (IgWaC) and literary, religious and grammar texts—are either unannotated or not archived for research and NLP tasks. This study will create an annotated 1000-sentence corpus and 25 hours of unannotated audio data to launch an open access spoken corpus that would be available for research and NLP tasks. Data will be gathered from oral narratives and live Igbo news. Ethnographic interviews will be used to collect data that covers several domains of the Igbo life such as marriage, religion, language, burial, education, security, and trade. To ensure adequate representation, balance, and homogeneity, data collection will take place in the five south-eastern states where Igbo is predominantly spoken, and the team will recruit 50 different language speakers across the states to provide audio data. Igbo news recordings will be acquired from the Federal Radio Corporation of Nigeria across the five states. Igbo NLP Tasks Project Team member Gerald Nweya from the University of Ibadan said the team is excited to embark on this project due to the impact it will have on the NLP community as it it particularly concerns the Igbo language. “The need to build an annotated corpus of contemporary Igbo is one that is long overdue. It could be very interesting to study the language

Synapse - Africa’s 4IR Trade & Innovation Magazine - 2nd Quarter 2021 Issue 12

Articles inside

RPA: The Next Chapter In The Automation Story

WizzPass Workspace Booking - Optimise & streamline your workspace management

Invisio AI Scoops 3rd Place at 2020 SAB Foundation Social Innovation & Disability Empowerment Awards

AI GONE GLOBAL: Why 20,000+ Developers from Emerging Markets Signed Up for GTC

Nigerian insurtech startup Curacel raises $450k pre-seed round

Clevva joins Blue Prism's Digital Exchange

Smart Africa, Intel partner to build AI capacity building for African policymakers

WITS, partners release AI-powered Algorithm To Detect SA’s Third COVID-19 Infection Wave

UNESCO launches AI Needs Assessment Survey in Africa

UMOJAHACK AFRICA 2021: Over 1 000 students participate in Africa’s largest inter-university hackathon

ISHANGO, AIMS partnership to connect top African data scientists with international work experiences

Liquid Telecom rebrands to Liquid Intelligent Technologies

FROM GARAGE TO GLOBAL: How CompariSure’s conversational AI is driving digitisation within the Insurance industry

5 Steps To Building A People Analytics Function From The Ground Up

Willis Re Launches new South Africa Hail Catastrophe Risk Model

PUTTING AI INTO THE ENGINE ROOM

First Fon to French Neural Machine Translation Engine launched

TunBERT: InstaDeep, iCompass announce partnership on 1st AI-based Tunisian Dialect System

Grassroots NLP community Masakhane wins Wikimedia Foundation Research of the Year Award

How IBM Wants to Accelerate DX With Latest Breakthroughs in Hybrid Cloud, AI Capabilities

How the AU, Africa CDC will take On COVID-19 Through AI, Big Data

DeepMind Establishes Scholarship for Wits Masters Students

Innovation Factory (Africa) Challenge

Machine Learning Sandcastles

Automation 360: Automation Anywhere’s Cloud-Native Platform for Intelligent Automation

4 Reasons Why You Should Care About AI Governance Now

Wits Announces Team to Advance AI Research in Africa

SANRAL explores Machine Learning Applications for Road Safety, Congestion

Siemens, CSIR partner to boost SA 4IR skills

Strathmore Study Lays Bare Gender Inequality in African AI Industry

How Quantum Computing Could Propel Us Light Years Into The Future

NVIDIA unveils its 1st Data Centre CPU

NVIDIA Inception: Meet the African Startups Accepted Into the Programme

How The Pandemic Gave Birth to SA’s latest 4IR SaaS platform

Meet UCT’s 1st Google Research Scholar Program Recipients

SA Team Places 2nd at 2021 Imagine Cup Junior Virtual AI Hackathon, Girls Edition

These 3 African Startups Are Using AI, Data Science to Disrupt Fintech

Why Kenya’s Ajua acquired AI/ ML fintech startup WayaWaya

All You Need To Know About The EU’s DIGILOGIC initiative

Lacuna Fund invests $1m in Datasets for Low Resource African Languages

Human In The System: Understanding Customer Behaviours with ecosystem.Ai

Hyperautomation: A Case Study

AfDB provides $1m Grant for AI-based National Consumer Management Systems