Synapse - Africa’s 4IR Trade & Innovation Magazine - 2nd Quarter 2021 Issue 12

Page 14

INVESTMENT

LACUNA FUND INVESTS $1M

in Datasets for Low Resource Languages in Sub-Saharan Africa Lacuna Fund — the world’s first collaborative effort to provide data scientists, researchers and social enterprises in low- and middle-income contexts globally with the resources they need to produce training datasets that address urgent problems in their communities — has invested $1-million in six projects which are creating openly accessible text and speech datasets that will fuel natural language processing (NLP) technologies in 29 languages across Africa.

T

he fund pointed out in a statement in late April that the supported projects will produce text and speech datasets for NLP technologies that will have significant downstream impacts on education, financial inclusion, healthcare, agriculture, communication, and disaster response in Sub-Saharan Africa. Lacuna Fund explained that the funding recipients will produce training datasets in Eastern, Western, and Southern Africa that will support a range of needs for low resource languages, including machine translation, speech recognition, named entity recognition and part of speech tagging, sentiment analysis, and multimodal datasets. All datasets produced will be locally developed and owned, and will be openly accessible to the international data community. “With over 50 impressive applications from, or in partnership with, organisations

12

SYNAPSE | 2ND QUARTER 2021

across Africa, there are many more initiatives poised for impact. This movement towards locally developed and owned datasets has only just begun, and with the right support and funding these initiatives will unlock the power of AI to deliver new social sector solutions and increase the presence of African countries on the international data map,” Lacuna Fund stated. Also commenting in the same statement, ABSA Chair of Data Science at University of Pretoria Vukosi Marivate drew attention to how the South African government has been using chatbots to provide daily updates on COVID. “Right now, translating those updates to Latin languages is really easy, but the datasets necessary to translate those updates to a range of African languages don’t exist, which means that the government isn’t currently able to communicate with many of its people in their native languages. That is one of the many examples of why we need this work now,” explained Marivate.

Meet the recipients Building an Annotated Spoken Corpus for Igbo NLP Tasks: This project addresses the gap in the availability of an Igbo spoken corpus for NLP tasks. Existing corpora—such as the Igbo web Corpus (IgWaC) and literary, religious and grammar texts—are either unannotated or not archived for research and NLP tasks. This study will create an annotated 1000-sentence corpus and 25 hours of unannotated audio data to launch an open access spoken corpus that would be available for research and NLP tasks. Data will be gathered from oral narratives and live Igbo news. Ethnographic interviews will be used to collect data that covers several domains of the Igbo life such as marriage, religion, language, burial, education, security, and trade. To ensure adequate representation, balance, and homogeneity, data collection will take place in the five south-eastern states where Igbo is predominantly spoken, and the team will recruit 50 different language speakers across the states to provide audio data. Igbo news recordings will be acquired from the Federal Radio Corporation of Nigeria across the five states. Igbo NLP Tasks Project Team member Gerald Nweya from the University of Ibadan said the team is excited to embark on this project due to the impact it will have on the NLP community as it it particularly concerns the Igbo language. “The need to build an annotated corpus of contemporary Igbo is one that is long overdue. It could be very interesting to study the language


Turn static files into dynamic content formats.

Create a flipbook

Articles inside

RPA: The Next Chapter In The Automation Story

4min
page 29

WizzPass Workspace Booking - Optimise & streamline your workspace management

3min
page 55

Invisio AI Scoops 3rd Place at 2020 SAB Foundation Social Innovation & Disability Empowerment Awards

2min
page 54

AI GONE GLOBAL: Why 20,000+ Developers from Emerging Markets Signed Up for GTC

3min
page 50

Nigerian insurtech startup Curacel raises $450k pre-seed round

1min
page 40

Clevva joins Blue Prism's Digital Exchange

2min
page 40

Smart Africa, Intel partner to build AI capacity building for African policymakers

1min
page 33

WITS, partners release AI-powered Algorithm To Detect SA’s Third COVID-19 Infection Wave

2min
page 32

UNESCO launches AI Needs Assessment Survey in Africa

3min
page 31

UMOJAHACK AFRICA 2021: Over 1 000 students participate in Africa’s largest inter-university hackathon

3min
page 30

ISHANGO, AIMS partnership to connect top African data scientists with international work experiences

2min
page 28

Liquid Telecom rebrands to Liquid Intelligent Technologies

1min
page 28

FROM GARAGE TO GLOBAL: How CompariSure’s conversational AI is driving digitisation within the Insurance industry

3min
pages 26-27

5 Steps To Building A People Analytics Function From The Ground Up

4min
page 25

Willis Re Launches new South Africa Hail Catastrophe Risk Model

3min
page 24

PUTTING AI INTO THE ENGINE ROOM

5min
page 23

First Fon to French Neural Machine Translation Engine launched

1min
page 16

TunBERT: InstaDeep, iCompass announce partnership on 1st AI-based Tunisian Dialect System

2min
page 16

Grassroots NLP community Masakhane wins Wikimedia Foundation Research of the Year Award

1min
page 6

How IBM Wants to Accelerate DX With Latest Breakthroughs in Hybrid Cloud, AI Capabilities

6min
pages 52-53

How the AU, Africa CDC will take On COVID-19 Through AI, Big Data

2min
page 48

DeepMind Establishes Scholarship for Wits Masters Students

2min
pages 49-50

Innovation Factory (Africa) Challenge

1min
page 51

Machine Learning Sandcastles

12min
pages 42-45

Automation 360: Automation Anywhere’s Cloud-Native Platform for Intelligent Automation

4min
pages 46-47

4 Reasons Why You Should Care About AI Governance Now

4min
page 41

Wits Announces Team to Advance AI Research in Africa

3min
page 39

SANRAL explores Machine Learning Applications for Road Safety, Congestion

2min
page 34

Siemens, CSIR partner to boost SA 4IR skills

2min
page 38

Strathmore Study Lays Bare Gender Inequality in African AI Industry

2min
page 35

How Quantum Computing Could Propel Us Light Years Into The Future

4min
pages 36-37

NVIDIA unveils its 1st Data Centre CPU

2min
page 22

NVIDIA Inception: Meet the African Startups Accepted Into the Programme

1min
page 33

How The Pandemic Gave Birth to SA’s latest 4IR SaaS platform

10min
pages 20-21

Meet UCT’s 1st Google Research Scholar Program Recipients

4min
pages 18-19

SA Team Places 2nd at 2021 Imagine Cup Junior Virtual AI Hackathon, Girls Edition

3min
page 17

These 3 African Startups Are Using AI, Data Science to Disrupt Fintech

1min
page 12

Why Kenya’s Ajua acquired AI/ ML fintech startup WayaWaya

3min
page 13

All You Need To Know About The EU’s DIGILOGIC initiative

5min
pages 8-9

Lacuna Fund invests $1m in Datasets for Low Resource African Languages

11min
pages 14-15

Human In The System: Understanding Customer Behaviours with ecosystem.Ai

3min
pages 10-11

Hyperautomation: A Case Study

3min
page 7

AfDB provides $1m Grant for AI-based National Consumer Management Systems

1min
page 6
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.