Synapse - Africa’s 4IR Trade & Innovation Magazine - 2nd Quarter 2021 Issue 12

Page 16

NEWS

INSTADEEP,

iCompass partner on TunBERT - First AI-based Tunisian Dialect System InstaDeep, an AI startup founded by Tunisian co-founders Karim Beguir and Zohra Slim, and Tunis-based startup iCompass in March jointly revealed a collaborative Natural Language Processing (NLP) project that will lead to the development of a language model for Tunisian dialect, TunBERT. The project will evaluate TunBERT on several tasks such as sentiment analysis, dialect classification, reading, comprehension, and question answering. The partnership aims to apply the latest advances in AI and Machine Learning (ML) to explore and strengthen research in the fast-emerging Tunisian AI tech ecosystem. “We’re excited to reveal TunBERT, a joint research project between iCompass and InstaDeep that redefines state-of-the-art for the Tunisian dialect. This work also highlights the positive results that are achieved when leading AI startups collaborate, benefiting the

Tunisian tech ecosystem as a whole,” said InstaDeep CEO Karim Beguir. Bidirectional Encoder Representations from Transformers (BERT) has become a state-of-the-art model for language understanding. With its success, available models have been trained on Indo-European languages such as English, French, German etc., but similar research for underrepresented languages remains sparse and in its early stage. Along with jointly writing and de-bugging the code, iCompass and InstaDeep’s research engineers have launched multiple successful experiments. iCompass CTO & co-founder Dr Hatem Haddad explained that the collaboration aims to push forward and advance the development of AI research in the emerging and prominent field of NLP and language models. “Our ultimate goal is to empower Tunisian talent and foster an environment where AI innovation can grow, and together our teams are pushing boundaries” said Dr Haddad.

TunBERT is developed based on NVIDIA’s NeMo toolkit which the research team used to adapt and fine-tune the neural network on relevant data to pre-train the language model on a large-scale Tunisian corpus, taking advantage of the BERT model that was optimised by NVIDIA. TunBERT’s pre-training and fine-tuning steps converged faster and in a distributed and optimised way thanks to the use of multiple NVIDIA V100 GPUs. This implementation provided more efficient training using Tensor Core mixed precision capabilities and the NeMo Toolkit. Through this approach, the contextualised text representation models learned an effective embedding of the natural language, making it machine-understandable and achieving tremendous performance results. Comparing the NVIDIA-optimised BERT model results to the original BERT implementation shows that the NVIDIA-optimised BERT-model performs better on the different downstream tasks, while using the same compute power.

INNOVATION

FIRST FON TO FRENCH

Neural Machine Translation Engine launched edAI researchers Chris Emezue and Bonaventure Dossou have launched FFRTranslate, the first Neural Machine Translation engine from Fon — a very low-resource and tonal language — to French and vice-versa. Fon shares tonal and analytical similarities with the Niger-Congo languages which include Igbo, Hausa, Yoruba, and Swahili. The engine will promote better communication in Fon and could enable companies to translate texts and messages from Fon to French and vice versa. Dossou described working on the engine as being “an awesome ride”. “We’re thankful to everybody that supported us especially our beloved Masakhane NLP family, and the broader

14

SYNAPSE | 2ND QUARTER 2021

NLP community. We believe that this is a huge step toward empowering African endangered languages,” he said in a LinkedIn post announcing the launch. Other contributors who worked on the project include Fabroni Yoclounon and Ricardo Ahounvlame.Read more about the translation engine in this Synapse Issue 8 article here.

Left to right: Chris Emezue, Bonaventure Dossou


Turn static files into dynamic content formats.

Create a flipbook

Articles inside

RPA: The Next Chapter In The Automation Story

4min
page 29

WizzPass Workspace Booking - Optimise & streamline your workspace management

3min
page 55

Invisio AI Scoops 3rd Place at 2020 SAB Foundation Social Innovation & Disability Empowerment Awards

2min
page 54

AI GONE GLOBAL: Why 20,000+ Developers from Emerging Markets Signed Up for GTC

3min
page 50

Nigerian insurtech startup Curacel raises $450k pre-seed round

1min
page 40

Clevva joins Blue Prism's Digital Exchange

2min
page 40

Smart Africa, Intel partner to build AI capacity building for African policymakers

1min
page 33

WITS, partners release AI-powered Algorithm To Detect SA’s Third COVID-19 Infection Wave

2min
page 32

UNESCO launches AI Needs Assessment Survey in Africa

3min
page 31

UMOJAHACK AFRICA 2021: Over 1 000 students participate in Africa’s largest inter-university hackathon

3min
page 30

ISHANGO, AIMS partnership to connect top African data scientists with international work experiences

2min
page 28

Liquid Telecom rebrands to Liquid Intelligent Technologies

1min
page 28

FROM GARAGE TO GLOBAL: How CompariSure’s conversational AI is driving digitisation within the Insurance industry

3min
pages 26-27

5 Steps To Building A People Analytics Function From The Ground Up

4min
page 25

Willis Re Launches new South Africa Hail Catastrophe Risk Model

3min
page 24

PUTTING AI INTO THE ENGINE ROOM

5min
page 23

First Fon to French Neural Machine Translation Engine launched

1min
page 16

TunBERT: InstaDeep, iCompass announce partnership on 1st AI-based Tunisian Dialect System

2min
page 16

Grassroots NLP community Masakhane wins Wikimedia Foundation Research of the Year Award

1min
page 6

How IBM Wants to Accelerate DX With Latest Breakthroughs in Hybrid Cloud, AI Capabilities

6min
pages 52-53

How the AU, Africa CDC will take On COVID-19 Through AI, Big Data

2min
page 48

DeepMind Establishes Scholarship for Wits Masters Students

2min
pages 49-50

Innovation Factory (Africa) Challenge

1min
page 51

Machine Learning Sandcastles

12min
pages 42-45

Automation 360: Automation Anywhere’s Cloud-Native Platform for Intelligent Automation

4min
pages 46-47

4 Reasons Why You Should Care About AI Governance Now

4min
page 41

Wits Announces Team to Advance AI Research in Africa

3min
page 39

SANRAL explores Machine Learning Applications for Road Safety, Congestion

2min
page 34

Siemens, CSIR partner to boost SA 4IR skills

2min
page 38

Strathmore Study Lays Bare Gender Inequality in African AI Industry

2min
page 35

How Quantum Computing Could Propel Us Light Years Into The Future

4min
pages 36-37

NVIDIA unveils its 1st Data Centre CPU

2min
page 22

NVIDIA Inception: Meet the African Startups Accepted Into the Programme

1min
page 33

How The Pandemic Gave Birth to SA’s latest 4IR SaaS platform

10min
pages 20-21

Meet UCT’s 1st Google Research Scholar Program Recipients

4min
pages 18-19

SA Team Places 2nd at 2021 Imagine Cup Junior Virtual AI Hackathon, Girls Edition

3min
page 17

These 3 African Startups Are Using AI, Data Science to Disrupt Fintech

1min
page 12

Why Kenya’s Ajua acquired AI/ ML fintech startup WayaWaya

3min
page 13

All You Need To Know About The EU’s DIGILOGIC initiative

5min
pages 8-9

Lacuna Fund invests $1m in Datasets for Low Resource African Languages

11min
pages 14-15

Human In The System: Understanding Customer Behaviours with ecosystem.Ai

3min
pages 10-11

Hyperautomation: A Case Study

3min
page 7

AfDB provides $1m Grant for AI-based National Consumer Management Systems

1min
page 6
Issuu converts static files into: digital portfolios, online yearbooks, online catalogs, digital photo albums and more. Sign up and create your flipbook.