Our speech APIs can be used on their own to
create other voice enabled solutions in
Our Speech Tools
Our easy to use APIs allow you to build powerful
speech enabled solutions in African languages.
Text to Speech
The TTS module takes in a text input and converts it to
speech. The speech generated is supposed to be
intelligible and it should sound as natural as possible.
This module is useful in making computer generated
voices. This can be used in dialogue systems like Siri or
voice enabled GPS systems. This module returns an
audio file from a given text input.
English, IsiXhosa, IsiZulu, and Sepedi.
Natural Sounding Voices
Deliver high-quality and natural sounding voices.
Speech to Text
An ASR system is a system that converts speech to text,
or simply, a system that allows machines to “hear”
natural speech. This helps users interact with computer
systems using speech. This module returns a
transcription of a given speech audio file.
Our ASR models have an average
WER of 12%.
This module uses voice recognition to identify and
validate a person. The module matches a speech
phrase to an ID. This can be used in call centres, for
example, to verify the caller’s identity.
Use voice to verify speakers.
Identify individual speakers within a group.