Off-the-shelf datasets for sale

African speech datasets for AI

Fifty-hour conversational speech collections, ready for benchmarking and model training. Each dataset is planned, collected, and annotated with NLP best practice in mind.

Need something different? Talk to us about custom datasets.

Low‑risk evaluation Enterprise licensing Production‑ready delivery Hard‑to‑source languages
Free & open

Africa Next Voices (Swivuriso)

Large-scale multilingual speech dataset for 7 South African languages—over 3,000 hours in total. Built for ASR research and inclusive technologies. Available free on Hugging Face (CC BY 4.0). Way With Words produced the South African component with DSFSI.

isiZulu Free
South Africa Scripted & unscriptedASRMultilingual
503h Hugging Face

Over 500 hours of isiZulu speech from the Swivuriso dataset—scripted and unscripted, first-language speakers—for ASR and inclusive speech technology.

Updated: November 2025 View more
isiXhosa Free
South Africa Scripted & unscriptedASRMultilingual
504h Hugging Face

Over 500 hours of isiXhosa speech from Swivuriso—scripted and unscripted, first-language speakers—for ASR and inclusive speech technology.

Updated: November 2025 View more
Sesotho Free
South Africa Scripted & unscriptedASRMultilingual
504h Hugging Face

Over 500 hours of Sesotho speech from Swivuriso—scripted and unscripted, first-language speakers—for ASR and inclusive speech technology.

Updated: November 2025 View more
Setswana Free
South Africa Scripted & unscriptedASRMultilingual
502h Hugging Face

Over 500 hours of Setswana speech from Swivuriso—scripted and unscripted, first-language speakers—for ASR and inclusive speech technology.

Updated: November 2025 View more
Xitsonga Free
South Africa Scripted & unscriptedASRMultilingual
500h Hugging Face

Over 500 hours of Xitsonga speech from Swivuriso—scripted and unscripted, first-language speakers—for ASR and inclusive speech technology.

Updated: November 2025 View more
Tshivenda Free
South Africa Scripted & unscriptedASRMultilingual
251h Hugging Face

Over 250 hours of Tshivenda speech from Swivuriso—scripted and unscripted, first-language speakers—for ASR and inclusive speech technology.

Updated: November 2025 View more
isiNdebele Free
South Africa Scripted & unscriptedASRMultilingual
252h Hugging Face

Over 250 hours of isiNdebele speech from Swivuriso—scripted and unscripted, first-language speakers—for ASR and inclusive speech technology.

Updated: November 2025 View more

Get the full dataset on Hugging Face — accept the use conditions to access. Not for TTS, voice cloning, or voice synthesis.

Need a custom collection or different languages?

Did you know we started out collecting UK, Australian, Irish and Scottish English data for major data providers?

We can do the same for you, in any language or domain. Just ask.

Talk to us about custom datasets