Africa Next Voices

Yoruba speech dataset

This dataset is the Yoruba configuration of the African Voices (ANV) multilingual speech initiative for Nigerian languages. Audio with matching transcriptions is distributed through the African Voices download hub under that project's terms and release schedule. Way With Words contributed to this strand of ANV alongside other partners. The public site advertises about 1900 hours of audio across Hausa, Igbo, Nigerian Pidgin, and Yorùbá; the hours figure listed here is an estimate that scales that total by Yoruba's share of rows in the published Hugging Face export (`9jalingo-*` splits). Splits and attribution change between releases, so confirm the current details on the hub before benchmarking.
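The hours estimate described above can be sketched as a simple proportional scaling. The listed 361 hours out of roughly 1900 corresponds to a share of 19% (361 / 1900 = 0.19). The row counts below are hypothetical placeholders chosen only to reproduce that share, not published figures, and the function name is illustrative. The sketch also assumes that rows are of similar average duration across languages, which may not hold in practice.

```python
def estimate_language_hours(total_hours, rows_by_language, language):
    """Scale an advertised total by one language's share of export rows."""
    total_rows = sum(rows_by_language.values())
    if total_rows <= 0:
        raise ValueError("row counts must sum to a positive number")
    share = rows_by_language[language] / total_rows
    return total_hours * share


# HYPOTHETICAL row counts, for illustration only (not real export statistics).
example_rows = {
    "hausa": 30_000,
    "igbo": 25_000,
    "nigerian_pidgin": 26_000,
    "yoruba": 19_000,
}

# About 1900 hours is the total advertised on the public site.
yoruba_hours = estimate_language_hours(1900, example_rows, "yoruba")
print(round(yoruba_hours))
```

Because the result depends on whichever rows are published at the time, the estimate may shift as new splits are released.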

Key details

Hours available: 361
Speakers: Not reported (listed as 0)
Access: Varies by release
Audio format: WAV (per release)
Accents: Nigerian Yoruba

Dataset details

Hours available: 361
Age range: Varies
Download size: Varies by release
Number of speakers: Not reported (listed as 0)
Audio format: WAV (per release)
Accents: Nigerian Yoruba

Additional information

African Voices (ANV)

This language configuration is part of the African Voices multilingual speech initiative, with downloads and documentation hosted on africanvoices.io. Releases are under active development, so use the latest version and follow the hub's citation and licensing terms when attributing or benchmarking.
