English speech dataset
High-quality South African English conversational speech data designed for ASR training, benchmarking, and multilingual AI development. This dataset focuses on real-world, contact-centre-style interactions, balanced speaker demographics, and diverse regional accents to support production-ready speech recognition systems.
Key details
- Hours available: 50 hours
- Speakers: 63
- Download size: 38 GB
- Audio format: WAV
- Accents: South African English
Dataset details
- Hours available: 50 hours
- Age range: 18 – 69
- Download size: 38 GB
- Number of speakers: 63
- Audio format: WAV
- Accents: South African English
Dataset demographics
Age range distribution
Recorders per age group
- [18 – 29] 27 Recorders
- [30 – 40] 28 Recorders
- [50 – 69] 8 Recorders
Gender split across recorded hours
Recorders per gender
- Women 28 Recorders
- Men 35 Recorders
Hours collected across domains
Runtime per domain (hh:mm:ss)
- Retail 12:27:41
- Debt Collection 12:22:25
- Insurance 12:21:50
- Travel 12:50:26
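As a quick sanity check, the per-domain runtimes listed above can be summed and compared with the advertised 50 hours. The sketch below is illustrative only and is not part of the dataset or any supplied tooling; the helper names `parse_hms` and `format_hms` are hypothetical, and the runtimes are assumed to be in hh:mm:ss format.

```python
from datetime import timedelta

# Per-domain runtimes copied from the list above (assumed hh:mm:ss).
RUNTIMES = {
    "Retail": "12:27:41",
    "Debt Collection": "12:22:25",
    "Insurance": "12:21:50",
    "Travel": "12:50:26",
}


def parse_hms(text: str) -> timedelta:
    """Convert an 'hh:mm:ss' string into a timedelta (hypothetical helper)."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


def format_hms(duration: timedelta) -> str:
    """Format a timedelta as 'h:mm:ss', allowing more than 24 hours."""
    total_seconds = int(duration.total_seconds())
    hours, remainder = divmod(total_seconds, 3600)
    minutes, seconds = divmod(remainder, 60)
    return f"{hours}:{minutes:02d}:{seconds:02d}"


# Sum all domains, starting from a zero-length timedelta.
total = sum((parse_hms(value) for value in RUNTIMES.values()), timedelta())
print(f"Total runtime: {format_hms(total)}")

# Share of the total runtime contributed by each domain.
for domain, value in RUNTIMES.items():
    share = parse_hms(value) / total  # timedelta / timedelta gives a float
    print(f"{domain}: {share:.1%}")
```

The total comes to 50:02:22, in line with the stated 50 hours, and the four domains each contribute roughly a quarter of the audio.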
Additional information
How are dataset recordings structured?
Our off-the-shelf dataset collections comprise unscripted, natural conversations conducted by call recorders who are recruited, trained, and approved to simulate real-world conversations in common domains. Recordings and transcripts include routine security verifications such as ID, email, and phone number validation.
How do you recruit for speech collection datasets?
Our priority is to create datasets that are unbiased and cover as wide a range of demographics as possible. That is the first consideration when we begin planning and recruiting for any speech collection project.
What kind of agreement is in place for the purchase of this dataset?
A Licence Agreement governs the sale and usage of this speech collection dataset. Our off-the-shelf datasets let clients test and benchmark before considering larger, custom collections that are better suited to their requirements and conventions.
Need a different dataset?
We can design and deliver bespoke speech collections for your languages, domains, and scale. Tell us what you need and we'll get back within 1–2 business days.