Speech dataset

English speech dataset

This dataset contains high-quality South African English conversational speech for ASR training, benchmarking, and multilingual AI development. It focuses on real-world, contact-centre-style interactions, balanced speaker demographics, and diverse regional accents to support production-ready speech recognition systems.

Key details

Hours available: 50
Speakers: 63
Download size: 38 GB
Audio format: WAV
Accents: South African English
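
Because the audio ships as WAV, the basic properties of a file can be read with Python's standard `wave` module before committing to a full training run. The following is a minimal sketch, not a description of how the dataset is packaged. The `describe_wav` and `write_silent_wav` helpers are hypothetical, and the sample rate, bit depth, and channel count of the dataset are not stated on this page, so the demo generates its own silent file with assumed values (mono, 16-bit, 16 kHz). Note that `wave` handles uncompressed PCM WAV only.

```python
import os
import tempfile
import wave


def describe_wav(path: str) -> dict:
    """Read header fields from a PCM WAV file (hypothetical helper)."""
    with wave.open(path, "rb") as wav_file:
        frame_rate = wav_file.getframerate()
        frame_count = wav_file.getnframes()
        return {
            "channels": wav_file.getnchannels(),
            "sample_width_bytes": wav_file.getsampwidth(),
            "sample_rate_hz": frame_rate,
            "duration_seconds": frame_count / frame_rate,
        }


def write_silent_wav(path: str, seconds: int, sample_rate: int) -> None:
    """Create a mono 16-bit silent WAV so the sketch runs without dataset files."""
    with wave.open(path, "wb") as wav_file:
        wav_file.setnchannels(1)   # assumed: mono
        wav_file.setsampwidth(2)   # assumed: 16-bit samples
        wav_file.setframerate(sample_rate)
        wav_file.writeframes(b"\x00\x00" * (seconds * sample_rate))


if __name__ == "__main__":
    with tempfile.TemporaryDirectory() as tmp_dir:
        # "demo.wav" is a placeholder, not a file name from the dataset.
        demo_path = os.path.join(tmp_dir, "demo.wav")
        write_silent_wav(demo_path, seconds=2, sample_rate=16000)
        print(describe_wav(demo_path))
```

Pointing `describe_wav` at a purchased recording instead of the generated file would report that recording's actual header values.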

Dataset details

Hours available: 50
Age range: 18 – 69
Download size: 38 GB
Number of speakers: 63
Audio format: WAV
Accents: South African English

Dataset demographics

Age range distribution

Recorders per age group

18 – 29: 27 recorders
30 – 40: 28 recorders
50 – 69: 8 recorders

Gender split across recorded hours

Recorders per gender

Women: 28 recorders
Men: 35 recorders

Hours collected across domains

Runtime per domain (hh:mm:ss)

Retail 12:27:41
Debt Collection 12:22:25
Insurance 12:21:50
Travel 12:50:26
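
As a quick sanity check, the per-domain runtimes above can be summed and compared with the 50 hours listed for the dataset. The sketch below is a minimal illustration in Python. The `DOMAIN_RUNTIMES` name and the `parse_runtime` and `total_runtime` helpers are hypothetical, and the figures are copied from the list above on the assumption that they are in hh:mm:ss format.

```python
from datetime import timedelta

# Per-domain runtimes copied from the listing above (assumed to be hh:mm:ss).
DOMAIN_RUNTIMES = {
    "Retail": "12:27:41",
    "Debt Collection": "12:22:25",
    "Insurance": "12:21:50",
    "Travel": "12:50:26",
}


def parse_runtime(text: str) -> timedelta:
    """Convert an 'hh:mm:ss' string into a timedelta (hypothetical helper)."""
    hours, minutes, seconds = (int(part) for part in text.split(":"))
    return timedelta(hours=hours, minutes=minutes, seconds=seconds)


def total_runtime(runtimes: dict[str, str]) -> timedelta:
    """Sum all per-domain runtimes into a single timedelta."""
    return sum((parse_runtime(value) for value in runtimes.values()), timedelta())


if __name__ == "__main__":
    total = total_runtime(DOMAIN_RUNTIMES)
    print(f"Total runtime: {total.total_seconds() / 3600:.2f} hours")
```

The four domains add up to 50:02:22, which is consistent with the 50 hours listed under the dataset details.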

Additional information

How are dataset recordings structured?

Our off-the-shelf dataset collections comprise unscripted, natural conversations conducted by call recorders who have been recruited, trained, and approved to simulate real-world conversations in common domains. Recordings and transcripts include routine security verifications such as ID, email, and phone number validation.

How do you recruit for speech collection datasets?

Our priority is to create datasets that are unbiased and cover as wide a range of demographics as possible. This is our first consideration when we plan and recruit for any speech collection project.

What kind of agreement is in place for the purchase of this dataset?

A Licence Agreement governs the sale and usage of this speech collection dataset. Our off-the-shelf options let clients test and benchmark before considering larger, custom commitments that are better suited to their requirements and conventions.

Need a different dataset?

We can design and deliver bespoke speech collections for your languages, domains, and scale. Tell us what you need and we'll get back within 1–2 business days.
