---
title: "Somali Speech Dataset – Kenyan ANV ASR Data (Free) | Way With Words"
description: "Somali speech from the Kenyan African Next Voices collection on Hugging Face—complements other Somali ASR resources."
image: "https://waywithwords.ai/og-default.png"
---

African Next Voices

# Somali speech dataset

The African Next Voices (ANV) data collection in Kenya (KenCorpus Consortium, Gates Foundation) covers scripted and unscripted speech across multiple domains, collected through ethical, community-led processes. This configuration contains Somali as collected in Kenya, which is distinct from the other ANV geography tracks. The data is released under CC BY 4.0. Check the dataset card on Hugging Face for the latest splits, hours, transcription coverage, license, and attribution requirements.

Looking for more options? Browse the [full African speech datasets catalog](/datasets) or see our [community-centric data licensing framework](/esethu).

## Key details

| Field | Value |
| --- | --- |
| Hours available | 502 |
| Speakers | Not listed (see dataset card) |
| Access | Hugging Face |
| Audio format | WAV (per dataset card) |
| Accents | Kenyan Somali (Maxatire) |

[Get dataset on Hugging Face →](https://huggingface.co/datasets/Anv-ke/Somali)

## Dataset details

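| Field | Value |
| --- | --- |
| Hours available | 502 |
| Age range | 18 to 60+ |
| Download size | Not listed (see Hugging Face) |
| Number of speakers | Not listed (see dataset card) |
| Audio format | WAV (per dataset card) |
| Accents | Kenyan Somali (Maxatire) |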
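
The listing gives no download size. As a rough planning aid only, the sketch below estimates the uncompressed footprint of 502 hours of WAV audio. The 16 kHz, 16-bit, mono PCM parameters are assumptions, not values from the dataset card, and the helper name `pcm_wav_bytes` is illustrative. Substitute the real sample rate and bit depth from the card before relying on the result.

```python
def pcm_wav_bytes(hours, sample_rate_hz=16_000, bits_per_sample=16, channels=1):
    """Estimate raw PCM payload size in bytes (per-file WAV headers ignored).

    The defaults are ASSUMPTIONS for illustration, not dataset facts.
    """
    seconds = hours * 3600
    bytes_per_sample = bits_per_sample // 8
    return int(seconds * sample_rate_hz * bytes_per_sample * channels)


size = pcm_wav_bytes(502)  # 502 hours, as stated in this listing
print(f"{size / 1e9:.1f} GB")  # prints "57.8 GB" under the assumed parameters
```

Hosted files may be compressed or sharded differently, so the actual download can be smaller or larger than this estimate.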

## Additional information

### African Next Voices — Kenya

This listing points to the African Next Voices collection in Kenya (KenCorpus Consortium, Gates Foundation): scripted and unscripted speech collected through community-led processes, with one dataset repo per language under the [Anv-ke](https://huggingface.co/Anv-ke) organization on Hugging Face. The public cards describe domains, splits, transcription coverage, and ethical use. Treat releases as work in progress and follow the CC BY 4.0 attribution instructions on the dataset card.

## More languages & resources

Open the Hugging Face dataset card for this language for loading instructions, columns, and the latest statistics. The [Anv-ke](https://huggingface.co/Anv-ke) organization lists sibling repos (Dholuo, Kikuyu, Somali, Kalenjin, Maasai). Use only as permitted on the card (research and ASR-related development; no surveillance or unethical profiling).
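
For orientation, here is a minimal sketch of peeking at a few records with the Hugging Face `datasets` library. It assumes the repo id `Anv-ke/Somali` (taken from the link on this page), a split named `train`, and that your environment is already authorized if the repo requires it. The helper names `repo_id`, `dataset_url`, and `peek` are illustrative. The dataset card remains the authoritative source for split names and columns.

```python
"""Sketch: stream a few examples from an ANV Kenya repo on Hugging Face."""
from itertools import islice

HF_ORG = "Anv-ke"  # organization named in this listing


def repo_id(language: str) -> str:
    """Build the '<org>/<language>' repo id, e.g. 'Anv-ke/Somali'."""
    return f"{HF_ORG}/{language.strip()}"


def dataset_url(language: str) -> str:
    """Return the dataset card URL for a given language repo."""
    return f"https://huggingface.co/datasets/{repo_id(language)}"


def peek(language: str = "Somali", split: str = "train", n: int = 3):
    """Return the first n examples without downloading the full dataset.

    ASSUMPTION: the split is called 'train'; check the dataset card.
    """
    # Imported lazily so the helpers above work without the library installed.
    from datasets import load_dataset

    stream = load_dataset(repo_id(language), split=split, streaming=True)
    return list(islice(stream, n))


if __name__ == "__main__":
    print(dataset_url("Somali"))
    for example in peek():
        # Print column names only; the actual columns are defined by the card.
        print(sorted(example.keys()))
```

Streaming avoids pulling the whole collection just to inspect its columns, which is useful given the number of hours listed above.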

[Open on Hugging Face →](https://huggingface.co/datasets/Anv-ke/Somali) [Back to all datasets](/datasets)
