top of page

ML Data Ocean has accumulated 10,000 hours of voice data in 50+ languages and dialects, 3.5 million pieces of image and video data containing 100,000 people, and 4.5 TB of text data.

Speech Recognition Datasets

Language

Polish accent English

Population

Poland

Collection Environment

High-fidelity recording (silent environment)

Collection Diversity

ASR

Collection Device

​

Format

.wav

Application Scenarios

​

Back
bottom of page