ML Data Ocean has accumulated 10,000 hours of voice data in 50+ languages and dialects, 3.5 million pieces of image and video data containing 100,000 people, and 4.5 TB of text data.

Speech Recognition Datasets

Language

Polish accent English

Population

Poland

Collection Environment

High-fidelity recording (silent environment)

Collection Diversity

ASR

Collection Device

Format

.wav

Application Scenarios