Luel Logo
Luel Logo
HomeFor ContributorsFor Enterprises
Datasets/Speech
Speech Datasets

Speech datasets for AI training

High-quality speech datasets for ASR, TTS, and voice AI training.

Browse categories

Show allSpeechSensorVideo

23 results in Speech

Tip: use search + filters to narrow down quickly.

Request samples or a custom build
SpeechCustom

Japanese Conversational Speech

Multi-speaker Japanese dialogue with stereo speaker separation and emotion annotations

Languages1

Languages

Japanese
  • Stereo speaker separation: L/R channel isolation for perfect speaker extraction
  • High-density emotion annotations per utterance
  • Per-utterance emotion labels with confidence scores
  • +4 more highlights
Explore dataset
SpeechEnterprise

English Conversational Speech

Stereo multi-speaker dialogue recordings with L/R speaker separation and emotion annotations

Languages1

Languages

English
  • Stereo speaker separation: L/R channel isolation for perfect speaker extraction
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
  • +4 more highlights
Explore dataset
SpeechEnterprise

French Conversational Speech

Stereo multi-speaker French dialogue recordings with L/R speaker separation and emotion annotations

Languages1

Languages

French
  • Stereo speaker separation: L/R channel isolation for perfect speaker extraction
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories including Anger, Doubt, Excitement, Determination
  • +4 more highlights
Explore dataset
SpeechEnterprise

German Conversational Speech

Stereo multi-speaker German dialogue recordings with L/R speaker separation and emotion annotations

Languages1

Languages

German
  • Stereo speaker separation: L/R channel isolation for perfect speaker extraction
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
  • +4 more highlights
Explore dataset
SpeechEnterprise

English Monologue Speech

Professional single-speaker recordings with word-level timestamps and emotion annotations

Languages1

Languages

English
  • Word-level timestamps for each utterance
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
  • +4 more highlights
Explore dataset
SpeechEnterprise

French Monologue Speech

Professional single-speaker French recordings with word-level timestamps and emotion annotations

Languages1

Languages

French
  • Word-level timestamps for precise alignment
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
  • +4 more highlights
Explore dataset
SpeechEnterprise

German Monologue Speech

Professional single-speaker German recordings with word-level timestamps and emotion annotations

Languages1

Languages

German
  • Word-level timestamps for precise alignment
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
  • +4 more highlights
Explore dataset
SpeechEnterprise

Japanese Monologue Speech

Professional single-speaker Japanese recordings with word-level timestamps and emotion annotations

Languages1

Languages

Japanese
  • Word-level timestamps for precise alignment
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Interest, Confusion, Amusement, Calmness, and more
  • +4 more highlights
Explore dataset
SpeechCustom

Doctor-Patient Consultation

Clinical consultation dialogues between doctors and patients

Languages2

Languages

EnglishUrdu
  • Fully transcribed clinical dialogues
  • Diverse hospital settings: surgeons, endocrinologists, cardiologists, neurologists, etc.
  • Realistic clinical dialogue patterns
  • +2 more highlights
Explore dataset
SpeechCustom

Telugu Expressive TTS Voice

Natural Telugu speech recordings from native speakers across major regions

Languages1

Languages

Telugu
  • Fully transcribed with phoneme-level alignment
  • Native Telugu speakers across major regions
  • Comprehensive emotion and style coverage
  • +2 more highlights
Explore dataset
SpeechCustom

Spanish Finance Conversation

Customer service conversations in finance & banking contexts

Languages1Clips9,000+

Languages

Spanish
  • Dual-channel recording with clear speaker separation
  • Fully transcribed with speaker diarization
  • Multiple conversation types and scenarios
  • +2 more highlights
Explore dataset
SpeechCustom

Nighttime Traffic Audio Narrations

Urban audio narrations with ambient noise profiling

Languages1

Languages

English
  • Fully transcribed narrations
  • Real-world urban noise environments
  • Diverse noise profiles and locations
  • +2 more highlights
Explore dataset
SpeechCustom

Spanish-English Contact Center ASR

Bilingual Spanish-English contact center conversations

Languages2

Languages

SpanishEnglish
  • Fully transcribed bilingual conversations
  • Bilingual Spanish-English conversations
  • Dual-channel recordings with speaker separation
  • +2 more highlights
Explore dataset
SpeechCustom

Hindi Monologue Speech

Professional single-speaker Hindi recordings with word-level timestamps and emotion annotations

Languages1

Languages

Hindi
  • Word-level timestamps for precise alignment
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
  • +4 more highlights
Explore dataset
SpeechCustom

Hindi Conversational Speech

Stereo multi-speaker Hindi dialogue recordings with L/R speaker separation and emotion annotations

Languages1

Languages

Hindi
  • Stereo speaker separation: L/R channel isolation for perfect speaker extraction
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories for comprehensive sentiment coverage
  • +4 more highlights
Explore dataset
SpeechCustom

Tamil Monologue Speech

Professional single-speaker Tamil recordings with word-level timestamps and emotion annotations

Languages1

Languages

Tamil
  • Word-level timestamps for precise alignment
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
  • +4 more highlights
Explore dataset
SpeechCustom

Tamil Conversational Speech

Stereo multi-speaker Tamil dialogue recordings with L/R speaker separation and emotion annotations

Languages1

Languages

Tamil
  • Stereo speaker separation: L/R channel isolation for perfect speaker extraction
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories for comprehensive sentiment coverage
  • +4 more highlights
Explore dataset
SpeechCustom

Marathi Monologue Speech

Professional single-speaker Marathi recordings with word-level timestamps and emotion annotations

Languages1

Languages

Marathi
  • Word-level timestamps for precise alignment
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories: Joy, Determination, Interest, Calmness, Confusion, and more
  • +4 more highlights
Explore dataset
SpeechCustom

Marathi Conversational Speech

Stereo multi-speaker Marathi dialogue recordings with L/R speaker separation and emotion annotations

Languages1

Languages

Marathi
  • Stereo speaker separation: L/R channel isolation for perfect speaker extraction
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories for comprehensive sentiment coverage
  • +4 more highlights
Explore dataset
SpeechCustom

Spanish Conversational Speech

Stereo multi-speaker Spanish dialogue recordings with L/R speaker separation and emotion annotations

Languages1

Languages

Spanish
  • Stereo speaker separation: L/R channel isolation for perfect speaker extraction
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories for comprehensive sentiment coverage
  • +4 more highlights
Explore dataset
SpeechCustom

Telugu Conversational Speech

Stereo multi-speaker Telugu dialogue recordings with speaker diarization and emotion annotations

Languages1

Languages

Telugu
  • Stereo speaker separation: L/R channel isolation for perfect speaker extraction
  • Per-utterance emotion labels with confidence scores
  • 18 emotion categories for comprehensive sentiment coverage
  • +4 more highlights
Explore dataset
SpeechCustom

Spanish Customer Support Conversations

Stereo role-play customer service dialogues in Spanish with L/R speaker separation

Languages1

Languages

Spanish
  • Stereo speaker separation: agent on one channel, customer on the other
  • Structured customer support scenarios with intent labels
  • Hotel cancellation, reservation changes, billing, and complaint dialogues
  • +4 more highlights
Explore dataset
SpeechEnterprise

Chinese Mandarin Speech

Professional Mandarin Chinese recordings for ASR, TTS, and voice AI training

Languages1

Languages

Chinese Mandarin
  • Native Mandarin speakers with regional accent diversity
  • Dual recording conditions: studio-quality and natural ambient
  • Word-level timestamps for precise alignment
  • +4 more highlights
Explore dataset
MarketplaceOpen to All

Your Dataset Here

License your multimodal dataset through the Luel catalog for global visibility and enterprise-grade trust.

  • Rights-cleared licensing with full legal protection
  • Enterprise visibility to verified AI buyers worldwide
  • Luel quality badge & fidelity verification included
List your dataset

Need a custom
speech dataset?

We can build custom collections tailored to your specific requirements, languages, and use cases.

Talk with our team
Luel Logo

The leading AI training data marketplace. Connect companies with contributors to create high-quality datasets.

For Contributors

Start ContributingUpload ContentView EarningsMy Submissions

For Enterprise

View CatalogBrowse DatasetsRequest a Custom DatasetUpload Your Dataset

Company

CommunitySupportPrivacy PolicyTerms of Service

© 2026 Luel. All rights reserved.

Built for the future of AI training data