Logo
find AI list
TasksToolsCompareWorkflows
Submit ToolSubmit
Log in
Logo
find AI list

Search by task, compare top tools, and use proven workflows to choose the right AI tool faster.

Platform

  • Tasks
  • Tools
  • Compare
  • Alternatives
  • Workflows
  • Reports
  • Best Tools by Persona
  • Best Tools by Role
  • Stacks
  • Models
  • Agents
  • AI News

Company

  • About
  • Blog
  • FAQ
  • Contact
  • Editorial Policy
  • Privacy
  • Terms

Contribute

  • Submit Tool
  • Manage Tool
  • Request Tool

Stay Updated

Get new tools, workflows, and AI updates in your inbox.

© 2026 findAIList. All rights reserved.

Privacy PolicyTerms of ServiceEditorial PolicyRefund Policy
Home/Tasks/VCTK Dataset
VCTK Dataset logo

VCTK Dataset

Visit Website

Quick Tool Decision

Should you use VCTK Dataset?

The VCTK Corpus provides diverse English speech data from 110 speakers, ideal for voice cloning and speech synthesis research.

Category

Data & ML

Data confidence: release and verification fields are source-audited when available; other summary fields are community-aggregated.

Visit Tool WebsiteOpen Detailed Profile
OverviewFAQPricingAlternativesReviews

Overview

The VCTK Corpus, also known as the CSTR VCTK Corpus, is a collection of speech data from 110 English speakers with varied accents. Each speaker recorded approximately 400 sentences, sourced from newspapers, a rainbow passage, and an elicitation paragraph. This diverse dataset is designed to support research in text-to-speech synthesis, particularly speaker-adaptive methods and neural waveform modeling. The recordings, captured using high-quality microphones in a hemi-anechoic chamber, are processed to 16 bits and downsampled to 48 kHz. The corpus includes transcript files for most speakers, facilitating alignment and training. It is particularly useful for training HMM-based and DNN-based speech synthesis systems, offering a comprehensive resource for advancing voice cloning and speech technology research and development. It was referenced by Google DeepMind in their work on WaveNet.

Common tasks

Training speech synthesis modelsDeveloping voice cloning systemsResearching speaker adaptation techniquesEvaluating text-to-speech algorithmsCreating multi-speaker speech datasetsAnalyzing regional accents in speechExperimenting with neural waveform modeling

FAQ

View all

Full FAQ is available in the detailed profile.

FAQ+-

Full FAQ is available in the detailed profile.

View all

Pricing

View pricing

Pricing varies

Plan-level pricing details are still being validated for this tool.

Pros & Cons

Pros/cons are still being audited for this tool.

Reviews & Ratings

Share your experience, and users can reply directly under each review.

Reviews load as you scroll.
Need advanced specs, integrations, implementation notes, and deeper comparisons? Open the Detailed Profile.

Pricing varies

Model not listed

ReviewsVisit