Sourcify
Effortlessly find and manage open-source dependencies for your projects.

A large-scale multilingual speech-to-text translation corpus.

CoVoST (Conversational Voice-to-Speech Translation) is a large-scale, multilingual speech-to-text translation corpus developed by Facebook Research. It addresses the lack of parallel data for end-to-end speech translation (ST) model training. Built upon the Common Voice dataset, CoVoST includes translations from English into 15 languages and from 21 languages into English. The corpus comprises approximately 2,880 hours of speech data from 78,000 speakers. It is designed to foster ST research by providing a diversified, openly licensed dataset. CoVoST facilitates the training of end-to-end ST models, which offer system simplicity, lower inference latency, and reduced compounding errors compared to cascaded ST systems. Data splitting scripts and Fairseq S2T examples are provided to facilitate model training.
CoVoST (Conversational Voice-to-Speech Translation) is a large-scale, multilingual speech-to-text translation corpus developed by Facebook Research.
Explore all tools that specialize in end-to-end model training. This domain focus ensures CoVoST delivers optimized results for this specific requirement.
CoVoST provides a substantial amount of data, covering multiple languages and translation directions, which allows for training robust and generalizable ST models.
Supports direct training of speech-to-text translation models, eliminating the need for intermediate ASR and MT components.
Provides scripts to generate train, development, and test splits from the corpus, ensuring consistent evaluation methodologies.
Leverages the Common Voice dataset, providing a large, diverse, and publicly available source of speech data.
Includes an out-of-domain evaluation set from Tatoeba, allowing for assessment of model performance in real-world scenarios.
Download Common Voice audio clips and transcripts.
Download CoVoST translations.
Generate data splits using the provided script (get_covost_splits.py).
Specify the version, source language, target language, root path, and Common Voice TSV path.
Obtain train, development, and test TSV files.
All Set
Ready to go
Verified feedback from other users.
"CoVoST is highly regarded for its comprehensive multilingual coverage and free availability, enhancing speech-to-text translation research."
Post questions, share tips, and help other users.
Effortlessly find and manage open-source dependencies for your projects.

End-to-end typesafe APIs made easy.

Page speed monitoring with Lighthouse, focusing on user experience metrics and data visualization.

Topcoder is a pioneer in crowdsourcing, connecting businesses with a global talent network to solve technical challenges.

Explore millions of Discord Bots and Discord Apps.

Build internal tools 10x faster with an open-source low-code platform.

Open-source RAG evaluation tool for assessing accuracy, context quality, and latency of RAG systems.

AI-powered synthetic data generation for software and AI development, ensuring compliance and accelerating engineering velocity.