nurul-oxen/CoVoST-Delta-Segment-12
public
1.4 gb
441K
1 branch
About

CoVoST 2 is a large-scale multilingual speech translation corpus covering translations from 21 languages into English and from English into 15 languages. The dataset was created using Mozilla's open-source Common Voice database of crowdsourced voice recordings.

4 commits
1 contributor
0 downloads
1.4 gb
0 stars
Repository contents
41K audio files > 99%
4 text files < 1%
Contributors
@nurul-oxen