Jun 13, 2020
Beilharz, Benjamin; Sun, Xin, 2019, "LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition", https://doi.org/10.11588/data/TMEDTX, heiDATA, V2
This dataset is a corpus of sentence-aligned triples of German audio, German text, and English translation, based on German audio books. The corpus consists of over 100 hours of audio material and over 50k parallel sentences. The speech data are low in disfluencies because of the...
|
Jun 13, 2020 -
LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition
Gzip Archive - 20.3 GB -
MD5: 9fb23ee878584f4cab717e348cdeeaaf
|
Jun 13, 2020 -
LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition
Gzip Archive - 17.2 GB -
MD5: daf33d0f1242bad5a623b061fbaa426d
|
Oct 21, 2019 -
LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition
Markdown Text - 4.3 KB -
MD5: 971f62ef7dc31254dfc0e25f14347bc1
|
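The MD5 values listed for each file can be used to check a download for corruption. The following is a minimal sketch, assuming the file has already been saved locally. The listing does not show the file names, so the path in the usage note is hypothetical, and the helper names `md5_of_file` and `matches_listing` are illustrative only.

```python
import hashlib

# MD5 values copied from the three file entries in the listing.
LISTED_MD5 = {
    "9fb23ee878584f4cab717e348cdeeaaf",  # Gzip Archive, 20.3 GB
    "daf33d0f1242bad5a623b061fbaa426d",  # Gzip Archive, 17.2 GB
    "971f62ef7dc31254dfc0e25f14347bc1",  # Markdown Text, 4.3 KB
}


def md5_of_file(path, chunk_size=1 << 20):
    """Return the hex MD5 digest of a file, reading it in 1 MiB chunks
    so that multi-gigabyte archives do not have to fit in memory."""
    digest = hashlib.md5()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_listing(path):
    """Return True if the file's MD5 equals one of the listed values."""
    return md5_of_file(path) in LISTED_MD5
```

A call such as `matches_listing("downloaded_archive.tar.gz")` (hypothetical file name) returns `True` only when the local file hashes to one of the listed values. A `False` result suggests an incomplete or altered download.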