Statistical Natural Language Processing Group

Publication Year: 2019

1 to 4 of 4 Results

LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition Jun 13, 2020 Beilharz, Benjamin; Sun, Xin, 2019, "LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition", https://doi.org/10.11588/data/TMEDTX, heiDATA, V2 This dataset is a corpus of sentence-aligned triples of German audio, German text, and English translation, based on German audio books. The corpus consists of over 100 hours of audio material and over 50k parallel sentences. The speech data are low in disfluencies because of the...
librivoxdeen-1.01_part1.tar.gz Jun 13, 2020 - LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition Gzip Archive - 20.3 GB - MD5: 9fb23ee878584f4cab717e348cdeeaaf Data
librivoxdeen-1.01_part2.tar.gz Jun 13, 2020 - LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition Gzip Archive - 17.2 GB - MD5: daf33d0f1242bad5a623b061fbaa426d Data
README.md Oct 21, 2019 - LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition Markdown Text - 4.3 KB - MD5: 971f62ef7dc31254dfc0e25f14347bc1 Documentation

LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition

Jun 13, 2020

Beilharz, Benjamin; Sun, Xin, 2019, "LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition", https://doi.org/10.11588/data/TMEDTX, heiDATA, V2

This dataset is a corpus of sentence-aligned triples of German audio, German text, and English translation, based on German audio books. The corpus consists of over 100 hours of audio material and over 50k parallel sentences. The speech data are low in disfluencies because of the...

librivoxdeen-1.01_part1.tar.gz

Jun 13, 2020 - LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition

Gzip Archive - 20.3 GB -

Data

librivoxdeen-1.01_part2.tar.gz

Jun 13, 2020 - LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition

Gzip Archive - 17.2 GB -

Data

README.md

Oct 21, 2019 - LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition

Markdown Text - 4.3 KB -

Documentation

Add Data