Jun 13, 2020 - Statistical Natural Language Processing Group
Beilharz, Benjamin; Sun, Xin, 2019, "LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition", https://doi.org/10.11588/data/TMEDTX, heiDATA, V2
This dataset is a corpus of sentence-aligned triples of German audio, German text, and English translation, based on German audio books. The corpus consists of over 100 hours of audio material and over 50k parallel sentences. The speech data are low in disfluencies because of the...
Mar 26, 2021 - IWR Computer Graphics
Mara, Hubert, 2019, "HeiCuBeDa Hilprecht - Heidelberg Cuneiform Benchmark Dataset for the Hilprecht Collection", https://doi.org/10.11588/data/IE8CCN, heiDATA, V2
The number of known cuneiform tablets is assumed to be in the hundreds of thousands. A fraction has been published by printing photographs and manual tracings in books, which is collected by the online Cuneiform Digital Library Initiative (CDLI) catalog including some of these im...
Feb 19, 2024 - SFB 933 Materiale Textkulturen - Teilprojekt C05
Philipp Friedhofen, Ludger Lieb, Michael R. Ott, Laura Velte, 2019, "Erzählte Inschriften in der Literatur des Mittelalters (Projektdatenbank)", https://doi.org/10.11588/data/0HJAJS, heiDATA, V3, UNF:6:zYyA6vs0VkcR2qiIHbGcVw== [fileUNF]
This data publication originates from subproject C05 (»Inschriftlichkeit. Reflexionen materialer Textkultur in der Literatur des 12. bis 17. Jahrhunderts«) of Collaborative Research Centre 933 (»Materiale Textkulturen«, funding period: 2011–2023). Within the subproject, ...