May 21, 2014
The Statistical Natural Language Processing Group is part of the Department of Computational Linguistics. Our research addresses various aspects of the problem of the confusion of languages by means of statistical learning techniques. Research topics include the following: Stati...
Jun 16, 2014 - Statistical Natural Language Processing Group
Sokolov, Artem; Jehl, Laura; Hieber, Felix; Ruppert, Eugen; Riezler, Stefan, 2014, "BoostCLIR: JP-EN Relevance Marked Patent Corpus", https://doi.org/10.11588/data/10001, heiDATA, V1
BoostCLIR is a bilingual (Japanese-English) corpus of patent abstracts extracted from the MAREC patent data and from the data of the NTCIR PatentMT workshop collections, accompanied by relevance judgements for the task of patent prior-art search. Important: The English side of t...
Jun 16, 2014 - Statistical Natural Language Processing Group
Wäschle, Katharina; Riezler, Stefan, 2014, "PatTR: Patent Translation Resource", https://doi.org/10.11588/data/10002, heiDATA, V3
PatTR is a sentence-parallel corpus extracted from the MAREC patent collection. The current version contains more than 22 million German-English and 18 million French-English parallel sentences collected from all patent text sections as well as 5 million German-French sentence pa...
Jun 18, 2014 - Statistical Natural Language Processing Group
Hieber, Felix; Schamoni, Shigehiko; Sokolov, Artem; Riezler, Stefan, 2014, "WikiCLIR: A Cross-Lingual Retrieval Dataset from Wikipedia", https://doi.org/10.11588/data/10003, heiDATA, V1
WikiCLIR is a large-scale (German-English) retrieval data set for Cross-Language Information Retrieval (CLIR). It contains a total of 245,294 German single-sentence queries with 3,200,393 automatically extracted relevance judgments for 1,226,741 English Wikipedia articles as docu...
Aug 13, 2014 - Database Systems Research Group
Strötgen, Jannik; Gertz, Michael, 2014, "WikiWarsDE Corpus", https://doi.org/10.11588/data/10026, heiDATA, V1
The WikiWarsDE corpus is a German corpus containing Wikipedia articles with annotations of temporal expressions. Its creation was motivated by the English WikiWars corpus (Mazur & Dale 2010). WikiWarsDE was developed to support research on temporal information extraction and norm...
Aug 13, 2014
Data publications of the Database Systems Research Group at Heidelberg University.
Oct 15, 2014
This Dataverse contains research data of the Institute for Theoretical Physics at Heidelberg University.
Apr 2, 2015 - IWR Computer Graphics
Krömker, Susanne; Mara, Hubert, 2015, "Seal of the University of Heidelberg - Siegel UAH SG 11", https://doi.org/10.11588/data/10044, heiDATA, V1
Capturing Heidelberg University's sealings with a 3D scanner provides new visualizations and virtual restoration possibilities for the partially preserved, damaged objects. High-resolution 3D models are analyzed for their surface features, which cannot be documented with ph...
Apr 2, 2015 - IWR Computer Graphics
Krömker, Susanne; Mara, Hubert, 2015, "Seal of the University of Heidelberg - Siegel UAH SG 5", https://doi.org/10.11588/data/10000, heiDATA, V1
Capturing Heidelberg University's sealings with a 3D scanner provides new visualizations and virtual restoration possibilities for the partially preserved, damaged objects. High-resolution 3D models are analyzed for their surface features, which cannot be documented with ph...
Apr 2, 2015 - IWR Computer Graphics
Krömker, Susanne; Mara, Hubert, 2015, "Seal of the University of Heidelberg - Siegel UAH XII 2 87", https://doi.org/10.11588/data/10045, heiDATA, V1
Capturing Heidelberg University's sealings with a 3D scanner provides new visualizations and virtual restoration possibilities for the partially preserved, damaged objects. High-resolution 3D models are analyzed for their surface features, which cannot be documented with ph...