Metrics
223,007 Downloads
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

There was an error with your search parameters. Please clear your search and try again.

1 to 10 of 18 Results
Apr 24, 2024 - AIPHES
Mihaylov, Todor, 2024, "Knowledge-Enhanced Neural Networks for Machine Reading Comprehension [Source Code and Additional Material]", https://doi.org/10.11588/data/HU3ARF, heiDATA, V1
Machine Reading Comprehension is a language understanding task where a system is expected to read a given passage of text and typically answer questions about it. When humans assess the task of reading comprehension, in addition to the presented text, they usually use the knowled...
RATIO_EXPLAIN(Heidelberg University, Department of Computational Linguistics)
Feb 26, 2024
Open Research Data from the ExpLAIN project, a joint research project of the NLP Group at the Computational Linguistics Department of Heidelberg University and the Data and Web Science Groupat University of Mannheim.
Natural Language Processing Group(Universität Heidelberg)
Jan 17, 2024
The main purpose of language is to encode and communicate information of all sorts. Our research focuses on semantics — the study of meaning — and how a machine can assign meaning to utterances: words, sentences and texts, as humans can do. Our work is linguistically informed and...
HITS MBM(Heidelberg Institute for Theoretical Studies)
Nov 8, 2022
Data publications from the the Molecular Biomechanics (MBM) group at HITS. Shareholders of HITS are the “HITS-Stiftung”, Heidelberg University and the Karlsruhe Institute of Technology (KIT)
Jun 13, 2020 - Statistical Natural Language Processing Group
Beilharz, Benjamin; Sun, Xin, 2019, "LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition", https://doi.org/10.11588/data/TMEDTX, heiDATA, V2
This dataset is a corpus of sentence-aligned triples of German audio, German text, and English translation, based on German audio books. The corpus consists of over 100 hours of audio material and over 50k parallel sentences. The speech data are low in disfluencies because of the...
Jan 15, 2020 - 3D Spatial Data Processing
Herfort, Benjamin; Anders, Katharina; Marx, Sabrina; Eberlein, Stefan; Höfle, Bernhard, 2020, "3D Micro-Mapping of Subsidence Stations [Source Code and Data]", https://doi.org/10.11588/data/OU8YA1, heiDATA, V1
This dataset comprises the source code to reproduce the 3D micro-mapping tool for plane adjustment at subsidence stations. In this project, users adjust a plane (height and orientation) at the positions of fixed poles, so-called subsidence stations, to provide information on the...
Sep 2, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)
Wiegand, Michael, 2019, "Opinion role extractor", https://doi.org/10.11588/data/3W7AQP, heiDATA, V1
System for the Extraction of Subjective Expressions, Sentiment Sources and Sentiment Targets from German Text
Feb 6, 2019 - AIPHES
Heinzerling, Benjamin, 2019, "BPEmb: Pre-trained Subword Embeddings in 275 Languages (LREC 2018)", https://doi.org/10.11588/data/V9CXPR, heiDATA, V1
BPEmb is a collection of pre-trained subword unit embeddings in 275 languages, based on Byte-Pair Encoding (BPE). In an evaluation using fine-grained entity typing as testbed, BPEmb performs competitively, and for some languages better than alternative subword approaches, while r...
Feb 6, 2019 - AIPHES
Heinzerling, Benjamin, 2019, "Source Code, Data and Additional Material for the Thesis: "Aspects of Coherence for Entity Analysis"", https://doi.org/10.11588/data/9JKAVW, heiDATA, V1
This dataset contains source code and system output used in the PhD thesis "Aspects of Coherence for Entity Analysis". This dataset is split into three parts corresponding to the chapters describing the three main contributions of the thesis: chapter3.tar.gz: Java source code for...
Feb 4, 2019 - AIPHES
Marasovic, Ana, 2019, "Abstract Anaphora Resolution [Source Code]", https://doi.org/10.11588/data/UDMPY5, heiDATA, V1
Abstract Anaphora Resolution (AAR) aims to find the interpretation of nominal expressions (e.g., this result, those two actions) and pronominal expressions (e.g., this, that, it) that refer to abstract-object-antecedents such as facts, events, plans, actions, or situations. The f...
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.