Data publications from the DFG-funded research training group on Adaptive Information Processing from Heterogeneous Sources (AIPHES) at the CS Department at the Technical University of Darmstadt, the Institute for Computational Linguistics at the University of Heidelberg and the NLP Group at HITS.
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

1 to 6 of 6 Results
Jul 4, 2023
Paul, Debjit, 2023, "Source Code, Data and Additional Material for the Thesis: "Social Commonsense Reasoning with Structured Knowledge in Text"", https://doi.org/10.11588/data/C56QUV, heiDATA, V1
Understanding a social situation requires the ability to reason about the underlying emotions and behaviour of others. For example, when we read a personal story, we use our prior commonsense knowledge and social intelligence to infer the emotions, motives, and anticipate the act...
Feb 6, 2019
Heinzerling, Benjamin, 2019, "BPEmb: Pre-trained Subword Embeddings in 275 Languages (LREC 2018)", https://doi.org/10.11588/data/V9CXPR, heiDATA, V1
BPEmb is a collection of pre-trained subword unit embeddings in 275 languages, based on Byte-Pair Encoding (BPE). In an evaluation using fine-grained entity typing as testbed, BPEmb performs competitively, and for some languages better than alternative subword approaches, while r...
Feb 6, 2019
Heinzerling, Benjamin, 2019, "Source Code, Data and Additional Material for the Thesis: "Aspects of Coherence for Entity Analysis"", https://doi.org/10.11588/data/9JKAVW, heiDATA, V1
This dataset contains source code and system output used in the PhD thesis "Aspects of Coherence for Entity Analysis". This dataset is split into three parts corresponding to the chapters describing the three main contributions of the thesis: chapter3.tar.gz: Java source code for...
Feb 4, 2019
Marasovic, Ana, 2019, "Abstract Anaphora Resolution [Source Code]", https://doi.org/10.11588/data/UDMPY5, heiDATA, V1
Abstract Anaphora Resolution (AAR) aims to find the interpretation of nominal expressions (e.g., this result, those two actions) and pronominal expressions (e.g., this, that, it) that refer to abstract-object-antecedents such as facts, events, plans, actions, or situations. The f...
Feb 4, 2019
Marasovic, Ana, 2019, "SRL4ORL: Improving Opinion Role Labeling Using Multi-Task Learning With Semantic Role Labeling [Source Code]", https://doi.org/10.11588/data/LWN9XE, heiDATA, V1
This repository contains code for reproducing experiments done in Marasovic and Frank (2018). Paper abstract: For over a decade, machine learning has been used to extract opinion-holder-target structures from text to answer the question "Who expressed what kind of sentiment towar...
Jan 31, 2019
Heinzerling, Benjamin, 2019, "Selectional Preference Embeddings (EMNLP 2017)", https://doi.org/10.11588/data/FJQ4XL, heiDATA, V1
Joint embeddings of selectional preferences, words, and fine-grained entity types. The vocabulary consists of: verbs and their dependency relation separated by "@", e.g. "sink@nsubj" or "elect@dobj" words and short noun phrases, e.g. "Titanic" fine-grained entity types using the...
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.