81 to 90 of 93 Results
Jul 4, 2023 - AIPHES
Paul, Debjit, 2023, "Source Code, Data and Additional Material for the Thesis: 'Social Commonsense Reasoning with Structured Knowledge in Text'", https://doi.org/10.11588/data/C56QUV, heiDATA, V1
Understanding a social situation requires the ability to reason about the underlying emotions and behaviour of others. For example, when we read a personal story, we use our prior commonsense knowledge and social intelligence to infer the emotions, motives, and anticipate the act...
Feb 4, 2019 - AIPHES
Marasovic, Ana, 2019, "SRL4ORL: Improving Opinion Role Labeling Using Multi-Task Learning With Semantic Role Labeling [Source Code]", https://doi.org/10.11588/data/LWN9XE, heiDATA, V1
This repository contains code for reproducing experiments done in Marasovic and Frank (2018). Paper abstract: For over a decade, machine learning has been used to extract opinion-holder-target structures from text to answer the question "Who expressed what kind of sentiment towar...
Jul 14, 2022 - Computer Assisted Clinical Medicine
Zöllner, Frank, 2022, "Synthesis of CT images from digital body phantoms using CycleGAN [dataset]", https://doi.org/10.11588/data/7NRFYC, heiDATA, V1
The potential of medical image analysis with neural networks is limited by the restricted availability of extensive data sets. The incorporation of synthetic training data is one approach to bypass this shortcoming, as synthetic data offer accurate annotations and unlimited data...
Jul 25, 2022 - Scientific Software Center (SSC)
Uieda, Leonardo, 2022, "Test data for the Pooch library", https://doi.org/10.11588/data/TKCFEF, heiDATA, V1
Pooch is an open-source Python library for data download. This archive contains testing data for Pooch's DataVerse download functionality.
Nov 2, 2016 - Perspektive Bibliothek
Drees, Bastian, 2016, "Text und Data Mining an wissenschaftlichen Repositorien und Publikationsservern in Deutschland - Zusammenfassung der Ergebnisse einer Umfrage im Februar und März 2016", https://doi.org/10.11588/data/10090, heiDATA, V2
The contact persons listed on the homepages of academic repositories and publication servers in Germany were surveyed about their experiences with text and data mining. The survey was conducted by e-mail between 22 and 26 February 2016. Contact persons...
Oct 7, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)
Marasović, Ana; Zhou, Mengfei; Frank, Anette, 2019, "The MSC Data Set", https://doi.org/10.11588/data/JEESIQ, heiDATA, V1
From this page you can download resources we created for modal sense classification as reported in Zhou et al. (2015), Marasović et al. (2016) and Marasović and Frank (2015) (see "Related Publication" below): Heuristically sense-annotated training data acquired from EUROPARL and...
Nov 13, 2023 - Neural Techniques for German Dependency Parsing
Do, Bich-Ngoc; Rehbein, Ines, 2023, "Tool for Extracting PP Attachment Disambiguation Dataset", https://doi.org/10.11588/data/RHD3KS, heiDATA, V1
This resource contains code to extract a PP attachment disambiguation dataset as described in the paper: Do and Rehbein (2020). "Parsers Know Best: German PP Attachment Revisited". The input is in CoNLL format, and the output format is similar to the one described in de Kok et al...
Nov 13, 2023 - Neural Techniques for German Dependency Parsing
Do, Bich-Ngoc; Rehbein, Ines, 2023, "Topological Field Labeler for German", https://doi.org/10.11588/data/YYNQFF, heiDATA, V1
This resource contains the code of the topological labeler used in the paper: Do and Rehbein (2020). "Parsers Know Best: German PP Attachment Revisited". For this tool, labeling topological fields is formulated as a sequence labeling task. We also include in this resource two pre-...
Mar 26, 2020 - Empirical Linguistics and Computational Language Modeling (LiMo)
Rehbein, Ines; Ruppenhofer, Josef; Do, Bich-Ngoc, 2020, "tweeDe", https://doi.org/10.11588/data/S90S35, heiDATA, V1
A German UD Twitter treebank, with >12,000 tokens from 519 tweets, annotated in the Universal Dependencies framework.
Aug 23, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)
van den Berg, Esther; Korfhage, Katharina; Ruppenhofer, Josef; Wiegand, Michael; Markert, Katja, 2019, "Twitter Titling Corpus", https://doi.org/10.11588/data/IOHXDF, heiDATA, V1, UNF:6:+F3lLKziwMvjy+xyktkilw== [fileUNF]
The Twitter Titling Corpus contains 4002 stance-annotated tweets collected between 20 June 2017 and 30 August 2017 mentioning 6 presidents. Each tweet is annotated for the naming form used to refer to the president, for the purpose of a study on the relation between naming variat...