Metrics
189,636 Downloads
heiDATA is an institutional repository for Open Research Data from Heidelberg University. It is managed by Heidelberg University's Research Data Unit, a joint institution of the University Library and the Computing Centre. If you are interested in publishing your data here, please have a look at our user guide and get in touch with us. Detailed information on heiDATA can be found in our service description.
Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

491 to 500 of 700 Results
Sep 2, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)
Wiegand, Michael, 2019, "GermEval-2018 Corpus (DE)", https://doi.org/10.11588/data/0B5VML, heiDATA, V1
This dataset comprises the training and test data (German tweets) from the GermEval 2018 Shared on Offensive Language Detection.
Sep 2, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)
Wiegand, Michael, 2019, "Lexicon of Abusive Words (EN)", https://doi.org/10.11588/data/MKPEYV, heiDATA, V1
This goldstandard contains a bootstrapped lexicon of abusive words. The lexicon comprises a large set of English negative polar expressions annotated as either abusive or not.
Sep 2, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)
Wiegand, Michael, 2019, "Opinion role extractor", https://doi.org/10.11588/data/3W7AQP, heiDATA, V1
System for the Extraction of Subjective Expressions, Sentiment Sources and Sentiment Targets from German Text
Sep 2, 2019 - Heidelberg University Language and Cognition Lab
Gerwien, Johannes, 2019, "The interpretation and prediction of event participants in Mandarin verb-final active and passive sentences [Dataset]", https://doi.org/10.11588/data/L7QPUY, heiDATA, V1, UNF:6:Gpy3ySsey0gDHrTBkgp1Bg== [fileUNF]
This data set contains eye tracking data collected with an SMI RED 500 eye tracking system. The experimental design, elicitation method, coding, and criteria for excluding/including data are documented in the article: Gerwien, J. (2019) "The interpretation and prediction of event...
Sep 2, 2019 - AWI Experimental Economics
Oechssler, Jörg; Rau, Hannes; Roomets, Alex, 2019, "Hedging, ambiguity, and the reversal of order axiom [Dataset]", https://doi.org/10.11588/data/1XDKHZ, heiDATA, V1, UNF:6:c8rHrHnmCxS3BC4O6euGVQ== [fileUNF]
We ran experiments that gave subjects a straight-forward and simple opportunity to hedge away ambiguity in an Ellsberg-style experiment. Subjects had to make bets on the combined outcomes of a fair coin and a draw from an ambiguous urn. By modifying the timing of the draw, coin f...
Aug 28, 2019 - Germania
Roxburgh, Marcus; Olli, Maarja, 2019, "Eyes to the North: a multi-element analysis of copper-alloy eye brooches in the eastern Baltic, produced during the Roman Iron Age [Supplement]", https://doi.org/10.11588/data/7WOCTK, heiDATA, V1, UNF:6:EnP0e/xxKuwYHp2brVn3sw== [fileUNF]
Roman Iron Age. Their forms bear strong similarities to those found much further south in Germania and the northern Roman provinces, leading to the conclusion that they originally arrived in the region as imports, perhaps by sea from an as yet undiscovered production centre in an...
Aug 23, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)
van den Berg, Esther; Korfhage, Katharina; Ruppenhofer, Josef; Wiegand, Michael; Markert, Katja, 2019, "Twitter Titling Corpus", https://doi.org/10.11588/data/IOHXDF, heiDATA, V1, UNF:6:+F3lLKziwMvjy+xyktkilw== [fileUNF]
The Twitter Titling Corpus contains 4002 stance-annotated tweets collected between 20 June 2017 and 30 August 2017 mentioning 6 presidents. Each tweet is annotated for the naming form used to refer to the president, for the purpose of a study on the relation between naming variat...
Aug 19, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)
Kotnis, Bhushan, 2019, "Negative Sampling for Learning Knowledge Graph Embeddings", https://doi.org/10.11588/data/YYULL2, heiDATA, V1
Reimplementation of four KG factorization methods and six negative sampling methods. Abstract Knowledge graphs are large, useful, but incomplete knowledge repositories. They encode knowledge through entities and relations which define each other through the connective structure o...
Aug 19, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)
Kotnis, Bhushan, 2019, "KGE Algorithms", https://doi.org/10.11588/data/CSXYSS, heiDATA, V1
An updated method for link prediction that uses a regularization factor that models relation argument types Abstract (Kotnis and Nastase, 2017): Learning relations based on evidence from knowledge repositories relies on processing the available relation instances. Knowledge repos...
Jul 15, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)
Nastase, Vivi; Kotnis, Bhushan, 2019, "Abstract graphs, abstract paths, grounded paths for Freebase and NELL", https://doi.org/10.11588/data/AVLFPZ, heiDATA, V1
We describe a method for representing knowledge graphs that capture an intensional representation of the original extensional information. This representation is very compact, and it abstracts away from individual links, allowing us to find better path candidates, as shown by the...
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.