heiDATA

Metrics

193,550 Downloads

Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Deposit Date: 2019 Subject: Arts and Humanities

1 to 10 of 14 Results

Zur absoluten Chronologie der Einzelgrabkultur in Norddeutschland und Nordjütland [Supplement] Nov 22, 2019 - Germania Brozio, Jan Piet, 2019, "Zur absoluten Chronologie der Einzelgrabkultur in Norddeutschland und Nordjütland [Supplement]", https://doi.org/10.11588/data/HBAVWG, heiDATA, V2, UNF:6:PZt1V6obgH7b1f61ZaYOBA== [fileUNF] Mit der Absicht ein typochronologisches Modell der geschweiften Becher der norddeutschen Einzelgrabkultur und damit eine detaillierte chronologische Differenzierung für die Diskussion historischer Prozesse zu entwickeln, erfolgt zunächst eine absolutchronologische Einordnung der...
Sentiment View Lexicon (EN) Sep 5, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo) Wiegand, Michael; Ruppenhofer, Josef; Schulder, Marc, 2019, "Sentiment View Lexicon (EN)", https://doi.org/10.11588/data/2JK48O, heiDATA, V1 This gold standard contains sentiment expressions (verbs, nouns and adjectives) that have been annotated according to their (prior) sentiment view. Each sentiment expression is labelled either as actor or speaker view.
Sentiment Compound Data (DE) Sep 5, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo) Wiegand, Michael; Bocionek, Christine; Ruppenhofer, Josef, 2019, "Sentiment Compound Data (DE)", https://doi.org/10.11588/data/LSTRK3, heiDATA, V1 This dataset contains gold standards that are required for building a classifier that automatically extracts opinion (noun) compounds.
Pre-trained POS tagging models for German social media Mar 26, 2020 - Empirical Linguistics and Computational Language Modeling (LiMo) Rehbein, Ines; Ruppenhofer, Josef; Zimmermann, Victor, 2020, "Pre-trained POS tagging models for German social media", https://doi.org/10.11588/data/W3JBV4, heiDATA, V1 Pre-trained POS tagging models for the HunPos tagger (Halácsy et al. 2007) the biLSTM-char-CRF tagger (Reimers & Gurevych 2017) Online-Flors (Yin et al. 2015). References: Halácsy, P., Kornai, A., and Oravecz, C. (2007). HunPos: An open source trigram tagger. In Proceedings of th...
MACE-AL-TREE Mar 26, 2020 - Empirical Linguistics and Computational Language Modeling (LiMo) Rehbein, Ines; Ruppenhofer, Josef, 2020, "MACE-AL-TREE", https://doi.org/10.11588/data/THPEBR, heiDATA, V1 An method for detecting noise in automatically annotated dependency parse trees, combining MACE (Hovy et al. 2013) with Active Learning.
MACE-AL Mar 26, 2020 - Empirical Linguistics and Computational Language Modeling (LiMo) Rehbein, Ines; Ruppenhofer, Josef; Steen, Julius, 2020, "MACE-AL", https://doi.org/10.11588/data/C2OQN4, heiDATA, V1 A method for detecting noise in automatically annotated sequence-labelled data, combining MACE (Hovy et al. 2013) with Active Learning.
Lexicon of Abusive Words (EN) Sep 2, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo) Wiegand, Michael, 2019, "Lexicon of Abusive Words (EN)", https://doi.org/10.11588/data/MKPEYV, heiDATA, V1 This goldstandard contains a bootstrapped lexicon of abusive words. The lexicon comprises a large set of English negative polar expressions annotated as either abusive or not.
Lauchheim II.2. Katalog der Gräber 301–600 Jul 2, 2019 - Propylaeum@heiDATA Höke, Benjamin; Gauß, Florian; Peek, Christina; Stelzner, Jörg, 2019, "Lauchheim II.2. Katalog der Gräber 301–600", https://doi.org/10.11588/data/HB97MY, heiDATA, V1 Mit rund 1300 Gräbern aus dem Zeitraum vom späten 5. bis zum späten 7. Jahrhundert ist das Gräberfeld von Lauchheim 'Wasserfurche' (Ostalbkreis) bis heute der größte bekannte merowingerzeitliche Bestattungsplatz Süddeutschlands. In den Jahren 1986 bis 1996 wurde das fast vollstän...
HeiCuBeDa Hilprecht - Heidelberg Cuneiform Benchmark Dataset for the Hilprecht Collection Mar 26, 2021 - IWR Computer Graphics Mara, Hubert, 2019, "HeiCuBeDa Hilprecht - Heidelberg Cuneiform Benchmark Dataset for the Hilprecht Collection", https://doi.org/10.11588/data/IE8CCN, heiDATA, V2 The number of known cuneiform tablets is assumed to be in the hundreds of thousands. A fraction has been published by printing photographs and manual tracings in books, which is collected by the online Cuneiform Digital Library Initiative (CDLI) catalog including some of these im...
GermEval-2018 Corpus (DE) Sep 2, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo) Wiegand, Michael, 2019, "GermEval-2018 Corpus (DE)", https://doi.org/10.11588/data/0B5VML, heiDATA, V1 This dataset comprises the training and test data (German tweets) from the GermEval 2018 Shared on Offensive Language Detection.

Zur absoluten Chronologie der Einzelgrabkultur in Norddeutschland und Nordjütland [Supplement]

Nov 22, 2019 - Germania

Brozio, Jan Piet, 2019, "Zur absoluten Chronologie der Einzelgrabkultur in Norddeutschland und Nordjütland [Supplement]", https://doi.org/10.11588/data/HBAVWG, heiDATA, V2, UNF:6:PZt1V6obgH7b1f61ZaYOBA== [fileUNF]

Mit der Absicht ein typochronologisches Modell der geschweiften Becher der norddeutschen Einzelgrabkultur und damit eine detaillierte chronologische Differenzierung für die Diskussion historischer Prozesse zu entwickeln, erfolgt zunächst eine absolutchronologische Einordnung der...

Sentiment View Lexicon (EN)

Sep 5, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)

Wiegand, Michael; Ruppenhofer, Josef; Schulder, Marc, 2019, "Sentiment View Lexicon (EN)", https://doi.org/10.11588/data/2JK48O, heiDATA, V1

This gold standard contains sentiment expressions (verbs, nouns and adjectives) that have been annotated according to their (prior) sentiment view. Each sentiment expression is labelled either as actor or speaker view.

Sentiment Compound Data (DE)

Sep 5, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)

Wiegand, Michael; Bocionek, Christine; Ruppenhofer, Josef, 2019, "Sentiment Compound Data (DE)", https://doi.org/10.11588/data/LSTRK3, heiDATA, V1

This dataset contains gold standards that are required for building a classifier that automatically extracts opinion (noun) compounds.

Pre-trained POS tagging models for German social media

Mar 26, 2020 - Empirical Linguistics and Computational Language Modeling (LiMo)

Rehbein, Ines; Ruppenhofer, Josef; Zimmermann, Victor, 2020, "Pre-trained POS tagging models for German social media", https://doi.org/10.11588/data/W3JBV4, heiDATA, V1

Pre-trained POS tagging models for the HunPos tagger (Halácsy et al. 2007) the biLSTM-char-CRF tagger (Reimers & Gurevych 2017) Online-Flors (Yin et al. 2015). References: Halácsy, P., Kornai, A., and Oravecz, C. (2007). HunPos: An open source trigram tagger. In Proceedings of th...

MACE-AL-TREE

Mar 26, 2020 - Empirical Linguistics and Computational Language Modeling (LiMo)

Rehbein, Ines; Ruppenhofer, Josef, 2020, "MACE-AL-TREE", https://doi.org/10.11588/data/THPEBR, heiDATA, V1

An method for detecting noise in automatically annotated dependency parse trees, combining MACE (Hovy et al. 2013) with Active Learning.

MACE-AL

Mar 26, 2020 - Empirical Linguistics and Computational Language Modeling (LiMo)

Rehbein, Ines; Ruppenhofer, Josef; Steen, Julius, 2020, "MACE-AL", https://doi.org/10.11588/data/C2OQN4, heiDATA, V1

A method for detecting noise in automatically annotated sequence-labelled data, combining MACE (Hovy et al. 2013) with Active Learning.

Lexicon of Abusive Words (EN)

Sep 2, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)

Wiegand, Michael, 2019, "Lexicon of Abusive Words (EN)", https://doi.org/10.11588/data/MKPEYV, heiDATA, V1

This goldstandard contains a bootstrapped lexicon of abusive words. The lexicon comprises a large set of English negative polar expressions annotated as either abusive or not.

Lauchheim II.2. Katalog der Gräber 301–600

Jul 2, 2019 - Propylaeum@heiDATA

Höke, Benjamin; Gauß, Florian; Peek, Christina; Stelzner, Jörg, 2019, "Lauchheim II.2. Katalog der Gräber 301–600", https://doi.org/10.11588/data/HB97MY, heiDATA, V1

Mit rund 1300 Gräbern aus dem Zeitraum vom späten 5. bis zum späten 7. Jahrhundert ist das Gräberfeld von Lauchheim 'Wasserfurche' (Ostalbkreis) bis heute der größte bekannte merowingerzeitliche Bestattungsplatz Süddeutschlands. In den Jahren 1986 bis 1996 wurde das fast vollstän...

HeiCuBeDa Hilprecht - Heidelberg Cuneiform Benchmark Dataset for the Hilprecht Collection

Mar 26, 2021 - IWR Computer Graphics

Mara, Hubert, 2019, "HeiCuBeDa Hilprecht - Heidelberg Cuneiform Benchmark Dataset for the Hilprecht Collection", https://doi.org/10.11588/data/IE8CCN, heiDATA, V2

The number of known cuneiform tablets is assumed to be in the hundreds of thousands. A fraction has been published by printing photographs and manual tracings in books, which is collected by the online Cuneiform Digital Library Initiative (CDLI) catalog including some of these im...

GermEval-2018 Corpus (DE)

Sep 2, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)

Wiegand, Michael, 2019, "GermEval-2018 Corpus (DE)", https://doi.org/10.11588/data/0B5VML, heiDATA, V1

This dataset comprises the training and test data (German tweets) from the GermEval 2018 Shared on Offensive Language Detection.

Add Data

Share Dataverse

Link Dataverse

Reset Modifications