GermEval-2018 Corpus (DE) (ICPSR doi:10.11588/data/0B5VML)

Document Description

Citation

Title:

GermEval-2018 Corpus (DE)

Identification Number:

doi:10.11588/data/0B5VML

Distributor:

heiDATA

Date of Distribution:

2019-09-02

Version:

1

Bibliographic Citation:

Wiegand, Michael, 2019, "GermEval-2018 Corpus (DE)", https://doi.org/10.11588/data/0B5VML, heiDATA, V1

Study Description

Citation

Title:

GermEval-2018 Corpus (DE)

Identification Number:

doi:10.11588/data/0B5VML

Authoring Entity:

Wiegand, Michael (Spoken Language Systems, Saarland University (2010-2018), Leibniz Institute for the German Language (since 2019))

Date of Production:

2018

Distributor:

heiDATA

Date of Distribution:

2019-09-02

Study Scope

Keywords:

Arts and Humanities, Computer and Information Science, sentiment, German, offensive language, tweets, Twitter, social media, GermEval Shared Task

Topic Classification:

offensive language detection

Abstract:

This dataset comprises the training and test data (German tweets) from the GermEval 2018 Shared Task on Offensive Language Detection.

Kind of Data:

text files, tab separated values

Other Study-Related Materials

Label:

GermEval-2018-Data-master.zip

Notes:

application/zip

Other Study-Related Materials

Label:

README.md

Notes:

text/markdown