German Twitter Titling Corpus (ICPSR doi:10.11588/data/AOSUY6)

View:

Part 1: Document Description
Part 2: Study Description
Part 3: Data Files Description
Part 4: Variable Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

German Twitter Titling Corpus

Identification Number:

doi:10.11588/data/AOSUY6

Distributor:

heiDATA

Date of Distribution:

2020-03-06

Version:

1

Bibliographic Citation:

van den Berg, Esther, 2020, "German Twitter Titling Corpus", https://doi.org/10.11588/data/AOSUY6, heiDATA, V1, UNF:6:xIy4tRguIiz8xpg52FlxOA== [fileUNF]

Study Description

Citation

Title:

German Twitter Titling Corpus

Identification Number:

doi:10.11588/data/AOSUY6

Authoring Entity:

van den Berg, Esther (Leibniz Institute for the German Language / Department of Computational Linguistics, Heidelberg University)

Date of Production:

2020

Distributor:

heiDATA

Date of Distribution:

2020-03-06

Study Scope

Keywords:

Arts and Humanities, Computer and Information Science, framing, naming, Twitter, annotated corpus, German corpus, stance, sentiment, social media

Topic Classification:

Sentiment

Abstract:

The German Titling Twitter Corpus consists of 1904 stance-annotated tweets collected in June/July 2018 mentioning 24 German politicians with a doctoral degree. The Addendum contains an additional 296 stance-annotated tweets from each month of 2018 mentioning 6 left-leaning and 4 right-leaning politicians with a doctoral degree.

Kind of Data:

textual data in comma-separated format

Methodology and Processing

File Description--f3333

File: GTTC_addendum.tab

  • Number of cases: 296

  • No. of variables per record: 6

  • Type of File: text/tab-separated-values

Notes:

UNF:6:/0DZmxoM6p055wM6fW+b4w==

File Description--f3332

File: GTTC.tab

  • Number of cases: 1904

  • No. of variables per record: 5

  • Type of File: text/tab-separated-values

Notes:

UNF:6:hDTAU0fvrPT3em851EVmhw==

Variable Description

List of Variables:

Variables

ID

f3333 Location:

Summary Statistics: Valid 296.0; Min. 9.5441100573558784E17; StDev 3.8320031720292328E16; Max. 1.0794886413894697E18; Mean 1.01772846744354253E18

Variable Format: numeric

Notes: UNF:6:xLr0BLTxWblhuL3pV0YPzA==

Politician

f3333 Location:

Variable Format: character

Notes: UNF:6:bVdDuyxWNXSHJaljnWqvYA==

Party

f3333 Location:

Variable Format: character

Notes: UNF:6:rbHCMPXJ35VTBHr8UIkHdA==

Stance

f3333 Location:

Variable Format: character

Notes: UNF:6:KsS8yJWd1c75fU2sKjHPLA==

Naming-Form

f3333 Location:

Variable Format: character

Notes: UNF:6:Jtdi/dbxwX3dlbqRZdr87w==

Political-Orientation

f3333 Location:

Variable Format: character

Notes: UNF:6:xeYunupqtih1o2VVL1B4Jw==

ID

f3332 Location:

Summary Statistics: Max. 1.01983439984673178E18; Valid 1904.0; StDev 4.274166050955296E15; Mean 1.01085256989895181E18; Min. 1.00359050678028698E18

Variable Format: numeric

Notes: UNF:6:hInjGZCWXaiHeakrQQzV5w==

Politician

f3332 Location:

Variable Format: character

Notes: UNF:6:s1l6RuSg3ag2nTa1ivz8uw==

Party

f3332 Location:

Variable Format: character

Notes: UNF:6:NQHQCBalMEnAXz9DmhheJg==

Stance

f3332 Location:

Variable Format: character

Notes: UNF:6:bQImYpWswAIsiI9AU0aYUA==

Naming-Form

f3332 Location:

Variable Format: character

Notes: UNF:6:rjPCIbBqxc+dH8HCh27BHA==

Other Study-Related Materials

Label:

README.md

Notes:

text/markdown