tweeDe (ICPSR doi:10.11588/data/S90S35)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

tweeDe

Identification Number:

doi:10.11588/data/S90S35

Distributor:

heiDATA

Date of Distribution:

2020-03-26

Version:

1

Bibliographic Citation:

Rehbein, Ines; Ruppenhofer, Josef; Do, Bich-Ngoc, 2020, "tweeDe", https://doi.org/10.11588/data/S90S35, heiDATA, V1

Study Description

Citation

Title:

tweeDe

Subtitle:

A German UD Twitter treebank

Identification Number:

doi:10.11588/data/S90S35

Authoring Entity:

Rehbein, Ines (Leibniz Institute for the German Language)

Ruppenhofer, Josef (Leibniz Institute for the German Language)

Do, Bich-Ngoc (Department of Computational Linguistics, Heidelberg University)

Date of Production:

2019

Distributor:

heiDATA

Date of Distribution:

2020-03-26

Study Scope

Keywords:

Arts and Humanities, Computer and Information Science, Twitter treebank, tweets, German, Universal Dependency, parsing, annotation, part of speech, PoS, morphological feature, syntactic dependency

Topic Classification:

Treebanking, Dependency parsing

Abstract:

A German UD Twitter treebank, with >12,000 tokens from 519 tweets, annotated in the Universal Dependencies framework

Kind of Data:

archived tab-separated text (CoNLL-U)

Methodology and Processing

Other Study-Related Materials

Label:

readme.txt

Notes:

text/plain

Other Study-Related Materials

Label:

tweeDe.conllu

Notes:

application/octet-stream