Feb 17, 2021
Daza, Angel, 2021, "X-SRL Dataset and mBERT Word Aligner", https://doi.org/10.11588/data/HVXXIJ, heiDATA, V1
This code contains a method to automatically align words from parallel sentences by using multilingual BERT pre-trained embeddings. This can be used to transfer source annotations (for example labeled English sentences) into the target side (for example a German translation of th... |
Feb 17, 2021 -
X-SRL Dataset and mBERT Word Aligner
Markdown Text - 6.0 KB - MD5: 00d9aab1a8323bf228abd46cd51a666b
|
Feb 17, 2021 -
X-SRL Dataset and mBERT Word Aligner
ZIP Archive - 37.7 KB - MD5: 6b35c476556dfdb2b9b25a7a1cdc755d
|
Jan 20, 2021
van den Berg, Esther; Korfhage, Katharina; Ruppenhofer, Josef; Wiegand, Michael; Markert, Katja, 2020, "German Twitter Titling Corpus", https://doi.org/10.11588/data/AOSUY6, heiDATA, V2, UNF:6:14BxjwJS7Q3mfI6ei7iBBw== [fileUNF]
The German Twitter Titling Corpus consists of 1904 stance-annotated tweets, collected in June/July 2018, that mention 24 German politicians with a doctoral degree. The Addendum contains an additional 296 stance-annotated tweets from each month of 2018 mentioning 10 politicians with a... |
Jan 20, 2021 -
German Twitter Titling Corpus
Tab-Delimited - 19.7 KB - MD5: 0f6e049cae118929ae2265482e3b76b6
|
Jan 20, 2021 -
German Twitter Titling Corpus
Markdown Text - 1.2 KB - MD5: 2fb7128786b3a52452273bb4546963c5
|
Mar 26, 2020
Rehbein, Ines; Ruppenhofer, Josef; Do, Bich-Ngoc, 2020, "tweeDe", https://doi.org/10.11588/data/S90S35, heiDATA, V1
A German UD Twitter treebank, with >12,000 tokens from 519 tweets, annotated in the Universal Dependencies framework |
Mar 26, 2020 -
tweeDe
Plain Text - 4.3 KB - MD5: f331fd03061fbc1b28085934d6a9b10f
|
Mar 26, 2020 -
tweeDe
Unknown - 945.9 KB - MD5: 32d20db78b577a921d9fd4bc3868770e
|
Mar 26, 2020
Rehbein, Ines; Ruppenhofer, Josef; Zimmermann, Victor, 2020, "Pre-trained POS tagging models for German social media", https://doi.org/10.11588/data/W3JBV4, heiDATA, V1
Pre-trained POS tagging models for the HunPos tagger (Halácsy et al. 2007), the biLSTM-char-CRF tagger (Reimers & Gurevych 2017), and Online-Flors (Yin et al. 2015). References: Halácsy, P., Kornai, A., and Oravecz, C. (2007). HunPos: An open source trigram tagger. In Proceedings of th... |