Data publications of the Leibniz ScienceCampus “Empirical Linguistics and Computational Language Modeling”

The Leibniz ScienceCampus “Empirical Linguistics and Computational Language Modeling” (LiMo) is a cooperative research project between the Leibniz Institute for the German Language (Leibniz-Institut für Deutsche Sprache, IDS) in Mannheim and the Department of Computational Linguistics at Heidelberg University (ICL). The general aims of the project are to develop new methods, models, and tools for compiling and analysing automatically large German textual corpora covering different domains, genres and language varieties.

The project is supported by funds from the Baden-Württemberg Ministry of Science, Research and the Arts and the Leibniz Association together with funds provided by the Leibniz Institute for the German Language and Heidelberg University.

Funding Period: 2015 – 2020

Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

Advanced Search

81 to 90 of 184 Results
Gzip Archive - 157.3 MB - MD5: 763a915177b7ee6786e72ba31414f40d
Data
Gzip Archive - 184.8 MB - MD5: 42c33f1cb5479edfe24ef124afcbbef2
Data
Gzip Archive - 157.0 MB - MD5: d02ad4f2206a01dfc565f60eff32aa97
Data
ZIP Archive - 58.2 KB - MD5: 243c245db6088fe0024c94e9b503f0c3
Data
Gzip Archive - 8.9 MB - MD5: 4ac1020a50db7eb567561ac79dd7fddb
Data
Gzip Archive - 9.0 MB - MD5: 01aa0e08971e4652db3441dc8ad2f81b
Data
Gzip Archive - 13.3 MB - MD5: f1c8a15808715c67a1e00ce31060c32e
Data
Gzip Archive - 14.4 MB - MD5: 9a02480db917c11e9a17d88a2eefd0b1
Data
Mar 26, 2020
Rehbein, Ines; Ruppenhofer, Josef; Zimmermann, Victor, 2020, "Pre-trained POS tagging models for German social media", https://doi.org/10.11588/data/W3JBV4, heiDATA, V1
Pre-trained POS tagging models for the HunPos tagger (Halácsy et al. 2007) the biLSTM-char-CRF tagger (Reimers & Gurevych 2017) Online-Flors (Yin et al. 2015). References: Halácsy, P., Kornai, A., and Oravecz, C. (2007). HunPos: An open source trigram tagger. In Proceedings of th...
ZIP Archive - 12.7 MB - MD5: 7057006601db4c004d0f5e041e508e08
CodeData
Add Data

Sign up or log in to create a dataverse or add a dataset.

Share Dataverse

Share this dataverse on your favorite social media networks.

Link Dataverse
Reset Modifications

Are you sure you want to reset the selected metadata fields? If you do this, any customizations (hidden, required, optional) you have done will no longer appear.