heiDATA

Metrics

223,007 Downloads

Featured Dataverses

In order to use this feature you must have at least one published dataverse.

Publish Dataverse

Are you sure you want to publish your dataverse? Once you do so it must remain published.

Publish Dataverse

This dataverse cannot be published because the dataverse it is in has not been published.

Delete Dataverse

Are you sure you want to delete your dataverse? You cannot undelete this dataverse.

There was an error with your search parameters. Please clear your search and try again.

1 to 10 of 18 Results

Knowledge-Enhanced Neural Networks for Machine Reading Comprehension [Source Code and Additional Material] Apr 24, 2024 - AIPHES Mihaylov, Todor, 2024, "Knowledge-Enhanced Neural Networks for Machine Reading Comprehension [Source Code and Additional Material]", https://doi.org/10.11588/data/HU3ARF, heiDATA, V1 Machine Reading Comprehension is a language understanding task where a system is expected to read a given passage of text and typically answer questions about it. When humans assess the task of reading comprehension, in addition to the presented text, they usually use the knowled...
RATIO_EXPLAIN(Heidelberg University, Department of Computational Linguistics) Feb 26, 2024 Open Research Data from the ExpLAIN project, a joint research project of the NLP Group at the Computational Linguistics Department of Heidelberg University and the Data and Web Science Groupat University of Mannheim.
Natural Language Processing Group(Universität Heidelberg) Jan 17, 2024 The main purpose of language is to encode and communicate information of all sorts. Our research focuses on semantics — the study of meaning — and how a machine can assign meaning to utterances: words, sentences and texts, as humans can do. Our work is linguistically informed and...
HITS MBM(Heidelberg Institute for Theoretical Studies) Nov 8, 2022 Data publications from the the Molecular Biomechanics (MBM) group at HITS. Shareholders of HITS are the “HITS-Stiftung”, Heidelberg University and the Karlsruhe Institute of Technology (KIT)
LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition Jun 13, 2020 - Statistical Natural Language Processing Group Beilharz, Benjamin; Sun, Xin, 2019, "LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition", https://doi.org/10.11588/data/TMEDTX, heiDATA, V2 This dataset is a corpus of sentence-aligned triples of German audio, German text, and English translation, based on German audio books. The corpus consists of over 100 hours of audio material and over 50k parallel sentences. The speech data are low in disfluencies because of the...
3D Micro-Mapping of Subsidence Stations [Source Code and Data] Jan 15, 2020 - 3D Spatial Data Processing Herfort, Benjamin; Anders, Katharina; Marx, Sabrina; Eberlein, Stefan; Höfle, Bernhard, 2020, "3D Micro-Mapping of Subsidence Stations [Source Code and Data]", https://doi.org/10.11588/data/OU8YA1, heiDATA, V1 This dataset comprises the source code to reproduce the 3D micro-mapping tool for plane adjustment at subsidence stations. In this project, users adjust a plane (height and orientation) at the positions of fixed poles, so-called subsidence stations, to provide information on the...
Opinion role extractor Sep 2, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo) Wiegand, Michael, 2019, "Opinion role extractor", https://doi.org/10.11588/data/3W7AQP, heiDATA, V1 System for the Extraction of Subjective Expressions, Sentiment Sources and Sentiment Targets from German Text
BPEmb: Pre-trained Subword Embeddings in 275 Languages (LREC 2018) Feb 6, 2019 - AIPHES Heinzerling, Benjamin, 2019, "BPEmb: Pre-trained Subword Embeddings in 275 Languages (LREC 2018)", https://doi.org/10.11588/data/V9CXPR, heiDATA, V1 BPEmb is a collection of pre-trained subword unit embeddings in 275 languages, based on Byte-Pair Encoding (BPE). In an evaluation using fine-grained entity typing as testbed, BPEmb performs competitively, and for some languages better than alternative subword approaches, while r...
Source Code, Data and Additional Material for the Thesis: "Aspects of Coherence for Entity Analysis" Feb 6, 2019 - AIPHES Heinzerling, Benjamin, 2019, "Source Code, Data and Additional Material for the Thesis: "Aspects of Coherence for Entity Analysis"", https://doi.org/10.11588/data/9JKAVW, heiDATA, V1 This dataset contains source code and system output used in the PhD thesis "Aspects of Coherence for Entity Analysis". This dataset is split into three parts corresponding to the chapters describing the three main contributions of the thesis: chapter3.tar.gz: Java source code for...
Abstract Anaphora Resolution [Source Code] Feb 4, 2019 - AIPHES Marasovic, Ana, 2019, "Abstract Anaphora Resolution [Source Code]", https://doi.org/10.11588/data/UDMPY5, heiDATA, V1 Abstract Anaphora Resolution (AAR) aims to find the interpretation of nominal expressions (e.g., this result, those two actions) and pronominal expressions (e.g., this, that, it) that refer to abstract-object-antecedents such as facts, events, plans, actions, or situations. The f...

Knowledge-Enhanced Neural Networks for Machine Reading Comprehension [Source Code and Additional Material]

Apr 24, 2024 - AIPHES

Mihaylov, Todor, 2024, "Knowledge-Enhanced Neural Networks for Machine Reading Comprehension [Source Code and Additional Material]", https://doi.org/10.11588/data/HU3ARF, heiDATA, V1

Machine Reading Comprehension is a language understanding task where a system is expected to read a given passage of text and typically answer questions about it. When humans assess the task of reading comprehension, in addition to the presented text, they usually use the knowled...

RATIO_EXPLAIN(Heidelberg University, Department of Computational Linguistics)

Feb 26, 2024

Open Research Data from the ExpLAIN project, a joint research project of the NLP Group at the Computational Linguistics Department of Heidelberg University and the Data and Web Science Groupat University of Mannheim.

Natural Language Processing Group(Universität Heidelberg)

Jan 17, 2024

The main purpose of language is to encode and communicate information of all sorts. Our research focuses on semantics — the study of meaning — and how a machine can assign meaning to utterances: words, sentences and texts, as humans can do. Our work is linguistically informed and...

HITS MBM(Heidelberg Institute for Theoretical Studies)

Nov 8, 2022

Data publications from the the Molecular Biomechanics (MBM) group at HITS. Shareholders of HITS are the “HITS-Stiftung”, Heidelberg University and the Karlsruhe Institute of Technology (KIT)

LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition

Jun 13, 2020 - Statistical Natural Language Processing Group

Beilharz, Benjamin; Sun, Xin, 2019, "LibriVoxDeEn - A Corpus for German-to-English Speech Translation and Speech Recognition", https://doi.org/10.11588/data/TMEDTX, heiDATA, V2

This dataset is a corpus of sentence-aligned triples of German audio, German text, and English translation, based on German audio books. The corpus consists of over 100 hours of audio material and over 50k parallel sentences. The speech data are low in disfluencies because of the...

3D Micro-Mapping of Subsidence Stations [Source Code and Data]

Jan 15, 2020 - 3D Spatial Data Processing

Herfort, Benjamin; Anders, Katharina; Marx, Sabrina; Eberlein, Stefan; Höfle, Bernhard, 2020, "3D Micro-Mapping of Subsidence Stations [Source Code and Data]", https://doi.org/10.11588/data/OU8YA1, heiDATA, V1

This dataset comprises the source code to reproduce the 3D micro-mapping tool for plane adjustment at subsidence stations. In this project, users adjust a plane (height and orientation) at the positions of fixed poles, so-called subsidence stations, to provide information on the...

Opinion role extractor

Sep 2, 2019 - Empirical Linguistics and Computational Language Modeling (LiMo)

Wiegand, Michael, 2019, "Opinion role extractor", https://doi.org/10.11588/data/3W7AQP, heiDATA, V1

System for the Extraction of Subjective Expressions, Sentiment Sources and Sentiment Targets from German Text

BPEmb: Pre-trained Subword Embeddings in 275 Languages (LREC 2018)

Feb 6, 2019 - AIPHES

Heinzerling, Benjamin, 2019, "BPEmb: Pre-trained Subword Embeddings in 275 Languages (LREC 2018)", https://doi.org/10.11588/data/V9CXPR, heiDATA, V1

BPEmb is a collection of pre-trained subword unit embeddings in 275 languages, based on Byte-Pair Encoding (BPE). In an evaluation using fine-grained entity typing as testbed, BPEmb performs competitively, and for some languages better than alternative subword approaches, while r...

Source Code, Data and Additional Material for the Thesis: "Aspects of Coherence for Entity Analysis"

Feb 6, 2019 - AIPHES

Heinzerling, Benjamin, 2019, "Source Code, Data and Additional Material for the Thesis: "Aspects of Coherence for Entity Analysis"", https://doi.org/10.11588/data/9JKAVW, heiDATA, V1

This dataset contains source code and system output used in the PhD thesis "Aspects of Coherence for Entity Analysis". This dataset is split into three parts corresponding to the chapters describing the three main contributions of the thesis: chapter3.tar.gz: Java source code for...

Abstract Anaphora Resolution [Source Code]

Feb 4, 2019 - AIPHES

Marasovic, Ana, 2019, "Abstract Anaphora Resolution [Source Code]", https://doi.org/10.11588/data/UDMPY5, heiDATA, V1

Abstract Anaphora Resolution (AAR) aims to find the interpretation of nominal expressions (e.g., this result, those two actions) and pronominal expressions (e.g., this, that, it) that refer to abstract-object-antecedents such as facts, events, plans, actions, or situations. The f...

Add Data

Share Dataverse

Link Dataverse

Reset Modifications