DeModify (doi:10.11588/data/KIWEMF)

Document Description

Citation

Title:

DeModify

Identification Number:

doi:10.11588/data/KIWEMF

Distributor:

heiDATA

Date of Distribution:

2019-07-15

Version:

1

Bibliographic Citation:

Nastase, Vivi; Fritz, Devon; Frank, Anette, 2019, "DeModify", https://doi.org/10.11588/data/KIWEMF, heiDATA, V1

Study Description

Citation

Title:

DeModify

Subtitle:

A Dataset for Analyzing Contextual Constraints on Modifier Deletion

Identification Number:

doi:10.11588/data/KIWEMF

Authoring Entity:

Nastase, Vivi (Department of Computational Linguistics, Heidelberg University, Germany)

Fritz, Devon (Department of Computational Linguistics, Heidelberg University, Germany)

Frank, Anette (Department of Computational Linguistics, Heidelberg University, Germany)

Date of Production:

2018

Distributor:

heiDATA

Access Authority:

Nastase, Vivi

Holdings Information:

https://doi.org/10.11588/data/KIWEMF

Study Scope

Keywords:

Arts and Humanities, Computer and Information Science, annotation, modifier deletion, text simplification

Topic Classification:

knowledge discovery

Abstract:

DeModify consists of 3631 instances, each with three annotations obtained through CrowdFlower. An instance is a short story in which one modifier is annotated for its impact on the information in the story, assessed by deleting it from its context. The possible labels are crucial, not-crucial, and ungrammatical. From these annotations we created two gold standards: strict (instances on which all annotators agree) and relaxed (majority voting). The archive also contains a file that proposes a split into 5 folds for the instances that belong to either gold standard.
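The difference between the two gold standards can be illustrated with a short Python sketch. It is not the authors' code: the function name `gold_labels` and the toy annotations are hypothetical, and the real column layout of demodify.tsv is not assumed here. The sketch only assumes three annotations per instance, as stated in the abstract.

```python
from collections import Counter

def gold_labels(annotations):
    """Return (strict, relaxed) labels for one instance.

    strict  : the label if all annotators agree, else None
    relaxed : the label chosen by more than half of the annotators, else None
    """
    counts = Counter(annotations)
    label, votes = counts.most_common(1)[0]
    strict = label if votes == len(annotations) else None
    relaxed = label if votes > len(annotations) / 2 else None
    return strict, relaxed

# Hypothetical annotations, three per instance (not taken from the dataset).
toy_instances = {
    "unanimous": ["crucial", "crucial", "crucial"],
    "majority": ["crucial", "not-crucial", "crucial"],
    "no_majority": ["crucial", "not-crucial", "ungrammatical"],
}

for name, annotations in toy_instances.items():
    strict, relaxed = gold_labels(annotations)
    print(name, "strict:", strict, "relaxed:", relaxed)
```

In this sketch an instance with three different labels receives no label under either standard, and a unanimous instance counts toward both.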

Kind of Data:

tab delimited text data

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

Related Materials

Argumentative texts from:

Andreas Peldszus, Manfred Stede (2015). An annotated corpus of argumentative microtexts. In First European Conference on Argumentation: Argumentation and Reasoned Action, Lisbon, Portugal, June 2015.

Link to the publication: http://www.ling.uni-potsdam.de/~peldszus/eca2015-preprint.pdf

Link to the arg-microtexts corpus: https://github.com/peldszus/arg-microtexts

License: Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International

Short stories from:

Mostafazadeh, N., Chambers, N., He, X., Parikh, D., Batra, D., Vanderwende, L., Kohli, P., and Allen, J. (2016). A Corpus and Cloze Evaluation for Deeper Understanding of Commonsense Stories. In Proceedings of the 2016 Conference of the North American Chapter of the Association for Computational Linguistics: Human Language Technologies (NAACL HLT), pages 839–849, San Diego, California, June 12-17, 2016.

Link to the publication: https://www.aclweb.org/anthology/N16-1098

Link to the dataset: http://cs.rochester.edu/nlp/rocstories/

Access to the dataset: free to everyone

Related Publications

Citation

Title:

Nastase, V., Fritz, D., and Frank, A. (2018). DeModify: A dataset for analyzing contextual constraints on modifier deletion. In Proceedings of the 11th International Conference on Language Resources and Evaluation, pages 1357-1363, 7-12 May 2018, Miyazaki, Japan.

Identification Number:

https://www.aclweb.org/anthology/L18-1217

Bibliographic Citation:

Nastase, V., Fritz, D., and Frank, A. (2018). DeModify: A dataset for analyzing contextual constraints on modifier deletion. In Proceedings of the 11th International Conference on Language Resources and Evaluation, pages 1357-1363, 7-12 May 2018, Miyazaki, Japan.

Other Study-Related Materials

Label:

demodify.data_split.tsv

Notes:

text/tsv

Other Study-Related Materials

Label:

demodify.tsv

Notes:

text/tsv

Other Study-Related Materials

Label:

README

Notes:

text/plain; charset=US-ASCII