Source Code, Data and Additional Material for the Thesis: "Aspects of Coherence for Entity Analysis" (doi:10.11588/data/9JKAVW)

View:

Part 1: Document Description
Part 2: Study Description
Part 5: Other Study-Related Materials
Entire Codebook

Document Description

Citation

Title:

Source Code, Data and Additional Material for the Thesis: "Aspects of Coherence for Entity Analysis"

Identification Number:

doi:10.11588/data/9JKAVW

Distributor:

heiDATA

Date of Distribution:

2019-02-06

Version:

1

Bibliographic Citation:

Heinzerling, Benjamin, 2019, "Source Code, Data and Additional Material for the Thesis: "Aspects of Coherence for Entity Analysis"", https://doi.org/10.11588/data/9JKAVW, heiDATA, V1

Study Description

Citation

Title:

Source Code, Data and Additional Material for the Thesis: "Aspects of Coherence for Entity Analysis"

Identification Number:

doi:10.11588/data/9JKAVW

Authoring Entity:

Heinzerling, Benjamin (Heidelberg University and Natural Language Processing (NLP) Group at the Heidelberg Institute for Theoretical Studies (HITS))

Distributor:

heiDATA

Access Authority:

Heinzerling, Benjamin

Holdings Information:

https://doi.org/10.11588/data/9JKAVW

Study Scope

Keywords:

Computer and Information Science

Abstract:

This dataset contains source code and system output used in the PhD thesis "Aspects of Coherence for Entity Analysis". This dataset is split into three parts corresponding to the chapters describing the three main contributions of the thesis: <ul> <li> chapter3.tar.gz: Java source code for the entity linking system based on interleaved multitasking, system results, system output. Java and Python source code for automatic verification of entity linking results. Java source code for the Visual Entity Explorer. <li> chapter4.tar.gz: Java and Scala source code for extracting pairs of terms and their dependency context from GigaWord and Wikilinks. <li> chapter5.tar.gz: Python code used to run entity typing experiments. </ul>

Methodology and Processing

Sources Statement

Data Access

Other Study Description Materials

Related Studies

Graff, David, and Christopher Cieri. English Gigaword LDC2003T05. Web Download. Philadelphia: Linguistic Data Consortium, 2003. URL: <a href="https://catalog.ldc.upenn.edu/LDC2003T05">https://catalog.ldc.upenn.edu/LDC2003T05</a>

Singh, Sameer, Amarnag Subramanya, Fernando Pereira, and Andrew McCallum. "Wikilinks: A large-scale cross-document coreference corpus labeled via links to Wikipedia." University of Massachusetts, Amherst, Tech. Rep. UM-CS-2012 15 (2012). URL: <a http://www.iesl.cs.umass.edu/data/data-wiki-links>http://www.iesl.cs.umass.edu/data/data-wiki-links</a>

Other Study-Related Materials

Label:

chapter3.tar.gz

Text:

Files related to chapter 3

Notes:

application/gzip

Other Study-Related Materials

Label:

chapter4.tar.gz

Text:

Files related to chapter 4

Notes:

application/gzip

Other Study-Related Materials

Label:

chapter5.tar.gz

Text:

Files related to chapter 5

Notes:

application/gzip