Persistent Identifier
|
doi:10.11588/data/KS5W0H |
Publication Date
|
2022-06-02 |
Title
| NLP in Diagnostic Texts from Nephropathology [Research Data] |
Author
| Legnar, Maximilian (Institute of Pathology, Medical Faculty Mannheim, Heidelberg University, Germany)
Daumke, Philipp (Averbis GmbH, Freiburg, Germany)
Hesser, Jürgen (Data Analysis and Modeling, MIISM, Medical School, Interdisciplinary Center for Scientific Computing (IWR), Central Institute for Computer Engineering (ZITI), CZS Heidelberg Center for Model-Based AI, Heidelberg University)
Porubsky, Stefan (Institute of Pathology, Medical Faculty Mainz, University Hospital Mainz, Mainz, Germany)
Popovic, Zoran (Institute of Pathology, Medical Faculty Mannheim, Heidelberg University)
Bindzus, Jan Niklas (Institute of Pathology, Medical Faculty Mannheim, Heidelberg University)
Siemoneit, Joern-Helge (Institute of Pathology, Medical Faculty Mannheim, Heidelberg University)
Weis, Cleo-Aron (Institute of Pathology, Medical Faculty Mannheim, Heidelberg University) |
Point of Contact
|
Use email button above to contact.
Legnar, Maximilian (Institute of Pathology, Medical Faculty Mannheim, Heidelberg University, Germany)
Weis, Cleo-Aron (Institute of Pathology, Medical Faculty Mannheim, Heidelberg University, Germany) |
Description
| This data set contains all annotated topic word tables from the work "NLP in Diagnostic Texts from Nephropathology", as well as all pre-processed and tf-idf-vectorized text files. The raw texts (i.e., descriptive and diagnostic sections) are explicitly not made available, since it cannot be ruled out here that it is possible to infer the patient or the person making the report. This is in accordance with our local ethics committee. Please note: This data set is not yet complete and will be completed soon. Please refer to chapter 3.1.2 of our paper to learn how to interpret the annotated topic word tables. The associated gitlab project http://gitlab.medma.uni-heidelberg.de/mlegnar/nlp-in-diagnostic-texts-from-nephropathology contains some examples of how the .pkl files can be opened and used with python. |
Subject
| Medicine, Health and Life Sciences |
Keyword
| NLP
text analysis
nephropathology
pathology reports
text classification
topic modelling |
Language
| English |
Producer
| Institute of Pathology, Medical Faculty Mannheim, Heidelberg University |
Production Date
| 2022-05-30 |
Contributor
| Data Manager : Legnar, Maximillian
Data Manager : Weis, Cleo-Aron
Editor : Bindzus, Jan Niklas |
Related Material
| http://gitlab.medma.uni-heidelberg.de/mlegnar/nlp-in-diagnostic-texts-from-nephropathology |