1 to 10 of 69 Results
Feb 26, 2024
Open Research Data from the ExpLAIN project, a joint research project of the NLP Group at the Computational Linguistics Department of Heidelberg University and the Data and Web Science Groupat University of Mannheim. |
Jan 17, 2024
The main purpose of language is to encode and communicate information of all sorts. Our research focuses on semantics — the study of meaning — and how a machine can assign meaning to utterances: words, sentences and texts, as humans can do. Our work is linguistically informed and... |
Nov 13, 2023Empirical Linguistics and Computational Language Modeling (LiMo)
Research Data to the PhD Projects of Ngoc Do. |
Oct 24, 2023
Open Research Data from the Data Analysis and Modeling in Medicine at the Medical Faculty Mannheim of Heidelberg University. |
Mar 21, 2023 - Ground truth data for HTR on South Asian Scripts
Derrick, Tom; British Library, 2023, "Ground Truth transcriptions for training OCR of historical Bengali printed texts – Recognition of Early Indian Printed Documents competition - updated with improved XML coordinates", https://doi.org/10.11588/data/AIQSXL, heiDATA, V1
This dataset comprises 81 digitised images (TIFF files) drawn from a selection of early printed Bengali books (1713-1914) digitised through the Two Centuries of Indian Print project (https://www.bl.uk/projects/two-centuries-of-indian-print). Also contained are ground truth transc... |
Nov 8, 2022
Data publications from the the Molecular Biomechanics (MBM) group at HITS. Shareholders of HITS are the “HITS-Stiftung”, Heidelberg University and the Karlsruhe Institute of Technology (KIT) |
Oct 26, 2022FID4SA@heiDATA
A collection of Ground Truth data for handwritten and printed text recognition for South Asian scripts provided by FID4SA - Specialized Information Service South Asia. Interested researchers can download the data archived here and use it as training data for their own text recogn... |
Oct 26, 2022
Data publications of the FID4SA – Specialized Information Service South Asia. |
Jul 25, 2022
Data publications of the Scientific Software Center (SSC) at Heidelberg University. |
Jul 14, 2022 - Computer Assisted Clinical Medicine
Zöllner, Frank, 2022, "Synthesis of CT images from digital body phantoms using CycleGAN [dataset]", https://doi.org/10.11588/data/7NRFYC, heiDATA, V1
The potential of medical image analysis with neural networks is limited by the restricted availability of extensive data sets. The incorporation of synthetic training data is one approach to bypass this shortcoming, as synthetic data offer accurate annotations and unlimited data... |