10,001 to 10,010 of 10,016 Results
Adobe PDF - 144.2 KB -
MD5: 36006e032c23f166f4cb4841b1d97b6b
|
Stata Syntax - 1.3 KB -
MD5: c35d25df0390b7c9d0be1b1bd687b9cd
STATA do file |
Jun 16, 2014 - Statistical Natural Language Processing Group
Sokolov, Artem; Jehl Laura; Hieber Felix; Ruppert, Eugen; Riezler, Stefan, 2014, "BoostCLIR: JP-EN Relevance Marked Patent Corpus", https://doi.org/10.11588/data/10001, heiDATA, V1
BoostCLIR is a bilingual (Japanese-English) corpus of patent abstracts, extracted from the MAREC patent data, and the data from the NTCIR PatentMT workshop collections, accompanied with relevance judgements for the task of patent prior-art search. Important: The English side of t... |
Jun 16, 2014 -
BoostCLIR: JP-EN Relevance Marked Patent Corpus
Gzip Archive - 241.8 MB -
MD5: 35fde8d24e6e80bf932490549c991a3f
data set |
Jun 16, 2014 -
BoostCLIR: JP-EN Relevance Marked Patent Corpus
Plain Text - 1.5 KB -
MD5: 544fa4db045f692d07a7d4596da99741
README |
Jun 16, 2014 - Statistical Natural Language Processing Group
Wäschle, Katharina; Riezler, Stefan, 2014, "PatTR: Patent Translation Resource", https://doi.org/10.11588/data/10002, heiDATA, V3
PatTR is a sentence-parallel corpus extracted from the MAREC patent collection. The current version contains more than 22 million German-English and 18 million French-English parallel sentences collected from all patent text sections as well as 5 million German-French sentence pa... |
Jun 5, 2014 -
PatTR: Patent Translation Resource
Gzip Archive - 234.3 MB -
MD5: 3bd140f68ab0eefe239e3e893012c991
data set de-en, Part 1/3 (License information: see part 1) |
Jun 5, 2014 -
PatTR: Patent Translation Resource
Gzip Archive - 1.3 GB -
MD5: 2d1336fe8eecd100c01488f5e3e9bc97
data set de-en, Part 2/3 |
Jun 5, 2014 -
PatTR: Patent Translation Resource
Gzip Archive - 1.3 GB -
MD5: b838211b8ddc04001d79f7e1e2e066cb
data set de-en, Part 2/3 (License information: see part 1) |
Jun 5, 2014 -
PatTR: Patent Translation Resource
Gzip Archive - 669.7 MB -
MD5: bf9d77a06ebd10d50648c2c8d300c5e2
data set en-fr, Part 1/3 |