no code implementations • LREC (LAW) 2022 • Eva Hajicova, Marie Mikulová, Barbora Štěpánková, Jiří Mírovský
Recently, many corpora have been developed that contain multiple annotations of various linguistic phenomena, from morphological categories of words through the syntactic structure of sentences to discourse and coreference relations in texts.
no code implementations • LREC 2022 • Marie Mikulová, Milan Straka, Jan Štěpánek, Barbora Štěpánková, Jan Hajič
This paper presents an analysis of annotation using an automatic pre-annotation for a mid-level annotation complexity task -- dependency syntax annotation.
no code implementations • 5 Jun 2020 • Jan Hajič, Eduard Bejček, Jaroslava Hlaváčová, Marie Mikulová, Milan Straka, Jan Štěpánek, Barbora Štěpánková
We present a richly annotated and genre-diversified language resource, the Prague Dependency Treebank-Consolidated 1. 0 (PDT-C 1. 0), the purpose of which is - as it always been the case for the family of the Prague Dependency Treebanks - to serve both as a training data for various types of NLP tasks as well as for linguistically-oriented research.