no code implementations • LREC 2020 • Aleks Berdicevskis, rs, Hanne Eckhoff
We describe the Troms{\o} Old Russian and Old Church Slavonic Treebank (TOROT) that spans from the earliest Old Church Slavonic to modern Russian texts, covering more than a thousand years of continuous language history.
no code implementations • WS 2018 • Aleks Berdicevskis, rs, {\c{C}}a{\u{g}}r{\i} {\c{C}}{\"o}ltekin, Katharina Ehret, Kilu von Prince, Daniel Ross, Bill Thompson, Chunxiao Yan, Vera Demberg, Gary Lupyan, Taraka Rama, Christian Bentz
We evaluate corpus-based measures of linguistic complexity obtained using Universal Dependencies (UD) treebanks.
no code implementations • WS 2016 • Hanne Martine Eckhoff, Aleks Berdi{\v{c}}evskis, rs
Historical treebanks tend to be manually annotated, which is not surprising, since state-of-the-art parsers are not accurate enough to ensure high-quality annotation for historical texts.
no code implementations • WS 2016 • Christian Bentz, Aleks Berdicevskis, rs
The morphological complexity of languages differs widely and changes over time.