no code implementations • LREC 2012 • Ond{\v{r}}ej Bojar, Zden{\v{e}}k {\v{Z}}abokrtsk{\'y}, Ond{\v{r}}ej Du{\v{s}}ek, Petra Galu{\v{s}}{\v{c}}{\'a}kov{\'a}, Martin Majli{\v{s}}, David Mare{\v{c}}ek, Ji{\v{r}}{\'\i} Mar{\v{s}}{\'\i}k, Michal Nov{\'a}k, Martin Popel, Ale{\v{s}} Tamchyna
CzEng 1. 0 is automatically aligned at the level of sentences as well as words.
no code implementations • LREC 2012 • Martin Majli{\v{s}}, Zden{\v{e}}k {\v{Z}}abokrtsk{\'y}
The W2C Web Corpus contains more than 100{\textasciitilde}MB of text available for 75 languages.