Search Results for author: Thomas Proisl

Found 16 papers, 3 papers with code

The_Illiterati: Part-of-Speech Tagging for Magahi and Bhojpuri without even knowing the alphabet

no code implementations • NSURL 2019 • Thomas Proisl, Peter Uhrig, Andreas Blombach, Natalie Dykes, Philipp Heinrich, Besim Kabashi, Sefora Mammarella

Part-Of-Speech Tagging

Paper
Add Code

A Corpus of German Reddit Exchanges (GeRedE)

no code implementations • LREC 2020 • Andreas Blombach, Natalie Dykes, Philipp Heinrich, Besim Kabashi, Thomas Proisl

GeRedE is a 270 million token German CMC corpus containing approximately 380, 000 submissions and 6, 800, 000 comments posted on Reddit between 2010 and 2018.

Paper
Add Code

EmpiriST Corpus 2.0: Adding Manual Normalization, Lemmatization and Semantic Tagging to a German Web and CMC Corpus

no code implementations • LREC 2020 • Thomas Proisl, Natalie Dykes, Philipp Heinrich, Besim Kabashi, Andreas Blombach, Stefan Evert

The EmpiriST corpus (Bei{\ss}wenger et al., 2016) is a manually tokenized and part-of-speech tagged corpus of approximately 23, 000 tokens of German Web and CMC (computer-mediated communication) data.

Lemmatization

Paper
Add Code

EmotiKLUE at IEST 2018: Topic-Informed Classification of Implicit Emotions

1 code implementation • WS 2018 • Thomas Proisl, Philipp Heinrich, Besim Kabashi, Stefan Evert

EmotiKLUE is a submission to the Implicit Emotion Shared Task.

Classification General Classification +1

Paper
Code

Albanian Part-of-Speech Tagging: Gold Standard and Evaluation

no code implementations • LREC 2018 • Besim Kabashi, Thomas Proisl

Morphological Analysis Part-Of-Speech Tagging

Paper
Add Code

SoMeWeTa: A Part-of-Speech Tagger for German Social Media and Web Texts

1 code implementation • LREC 2018 • Thomas Proisl

Domain Adaptation Lemmatization +3

Paper
Code

Delta vs. N-Gram Tracing: Evaluating the Robustness of Authorship Attribution Methods

no code implementations • LREC 2018 • Thomas Proisl, Stefan Evert, Fotis Jannidis, Christof Sch{\"o}ch, Leonard Konle, Steffen Pielstr{\"o}m

Authorship Attribution Optical Character Recognition (OCR)

Paper
Add Code

SoMaJo: State-of-the-art tokenization for German web and social media texts

1 code implementation • WS 2016 • Thomas Proisl, Peter Uhrig

Lemmatization

134

Paper
Code

A Proposal for a Part-of-Speech Tagset for the Albanian Language

no code implementations • LREC 2016 • Besim Kabashi, Thomas Proisl

Part-of-speech tagging is a basic step in Natural Language Processing that is often essential.

Part-Of-Speech Tagging

Paper
Add Code

SemantiKLUE: Semantic Textual Similarity with Maximum Weight Matching

no code implementations • SEMEVAL 2015 • Nataliia Plotnikova, Gabriella Lapesa, Thomas Proisl, Stefan Evert

Semantic Textual Similarity Word Alignment

Paper
Add Code

Towards a better understanding of Burrows's Delta in literary authorship attribution

no code implementations • WS 2015 • Stefan Evert, Thomas Proisl, Thorsten Vitt, Christof Sch{\"o}ch, Fotis Jannidis, Steffen Pielstr{\"o}m

Authorship Attribution Text Clustering

Paper
Add Code

SentiKLUE: Updating a Polarity Classifier in 48 Hours

no code implementations • SEMEVAL 2014 • Stefan Evert, Thomas Proisl, Paul Greiner, Besim Kabashi

Sentiment Analysis

Paper
Add Code

SemantiKLUE: Robust Semantic Similarity at Multiple Levels Using Maximum Weight Matching

no code implementations • SEMEVAL 2014 • Thomas Proisl, Stefan Evert, Paul Greiner, Besim Kabashi

Question Answering Semantic Similarity +2

Paper
Add Code

KLUE-CORE: A regression model of semantic textual similarity

no code implementations • SEMEVAL 2013 • Paul Greiner, Thomas Proisl, Stefan Evert, Besim Kabashi

Lemmatization Question Answering +2

Paper
Add Code

KLUE: Simple and robust methods for polarity classification

no code implementations • SEMEVAL 2013 • Thomas Proisl, Paul Greiner, Stefan Evert, Besim Kabashi

Classification General Classification +1

Paper
Add Code

Efficient Dependency Graph Matching with the IMS Open Corpus Workbench

no code implementations • LREC 2012 • Thomas Proisl, Peter Uhrig

State-of-the-art dependency representations such as the Stanford Typed Dependencies may represent the grammatical relations in a sentence as directed, possibly cyclic graphs.

Dependency Parsing Graph Matching +2

Paper
Add Code

Cannot find the paper you are looking for? You can Submit a new open access paper.