Jakob Prange, Andrea Horbach and Stefan Thater: UdS-(retrain|distributional|surface): Improving POS Tagging for OOV Words in German CMC and Web Data
System description paper for the EmpiriST Shared Task
WAC-X 2016
Andrea Horbach, Andrea Hensler, Sabine Krome, Jakob Prange, Werner Scholze-Stubenrecht, Diana Steffen, Stefan Thater, Christian Wellner and Manfred Pinkal:
A Corpus of Literal and Idiomatic Uses of German Infinitive-Verb Compounds
LREC 2016
Michael Beißwenger, Eric Ehrhardt, Andrea Horbach, Harald Lüngen, Diana Steffen and Angelika Storrer:
Adding Value to CMC Corpora: CLARINification and Part-of-speech Annotation of the Dortmund Chat Corpus
NLP4CMC 2015 workshop
Jakob Prange, Stefan Thater and Andrea Horbach:
Unsupervised Induction of Part-of-Speech Information for OOV Words in German Internet Forum Posts
NLP4CMC 2015 workshop
Andrea Horbach, Diana Steffen, Stefan Thater and Manfred Pinkal:
Improving the Performance of Standard Part-of-Speech Taggers for Computer-Mediated Communication
Proceedings of KONVENS 2014
Andrea Horbach, Stefan Thater, Diana Steffen, Peter M. Fischer, Andreas Witt and Manfred Pinkal:
Internet Corpora: A Challenge for Linguistic Processing
Datenbank-Spektrum, Springer 2015
Bachelor and Master Theses
Jana Ott: POS-Tagging von deutschen Internettexten: Untersuchung von verschiedenen Verfahren zur Performanzsteigerung, Bachelor thesis, November 2014
Jakob Prange: Part-of-Speech-Tagging von Internettexten mithilfe von distributioneller Ähnlichkeitsinformation, Bachelor thesis, August 2015
Christian Wellner: Automatische Klassifikation von Verb-Verb-Gefügen für die Beobachtung des Schreibgebrauchs, Bachelor thesis, December 2015