KRENN, BRANTS, SKUT AND USZKOREIT

Language and Computation
RECENT ADVANCES IN CORPUS ANNOTATION

Workshop

BRIGITTE KRENN, THORSTEN BRANTS, WOJCIECH SKUT and HANS USZKOREIT

Computational Linguistics, University of the Saarland

First week
krenn@coli.uni-sb.de

Course description

The use of large annotated (linguistically interpreted) corpora has become a central component of a large number of projects in the field of computational linguistics. But except for English the corpus resources available are rather small. This workshop is intended to bring together those researchers that create reusable annotation schemes and corpora for different types of information (morphology/syntax/semantics/dialogue, competence/performance) and for different languages.

The focus of the workshop will be both on the methodology of corpus annotation, and on current work concerning annotation tools and the interaction between human annotators and automatic parsers.

Authors of accepted papers:

Maria Wolters: "Linguistic Annotation of Two Prosodic Databases"
Judith Klein, Sabine Lehmann, Klaus Netter, Tillmann Wegst: "Construction and Annotation of Test-Items in DiET
Dan Cristea, Ovidiu Craciun, Cristian Ursu: "A Visual Interactive Tool for Discourse Annotation"
Wojciech Skut, Thorsten Brants, Brigitte Krenn, Hans Uszkoreit: "A Linguistically Interpreted Corpus of German Newspaper Text"
Anna Rahman, Geoffrey Sampson: "Extending Grammar Annotation Standards to Spontaneous Speech"
Norbert Reithinger, Michael Kipp: "Large Scale Dialogue Annotation in VERBMOBIL"
Judith Klein, Thierry Declerck: "Annotating German Language Data for Shallow Processing"
Gerold Schneider, Martin Volk: "Adding Manual Constraints and Lexical Look-up to a Brill-Tagger for German"
Remi Zajac: "Annotation Management for Large-Scale NLP"

Prerequisites
None

Literature
No specific recommendation

HOME PROGRAMME CONTACT REGISTRATION

Language and Computation	RECENT ADVANCES IN CORPUS ANNOTATION
Workshop	BRIGITTE KRENN, THORSTEN BRANTS, WOJCIECH SKUT and HANS USZKOREIT Computational Linguistics, University of the Saarland
First week	krenn@coli.uni-sb.de
Course description	The use of large annotated (linguistically interpreted) corpora has become a central component of a large number of projects in the field of computational linguistics. But except for English the corpus resources available are rather small. This workshop is intended to bring together those researchers that create reusable annotation schemes and corpora for different types of information (morphology/syntax/semantics/dialogue, competence/performance) and for different languages. The focus of the workshop will be both on the methodology of corpus annotation, and on current work concerning annotation tools and the interaction between human annotators and automatic parsers. Authors of accepted papers: Maria Wolters: "Linguistic Annotation of Two Prosodic Databases" Judith Klein, Sabine Lehmann, Klaus Netter, Tillmann Wegst: "Construction and Annotation of Test-Items in DiET Dan Cristea, Ovidiu Craciun, Cristian Ursu: "A Visual Interactive Tool for Discourse Annotation" Wojciech Skut, Thorsten Brants, Brigitte Krenn, Hans Uszkoreit: "A Linguistically Interpreted Corpus of German Newspaper Text" Anna Rahman, Geoffrey Sampson: "Extending Grammar Annotation Standards to Spontaneous Speech" Norbert Reithinger, Michael Kipp: "Large Scale Dialogue Annotation in VERBMOBIL" Judith Klein, Thierry Declerck: "Annotating German Language Data for Shallow Processing" Gerold Schneider, Martin Volk: "Adding Manual Constraints and Lexical Look-up to a Brill-Tagger for German" Remi Zajac: "Annotation Management for Large-Scale NLP"
Prerequisites	None
Literature	No specific recommendation