5th International Workshop on

Linguistically Interpreted Corpora (LINC-04)

A workshop to be held at

the 20th International Conference on Computational Linguistics

Geneva, 29 August 2004

Organized by:

Silvia Hansen-Schirra (Saarland University)
Stephan Oepen (University of Oslo & CSLI Stanford)
Hans Uszkoreit (Saarland University & DFKI Saarbrücken)

Topic and Motivation

Large linguistically interpreted corpora play an increasingly important role in machine learning, evaluation, psycholinguistics, and theoretical linguistics. Many research groups are engaged in the creation of corpus resources annotated with morphological, syntactic, semantic, and discourse information for a variety of languages. Since 1999, the LINC workshop series has brought together these activities in order to identify and disseminate best practice in the development and utilization of linguistically interpreted corpora.

The aim of the workshop is to exchange and propagate research results with respect to the annotation, conversion, and exploitation of corpora, taking into account different applications and theoretical investigations in the field of language technology and research. We invite submissions of papers constituting substantial, original, and unpublished work on all aspects of linguistically interpreted corpora, including, but not limited to:

  • creation of practical annotation schemes;
  • efficient annotation techniques;
  • automation of corpus annotation;
  • tools supporting corpus conversions;
  • validation including consistency checking of corpora;
  • browsing corpora and searching for instances of linguistic phenomena;
  • relating actual annotation to contemporary linguistic theory;
  • interpretation of quantitative results; and
  • use of annotated corpora in the automated induction of linguistic knowledge.
Rob Malouf, Professor at San Diego State University, will give a keynote presentation on utilizing a dependency-annotated corpus in connection with broad-coverage deep grammatical analysis.

Workshop Programme

9:00 – 9:15 Welcome & Introduction
Silvia Hansen-Schirra, Stephan Oepen, and Hans Uszkoreit
9:15 – 9:45 The HOLJ Corpus. Supporting Summarisation of Legal Texts
Claire Grover, Ben Hachey, and Ian Hughson
9:45 – 10:15 Towards User-Adaptive Annotation Guidelines
Stefanie Dipper, Michael Goetze, and Stavros Skopeteas
10:15 – 10:45 The TIGER Dependency Bank
Martin Forst, Nuria Bertomeu, Berthold Crysmann, Frederik Fouvry, Silvia Hansen-Schirra, and Valia Kordoni
11:00 – 11:30 Coffee Break
11:30 – 12:00 Word Order Variation in German Main Clauses
Andrea Weber and Karin Mueller
12:00 – 12:30 Inflectional Syncretism and Corpora
Dunstan Brown, Carole Tiberius, and Greville G. Corbett
12:30 – 13:00 Discussion
13:00 – 14:30 Lunch Break
14:30 – 15:30 The Hinoki Treebank. Working Toward Text Understanding
Francis Bond, Sanae Fujita, Chikara Hashimoto, Kaname Kasahara, Shigeko Nariyama, Eric Nichols, Akira Ohtani, Takaaki Tanaka, and Shigeaki Amano
The Szeged Corpus. A POS Tagged and Syntactically Annotated Hungarian Natural Language Corpus
Dora Csendes, Janos Csirik, and Tibor Gyimothy
Mining Linguistically Interpreted Texts
Cassiana Fagundes da Silva, Renata Vieira, Fernando Santos Osorio, and Paulo Quaresma
Automated Induction of Sense in Context
James Pustejovsky, Patrick Hanks, and Anna Rumshisky
15:30 – 16:00 Coffee Break
16:00 – 16:30 Corpus-based Induction of an LFG Syntax-Semantics Interface for Frame Semantic Processing
Anette Frank and Jiri Semecky
16:30 – 17:00 Bootstrapping Parallel Treebanks
Martin Volk and Yvonne Samuelsson
17:00 – 17:30 Discussion

Programme Committee

  • Francis Bond, Kyoto
  • Pierrette Bouillon, Geneva
  • Thorsten Brants, Mountain View
  • John Carroll, Sussex
  • Tomaz Erjavec, Ljubljana
  • Dan Flickinger, Stanford
  • Silvia Hansen-Schirra (co-chair), Saarbrücken
  • Frank Keller, Edinburgh
  • Brigitte Krenn, Vienna
  • Joakim Nivre, Vaxjo
  • Stephan Oepen (co-chair), Oslo & Stanford
  • Laurent Romary, Nancy
  • Geoffrey Sampson, Sussex
  • Kiril Simov, Sofia
  • Hans Uszkoreit (co-chair), Saarbrücken
  • Jean Veronis, Aix-en-Provence
  • Atro Voutilainen, Helsinki
  • Jakub Zavrel, Antwerp

Additional Reviewers

  • Stella Neumann, Saarbrücken
  • Andreas Eisele, Saarbrücken
  • Valia Kordoni, Saarbrücken
  • Gregor Erbach, Saarbrücken
  • Nuria Bertomeu, Saarbrücken
  • Frederik Fouvry, Saarbrücken
  • Michaela Mahlberg, Saarbrücken


Important Dates

  • April 11, 2004: Deadline for paper submission;
  • May 7, 2004: Notification of acceptance;
  • June 11, 2004: Camera-ready copies due;
  • August 29, 2004: Workshop in Geneva (9:15 – 18:00).


Please refer to the main conference web pages for registration details.


Printed and electronic workshop proceedings are available from the COLING organizers. We will consider subsequent publication as an edited collection if there is sufficient demand.
last modified: 19-aug-04 (oe@csli.stanford.edu)