SALSA Corpus Release

A new German corpus with role-semantic annotation in the Berkeley FrameNet paradigm is available here. The corpus is provided by Manfred Pinkal's Saarbrücken SALSA project, funded by the German Science Foundation (DFG). It is based on TIGER (release 1), a syntactically annotated German newspaper corpus.

More information on TIGER and FrameNet can be found here:

http://www.ims.uni-stuttgart.de/projekte/TIGER/
http://framenet.icsi.berkeley.edu

SALSA uses the frames of FrameNet release 1.2 for the German annotation wherever they are available and appropriate. In addition, SALSA has developed a number of "proto-frames", i.e., predicate-specific frames, to cover predicate instances that FrameNet does not currently cover. The annotation currently comprises roughly 20,000 verbal instances. More information on SALSA can be found on the project website:

http://www.coli.uni-saarland.de/projects/salsa/

The annotation scheme is described in:

A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Padó and M. Pinkal
The SALSA Corpus: a German Corpus Resource for Lexical Semantics.
Proceedings of LREC 2006, Genoa, Italy.

The license conditions can be found here.

To request a download, go here.