SALSA Corpus Release 2.0

The second version of the German SALSA corpus with role semantic annotation in the Berkeley FrameNet paradigm is available here. The corpus is provided by Manfred Pinkal's Saarbrücken SALSA project, funded by German Science Foundation (DFG).It is based on TIGER (release 2.1), a syntactically annotated German newspaper corpus.

More information on TIGER and FrameNet can be found here:

SALSA uses the frames of FrameNet release 1.2 for the German annotation, wherever available and appropriate. In addition, SALSA has developed a number of "proto-frames", i.e., predicate-specific frames, to provide coverage for predicate instances currently not covered by FrameNet. The total size of the annotation is currently roughly 20.000 verbal instances and more than 17.000 nominal instances. More information on SALSA can be found on the website

The annotation scheme is described in

A. Burchardt, K. Erk, A. Frank, A. Kowalski, S. Padó and M. Pinkal
The SALSA Corpus: a German Corpus Resource for Lexical Semantics.
Proceedings of LREC 2006, Genoa, Italy.

I. Rehbein, J. Ruppenhofer, C. Sporleder and M. Pinkal
Adding nominal spice to SALSA - frame-semantic annotation of German nouns and verbs.
Proceedings of KONVENS 2012, Vienna, Austria.

The license conditions are found here.

For a download request go here.