next up previous contents index
Next: Subcorpus creation Up: Corpus Management Previous: Corpus Management   Contents   Index


Corpus creation

The tool expects TIGER XML [5] as input format of the corpora to be annotated. The TIGERRegistry adminstration tool which is available online can be used to import various corpus formats.[*]

Each resulting TIGER-XML corpus is stored in a ``corpus directory''. In the SALSA tool the ``master'' directory containing such ``corpus directories'' has to be specified via the menue Edit$\Rightarrow$Setup. Note that you must not choose the TIGER-XML corpus directory itself here but the 'master' directory containing it (see Figure 4). Otherwise the corpus is not found by the tool.



Subsections

Aljoscha Burchardt 2007-09-04