Computational Linguistics & Phonetics Computational Linguistics & Phonetics Fachrichtung 4.7 Universität des Saarlandes

NLP/Text Mining for Historical Texts

Projektseminar: Computerlinguistik, B.Sc./M.Sc.

Leitung: Sporleder, Schreiber
Ort: wird noch bekannt gegeben
Zeit: Blockseminar (voraussichtl./probably 22.2.-5.3.2010)
Beginn: wird noch bekannt gegeben
Geeignet für: B.Sc./M.Sc.

This is an interdisciplinary course that is open for both, students from Computational Linguistics and History. The aim is to design, implement and test practical NLP and text mining solutions to make historical documents more accessible. Possible topics include: detecting and correcting (OCR) errors , information extraction from historical manuscripts, finding links between documents, converting unstructured documents into searchable databases, knowledge discovery from historical documents.

The course consists of a theoretical and a practical part. In the theoretical part, students give a presentation on topics relevant to the course. In the practical part, small interdisciplinary groups will work on implementing a system that solves a real problem relevant for the documents discussed in the seminar.

Note, if you are interested in joining this course please send an email to csporled@coli. We'll have an introductory meeting on January, 28, 2010, 18:00-19:00 (Geb . C7.2, Konferenzraum 2.11).

Zur Homepage des Kurses

Literatur
Research papers on language processing for cultural heritage data. Details will be given on the course homepage and during the Vorbesprechung/introductory meeting.

Bemerkungen/Voraussetzungen
for M.Sc. and advanced B.Sc. students

Scheine
Presentation plus practical work (incl. brief report)

Stellung im Studienplan
Regelstudienzeit B.Sc. 5. Semester;
standard time slot for M.Sc. students: 3rd semester

Leistungspunkte
5 CP