Computational Linguistics & Phonetics Co
mputational Linguistics & Phonetics Fachrichtung 4.7Universit344t des Saarlan
des

Language Processing for Different Domains and Genres (WS 2009/10)


What? Projektseminar: Computational Linguistics (BSc and MSc)
Who? Caroline Sporleder   (csporled AT coli)
Ines Rehbein   (rehbein AT coli)
Tutor: Linlin Li   (linlin AT coli)
When? Thursday 10:00-12:00
Where?   Geb. C7 2, Konferenzraum 2.11


Course Information

Most work in natural language processing focusses on the news domain. NLP tools such as part-of-speech taggers or syntactic parsers are typically trained and tested on news wire texts, such as the Wall Street Journal. However, most tools are sensitive to domain changes, i.e. their performance degrades (often significantly) when applied to a domain or genre that is different from that of the training data. The manual annotation of large training corpora for the new target domain is typically not a viable solution. But recently there has been an increased interest in developing alternative techniques for cross-domain portability. In the seminar, we want to look at these techniques.

Course Objectives
  • learn about linguistic properties of different domains
  • learn about different state-of-the-art domain adaptation methods and experiment with some of them
  • gain hands-on experience with NLP tools
Scheine (Coli)
  • Projektseminar (MSc/BSc): class presentation and practical work including a short report (additional oral exam can be arranged)
Stellung im Studienplan (Coli)
  • als Projektseminar im B.Sc.: Regelstudienzeit 5/6. Semester
  • as project seminar in M.Sc. Programm
Leistungspunkte (Coli)
  • als Projektseminar/project seminar(MSc/BSc) 5 CP