Language Processing for Different Domains and Genres (WS 2009/10)
What? | Projektseminar: Computational Linguistics (BSc and MSc) |
Who? | Caroline
Sporleder (csporled AT coli) Ines Rehbein (rehbein AT coli) Tutor: Linlin Li (linlin AT coli) |
When? | Thursday 10:00-12:00 |
Where? | Geb. C7 2, Konferenzraum 2.11 |
Course Information
Most work in natural language processing focusses on the news domain. NLP tools such as part-of-speech taggers or syntactic parsers are typically trained and tested on news wire texts, such as the Wall Street Journal. However, most tools are sensitive to domain changes, i.e. their performance degrades (often significantly) when applied to a domain or genre that is different from that of the training data. The manual annotation of large training corpora for the new target domain is typically not a viable solution. But recently there has been an increased interest in developing alternative techniques for cross-domain portability. In the seminar, we want to look at these techniques.
Course Objectives
- learn about linguistic properties of different domains
- learn about different state-of-the-art domain adaptation methods and experiment with some of them
- gain hands-on experience with NLP tools
- Projektseminar (MSc/BSc): class presentation and practical work including a short report (additional oral exam can be arranged)
- als Projektseminar im B.Sc.: Regelstudienzeit 5/6. Semester
- as project seminar in M.Sc. Programm
- als Projektseminar/project seminar(MSc/BSc) 5 CP