Computational Linguistics & Phonetics, Fachrichtung 4.7, Universität des Saarlandes

Unlocking the Secrets of the Past: Text Mining for Historical Documents (WS 2010/11)



Detailed Schedule



Week One
Monday 21.02.
9:30-11:00 Organisational Stuff, Motivation/Case Study (slides), Introduction to NLP (slides)
11:00-11:15 Coffee Break
11:15-12:45 Introduction to Historic Research: What is History?, Historical Sources, Archives (Martin Schreiber, Michael Röhrig) (slides)
12:45-14:00 Lunch Break
14:00-15:30 Digitisation and Preprocessing
  • Digitisation, Preservation of Digital Data, Dealing with OCR Errors (slides)
Dealing with Non-Standard Language
  • Dealing with non-standard orthography (Christian Meyer) (slides)
15:30-15:45 Coffee Break
15:45-18:00 Group Formation, Group Work

Tuesday 22.02.
10:00-10:45 Standardisation, Meta-Data, Semantic Web (slides)
10:45-11:00 Coffee Break
11:00-12:30 Semantic Web, Meta-Data
  • Ontologies (Cosmina Croitoru) (slides)
  • Inferring Metadata (Patricia Helmich) (slides)
12:45-14:15 Lunch Break
14:15-15:45 Information Extraction, Text Mining
  • Event Recognition (Wenbin Lee) (slides)
15:45-18:00 Group Work
Wednesday 23.02.
10:00-11:30 Multi-Modal Data
  • Speech (Daan Henselmans) (slides)
Personalisation
  • Personalisation: M-Piro (Angeliki Lazaridou) (slides)
11:30-18:00 Group Work
Thursday 24.02.
9:30-18:00 Group Work
Friday 25.02.
10:00-18:00 Group Work
Week Two
Monday 28.02.
14:00-16:00 Taking stock: presentations of projects
  • Daan and Wenbin:
    Entity Cube - American Civil War
    slides
  • Angeliki, Christian, Cosmina, Patricia:
    Pseudo-Relevance Feedback and Query Expansion, Europarl
Tuesday 01.03.
9:30-18:00 Guided Tour of City Archive (11:00-12:30), Group Work
Wednesday 02.03.
9:30-18:00 Group Work
Thursday 03.03.
9:30-18:00 Group Work
Friday 04.03.
9:30-12:30 Group Work
Week Three
Monday 07.03.
9:30-18:00 Group Work
Tuesday 08.03.
9:30-18:00 Group Work
Wednesday 09.03.
9:30-18:00 Group Work
Thursday 10.03.
9:30-18:00 Group Work
Friday 11.03.
10:00-13:00 Final Presentation of Group Projects