DiaBruck 2003 Tutorial:
Best Practice in Empirically-based Dialogue Research

David Traum, Institute for Creative Technology, University of Southern California, Marina del Rey, California, USA
Laurent Romary, LORIA, Nancy, France
Michael Strube, EML Research gGmbH, Heidelberg, Germany

Introduction (81 KB)
- Why do we (semanticists, dialogue researchers, dialogue system developers) need empirical data?
- Why do we need to collect data (instead of making them up)?
- Why should we use proper methods for collecting and annotating data?
- The museum of annotation (1.75 MB): An illustrated history of annotation
Corpus development and use life-cycle (51 KB)
Corpus analysis and annotation
- Research question and annotation scheme (136 KB)
- Reliability and the Kappa statistic (52 KB)
Representation, data format, standards (661 KB)
- Stand-off annotation
- Multi-level annotation
- XML
- ISO standardization
Annotation tool (MMAX) (180 KB)
What are the annotated data good for? (31 KB)
- Data generation
- Machine learning
- Evaluation
Discussion (36 KB)
Literature (15 KB)
