Computational Linguistics & Phonetics, Fachrichtung 4.7, Universität des Saarlandes

Recent developments in computational discourse processing (Seminar)


A very useful introductory textbook: M. Stede. Discourse Processing. Morgan & Claypool, 2011.
More background in: Chapter 21, D. Jurafsky and J. Martin. Speech and Language Processing. Pearson, 2009.

Below is a list of topics/sessions. For each topic, you should present the paper(s) listed for it. If there are multiple papers, you are expected to present one (sometimes two) of them in detail (title(s) in red) and give an overview of the other(s). In some cases, additional references are provided for background information. Ask us if you have any questions!
  1. Introduction to Computational Discourse
  2. Text Tiling - Tristan
  3. Document Compression - Patrick
  4. Penn Discourse Treebank & Parser (PDTB) - Nikos
    • Lin, Ziheng, Hwee Tou Ng, and Min-Yen Kan. A PDTB-styled end-to-end discourse parser. Natural Language Engineering (2012): 1-34. (FOCUS ON THIS PAPER)
    • The PDTB Research Group. The Penn Discourse TreeBank 1.0. Annotation Manual. IRCS Technical Report IRCS-06-01, Institute for Research in Cognitive Science, University of Pennsylvania. March 2006. (no need to read all of this, but some sections may be useful.)
    • Eleni Miltsakaki, Rashmi Prasad, Aravind Joshi and Bonnie Webber. The Penn Discourse TreeBank. In Proceedings of the Language Resources and Evaluation Conference. Lisbon, Portugal. 2004.
  5. Rhetorical Structure Theory & Parser (RST)
    • Mann, W.C., & Thompson, S.A. 1988. Rhetorical Structure Theory: Toward a functional theory of text organization. Text, 8 (3). 243-281. A useful overview is here.
    • Soricut, Radu, and Daniel Marcu. Sentence Level Discourse Parsing using Syntactic and Lexical Information. In Proceedings of the 2003 Conference of the North American Chapter of the Association for Computational Linguistics on Human Language Technology, Volume 1. Association for Computational Linguistics, 2003.
  6. Entity grid/discourse role matrix for coherence modeling - Richard
  7. Topic Modeling - Bryan (Please make sure to explain some background about topic models)
  8. Discourse Relations vs. Entity-based Coherence - Zhe
  9. Temporal Relation Processing - Susanne
  10. Situation Entity Classification
  11. Coreference Resolution - Omid
  12. Narrative Chains - Liesa
  13. Discourse in Machine Translation - Kleo
  14. Sentiment analysis - Ilya
  15. Summarization - Madhumita
  16. Essay scoring - Anjana
  17. Discourse relation classification - Clayton
  18. Assessing text quality