Computational Linguistics & Phonetics Computational Linguistics & Phonetics Fachrichtung 4.7 Universität des Saarlandes
Suggested Papers for Seminar Below are suggested research papers on multi-word expressions for your presentation or term paper (slides). If you have additional suggestions for papers to work with please contact me.

Tim Baldwin

  • Timothy Baldwin Bahar Salehi, Paul Cook. Using distributional similarity of multi-way translations to predict multiword expression compositionality. In Proceedings of the 14th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2014), pages 472-481, Gothenburg, Sweden, 2014.
  • Timothy Baldwin Bahar Salehi, Paul Cook. A word embedding approach to predicting the compositionality of multiword expressions. In Proceedings of the 2015 Conference of the North American Chapter of the Association for Computational Linguistics - Human Language Technologies (NAACL HLT 2015), pages 977-983, Denver, USA, 2015.
  • Francis Bond Ann Copestake Dan Flickinger Ivan Sag, Timothy Baldwin. Multiword expressions: A pain in the neck for nlp. In Proceedings of the Third International Conference on Intelligent Text Processing and Computational Linguistics (CICLING 2002), pages 1-5, Mexico City, Mexico, 2002.
  • Takaaki Tanaka Dominic Widdows Timothy Baldwin, Colin Bannard. An empirical model of multiword expression decomposability. In Proceedings of the ACL-2003 Workshop on Multiword Expressions: Analysis, Acquisition and Treatment, pages 89-96, Sapporo, Japan, 2003.

Afsaneh Fazly

  • Ryan North Afsaneh Fazly, Suzanne Stevenson. Automatically learning semantic knowledge about multiword predicates. Journal of Language Resources and Evaluation, 41(1), 2007a.
  • Suzanne Stevenson Afsaneh Fazly. Distinguishing subtypes of multiword expressions using linguistically-motivated statistical measures. In Proceedings of the ACL.07 Workshop on A Broader Perspective on Multiword Expressions, Prague, Czech Republic, 2007b.
  • Suzanne Stevenson Afsaneh Fazly. A distributional account of the semantics of multiword expressions. special issue of the Italian Journal of Linguistics, 20(1): 157-179, 2008.
  • Suzanne Stevenson Afsaneh Fazly, Aida Nematzadeh. Acquiring multiword verbs: The role of statistical evidence. In Proceedings of the 31st Annual Conference of the Cognitive Science Society, Amsterdam, Netherlands, 2009a. Suzanne Stevenson Afsaneh Fazly, Paul Cook. Unsupervised type and token identification of idiomatic expressions. Computational Linguistics, 35(1):61-103, 2009b.
  • Suzanne Stevenson Afsaneh Fazly, Ryab North. Automatically determining allowable combinations of a class of flexible multiword expressions. In Proceedings of the International Conference on Computational Linguistics and Intelligent Text Processing (CICLing 2006), pages 81-92, Mexico City, Mexico, 2006.
  • Ryan North Suzanne Stevenson, Afsaneh Fazly. Statistical measures of the semi-productivity of light verb constructions. In Proceedings of the ACL 2004 Workshop on Multiword Expressions: Integrating Processing, Barcelona, Spain, 2004.

Other

  • Marine Carpuat and Mona Diab. Task-based evaluation of multiword expressions: A pilot study in statistical machine translation. In Human Language Technologies: The 2010 Annual Conference of the North American Chapter of the Association for Computational Linguistics, HLT .10, pages 242-245, Stroudsburg, PA, USA, 2010.
  • Ricarda Dormeyer and Ingrid Fischer. Building Lexicons out of a Database for Idioms. In Antonio Rubio, Nativiad Gallardo, Rosa Castro, and Antonio Tejada, editors, Proceedings of the First International Conference on Language Resources and Evaluation, pages 833-838, 1998.
  • Anna Feldman and Jing Peng. Automatic detection of idiomatic clauses. In Computational Linguistics and Intelligent Text Processing: 14th International Conference (CICLing), pages 435-446, Samos, Greece, 2013.
  • Graham Katz. Automatic identification of non-compositional multi-word expressions using latent semantic analysis. In In Proceedings of the ACL/COLING-06 Workshop on Multiword Expressions: Identifying and Exploiting Underlying Properties, pages 12-19, Sydney, Australia, 2006.
  • Linlin Li and Caroline Sporleder. Linguistic cues for distinguishing literal and non-literal usages. In Proceedings of the 2010 Conference for the Association for Computational Linguistics, 2010.
  • Grace Muzny and Luke S. Zettlemoyer. Automatic idiom identification in wiktionary. In Proceedings of the 2013 Conference on Empirical Methods for Natural Language Processing (EMNLP), pages 1417-1421, Seattle, Washington, 2013. Jing Peng, Anna Feldman, and Ekaterina Vylomova. Classifying idiomatic and literal expressions using topic models and intensity of emotions. In Proceedings of the 2014 Conference on Empirical Methods in Natural Language Processing (EMNLP), pages 2019-2027, Doha, Qatar, October 2014.
  • Maria Pershina, Yifan He, and Ralph Grishman. Idiom paraphrases: Seventh heaven vs cloud nine. In Proceedings of the First Workshop on Linking Computational Models of Lexical, Sentential and Discourse-level Semantics, pages 76-82, Lisbon, Portugal, September 2015.
  • Caroline Sporleder and Linlin Li. Unsupervised recognition of literal and non-literal use of idiomatic expressions. In Proceedings of the 12th Conference of the European Chapter of the ACL (EACL 2009), pages 754-762, Athens, Greece, March 2009.

Also

check out the papers that were published in the Multi-word expressions workshops over the years, e.g.