Computational Linguistics & Phonetics Computational Linguistics & Phonetics Fachrichtung 4.7 Universität des Saarlandes

Computational Linguistics Colloquium

Thursday, 17 January 2013, 16:15
Conference room, Building C7.4

Reconstructing and Exploiting Semantics of Mathematical Documents in Science, Techology, and Engineering

Michael Kohlhase
KWARC Group, School of Engineering and Science
Jacobs University Bremen

This talk addresses an application field that has largely ignored by computational linguistics: Mathematical Documents in Science, Technology, and Engineering (STEMDoc).

Formalization structure of the knowledge underlying such documents have been studied in Artificial Intelligence for the use in knowledge-based systems, but the whole-sale and largely automated semantization is still an unsolved problem. For instance, the knowledge base underlying Wolfram Alpha is the product of a human formalization effort of ca. 500 person-years so far.

In an effort to scale mathematical knowledge management (MKM) technologies the KWARC group at Jacobs University has started work on extracting semantic representations from various large STEMDoc corpora and built applications ranging from semantic publishing to formula search engines on the results.

This talk will discuss STEMDoc peculiarities, representation formats, our corpora, and the MKM applications.

If you would like to meet with the speaker, please contact Manfred Pinkal.