Information extraction (IE) is the task of turning text documents into a structured form so that the information they contain can be processed automatically. Ontology Mediated Information Extraction (OMIE) is a new paradigm for IE that seeks to exploit the semantic knowledge expressed in ontologies to improve query answering over unstructured data (specifically, raw text). In this paper we present Mastro System-T, an OMIE tool born from a collaboration between the University of Rome “La Sapienza” and IBM Research Almaden, and its first application in the financial domain, namely facilitating access to and sharing of data extracted from the EDGAR system.
Publication details
2020, Proceedings of the Sixth International Workshop on Data Science for Macro-Modeling
Ontology Mediated Information Extraction in Financial Domain with Mastro System-T (04b Conference paper in volume)
Lembo Domenico, Li Yunyao, Popa Lucian, Scafoglieri Federico