Ex is an information extraction (IE) system based on extraction ontologies, developed by the Knowledge Engineering Group (KEG) at UEP since 2006. Extraction ontologies are used to extract both standalone named entities (standalone attributes) and instances (groups of attributes which "belong together"). The advantage of this technology is that it can utilize multiple sources of extraction knowledge, which should lower the requirement for training data. Ex can be used for extraction from heavily structured (e.g. tabular) documents, semi-structured documents and free-text documents.

For a domain of interest, the user writes an extraction ontology. An extraction ontology is structurally similar to a conventional domain ontology. However, it reflects the way information is presented on the web rather than the inherent state of affairs, and it is extended with extraction knowledge that can be used to identify the described objects in text. An extraction ontology can be viewed as a set of attribute definitions, class definitions and axiom definitions.
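
To make the three kinds of definitions more concrete, here is a minimal Java sketch of how attributes, classes and axioms could relate to each other. It is an illustration only and does not show Ex's actual API or ontology format: the names ExtractionOntologySketch, AttributeDef, AxiomDef, ClassDef and exampleOntology, the "Offer" class and its price attributes are all hypothetical. A single regular expression per attribute stands in for the richer extraction knowledge a real extraction ontology would carry.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.function.Predicate;
import java.util.regex.Pattern;

// Hypothetical sketch: none of these names are taken from Ex itself.
class ExtractionOntologySketch {

    // Attribute definition: a name plus one piece of extraction knowledge
    // (assumed here to be a regular expression over the attribute value).
    static final class AttributeDef {
        final String name;
        final Pattern valuePattern;

        AttributeDef(String name, String valueRegex) {
            this.name = name;
            this.valuePattern = Pattern.compile(valueRegex);
        }

        boolean accepts(String candidate) {
            return valuePattern.matcher(candidate).matches();
        }
    }

    // Axiom definition: a condition over the attribute values of one instance.
    static final class AxiomDef {
        final String description;
        final Predicate<Map<String, String>> condition;

        AxiomDef(String description, Predicate<Map<String, String>> condition) {
            this.description = description;
            this.condition = condition;
        }
    }

    // Class definition: groups attributes that "belong together" and the
    // axioms that constrain them.
    static final class ClassDef {
        final String name;
        final List<AttributeDef> attributes = new ArrayList<>();
        final List<AxiomDef> axioms = new ArrayList<>();

        ClassDef(String name) {
            this.name = name;
        }

        // A candidate instance is accepted if every value that is present
        // fits its attribute pattern and every axiom holds.
        boolean isValidInstance(Map<String, String> instance) {
            for (AttributeDef attribute : attributes) {
                String value = instance.get(attribute.name);
                if (value != null && !attribute.accepts(value)) {
                    return false;
                }
            }
            for (AxiomDef axiom : axioms) {
                if (!axiom.condition.test(instance)) {
                    return false;
                }
            }
            return true;
        }
    }

    // Builds a toy ontology with one class, two attributes and one axiom.
    static ClassDef exampleOntology() {
        ClassDef offer = new ClassDef("Offer");
        offer.attributes.add(new AttributeDef("price", "\\d+(\\.\\d+)?"));
        offer.attributes.add(new AttributeDef("priceWithTax", "\\d+(\\.\\d+)?"));
        offer.axioms.add(new AxiomDef(
                "price with tax is not lower than price",
                inst -> !inst.containsKey("price")
                        || !inst.containsKey("priceWithTax")
                        || Double.parseDouble(inst.get("priceWithTax"))
                                >= Double.parseDouble(inst.get("price"))));
        return offer;
    }

    public static void main(String[] args) {
        ClassDef offer = exampleOntology();
        Map<String, String> candidate = new HashMap<>();
        candidate.put("price", "100");
        candidate.put("priceWithTax", "121");
        System.out.println(offer.isValidInstance(candidate)); // prints true
    }
}
```

In this sketch the attribute patterns decide which text fragments may fill an attribute, while the axiom rejects groupings whose values contradict each other, for example a price with tax of 90 next to a price of 100.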

Development of Ex is ongoing. The code is written in Java.
Ex is distributed under the LGPL license.

