Abstract
Motivation: Annotated reference corpora such as the GENIA corpus play an important role in biomedical information extraction. A semantic annotation of the natural language texts in these reference corpora using formal ontologies and logic is challenging due to the ambiguous use of natural language and natural language semantics. Providing formal definitions and axioms for these relations would offer the means for developing consistent and verifiable annotation guidelines and allow for the automatic verification of annotations as well as enabling the discovery of new information through deductive inferences. Results: We developed a formal ontology of relations based on the relations used in the recent GENIA corpus annotations. For this purpose, we selected existing axiom systems based on the desired properties of the relations within the domain and provided new axioms for several relations. To apply this ontology of relations to the semantic annotation of natural language texts, we developed and implemented two ontology design patterns. We provide an implementation of the ontology of relations in the Web Ontology Language (OWL). By combining the implementation of the design patterns and that of the relation ontology, we also provide a software application to convert annotated GENIA abstracts into OWL ontologies. In this way, we make these ontologies amenable for automated verification, deductive inferences and other knowledge-based applications.
Original language | English (US) |
---|---|
Pages (from-to) | 27-35 |
Number of pages | 9 |
Journal | CEUR Workshop Proceedings |
Volume | 714 |
State | Published - 2010 |
Externally published | Yes |
Event | 4th International Symposium on Semantic Mining in Biomedicine, SMBM 2010 - Cambridge, United Kingdom Duration: Oct 25 2010 → Oct 26 2010 |
ASJC Scopus subject areas
- General Computer Science