Data from multiple sources are characterized by multiple types of heterogeneity. The following hierarchy is often used:345
Ontologies, as formal models of representation with explicitly defined concepts and named relationships linking them, are used to address the issue of semantic heterogeneity in data sources. In domains like bioinformatics and biomedicine, the rapid development, adoption and public availability of ontologies [1] has made it possible for the data integration community to leverage them for semantic integration of data and information.
Ontologies enable the unambiguous identification of entities in heterogeneous information systems and assertion of applicable named relationships that connect these entities together. Specifically, ontologies play the following roles:
There are three main architectures that are implemented in ontology‑based data integration applications,11 namely,
H. Wache; T. Vögele; U. Visser; H. Stuckenschmidt; G. Schuster; H. Neumann; S. Hübner (2001). Ontology-Based Integration of Information A Survey of Existing Approaches. CiteSeerX 10.1.1.142.4390. /wiki/CiteSeerX_(identifier) ↩
Maurizio Lenzerini (2002). Data Integration: A Theoretical Perspective (PDF). pp. 243–246. http://www.dis.uniroma1.it/~lenzerin/homepagine/talks/TutorialPODS02.pdf ↩
A.P. Sheth (1999). "Changing Focus on Interoperability in Information Systems: From System, Syntax, Structure to Semantics". Interoperating Geographic Information Systems. M. F. Goodchild, M. J. Egenhofer, R. Fegeas, and C. A. Kottman (eds.), Kluwer Academic Publishers (PDF). pp. 5–30. http://lsdis.cs.uga.edu/library/download/S98-changing.pdf ↩
AHM02 Tutorial 5: Data Integration and Mediation; Contributors: B. Ludaescher, I. Altintas, A. Gupta, M. Martone, R. Marciano, X. Qian http://daks.ucdavis.edu/~ludaesch/Paper/AHM02/tutorial5.html ↩
"AHM02 Tutorial 5: Data Integration and Mediation". users.sdsc.edu. Retrieved 2017-11-23. http://users.sdsc.edu/~ludaesch/Paper/AHM02/tutorial5.html ↩
Y. Arens; C. Hsu; C.A. Knoblock (1996). Query Processing in sims information mediator (PDF). http://www.isi.edu/integration/papers/arens98-agents.pdf ↩
"Semantic Knowledge Source Integration | Cycorp". www.cyc.com. Archived from the original on 2014-05-17. https://web.archive.org/web/20140517151759/http://www.cyc.com/content/semantic-knowledge-source-integration ↩
"Harnessing Cyc to Answer Clinical Researchers' Ad Hoc Queries | Lenat | AI Magazine". Archived from the original on 2010-12-31. Retrieved 2014-05-15. https://web.archive.org/web/20101231150233/http://www.aaai.org/ojs/index.php/aimagazine/article/viewArticle/2299 ↩
"Home". gellish.net. https://www.gellish.net/ ↩
E. Mena; V. Kashyap; A. Sheth; A. Illarramendi (1996). OBSERVER: An Approach for Query Processing in Global Information Systems based on Interoperation across Pre-existing Ontologies (PDF). http://dit.unitn.it/~p2p/RelatedWork/Matching/MKSI96.pdf ↩
Cheng Hian Goh (1997). Representing and Reasoning about Semantic Conflicts in Heterogeneous Information Systems (PDF). http://context2.mit.edu/coin/publications/goh-thesis/goh-thesis.pdf ↩