Imprecise Answers in Distributed Environments: Estimation of Information Loss for Multi-Ontology Based Query Processing
The World Wide Web is fast becoming a ubiquitous computing environment. Prevalent keyword-based search techniques are scalable but cannot access information based on concepts. We investigate the use of concepts from multiple real-world, pre-existing domain ontologies to describe the underlying data content and to support information access at a higher level of abstraction. It is not practical to have a single domain ontology describe the vast amounts of data on the Web. Instead, we expect multiple ontologies to be used as different world views, and we present an approach to "browse" ontologies as a paradigm for information access. A critical challenge in this approach is the vocabulary heterogeneity problem. Queries are rewritten using interontology relationships to obtain translations across ontologies. However, some translations may not be semantics-preserving, leading to uncertainty or loss in the information retrieved. We present a novel approach for estimating loss of information based on the navigation of ontological terms. We define measures for loss of information based on intensional information, as well as on well-established metrics such as precision and recall, which are based on extensional information. These measures are used to select results having the desired quality of information.
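As a rough illustration of the extensional side of these measures, the sketch below computes the textbook definitions of precision and recall for a translated term against the original term. It is not the paper's own loss measure, which the abstract does not spell out. The function name, the object identifiers, and the choice to return 0.0 for an empty extension are all assumptions made for this example.

```python
def precision_recall(original_ext, translated_ext):
    """Textbook precision and recall of a translated term's extension
    measured against the original term's extension.

    Both arguments are collections of object identifiers (hypothetical data).
    """
    original = set(original_ext)
    translated = set(translated_ext)
    overlap = original & translated
    # Assumption: report 0.0 when a ratio would otherwise be undefined.
    precision = len(overlap) / len(translated) if translated else 0.0
    recall = len(overlap) / len(original) if original else 0.0
    return precision, recall


# Hypothetical extensions: objects under the user's term and under the
# term it was translated to in another ontology.
original = {"d1", "d2", "d3", "d4"}
translated = {"d3", "d4", "d5"}
precision, recall = precision_recall(original, translated)
```

Here two of the three translated objects belong to the original extension, and two of the four original objects are retrieved, so a translation like this one would lose information in both directions.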
Published in International Journal of Cooperative Information Systems (IJCIS), Volume 9, Issue 4, 2000, pages 403-426.
© 2000 World Scientific Publishing Co Pte Ltd
Mena, E., Kashyap, V., Illarramendi, A., & Sheth, A. P. (2000). Imprecise Answers in Distributed Environments: Estimation of Information Loss for Multi-Ontology Based Query Processing. International Journal of Cooperative Information Systems (IJCIS), 9(4), 403-426. https://doi.org/10.1142/S0218843000000193