Abstract
Integrating information from diverse sources is of great importance in the database area. The main difficulty in information integration is reconciling data semantics. Common approaches to semantic reconciliation are based on first identifying similar entity types in various sources, and then reconciling entity type properties (attributes and relationships). Such approaches assume all instances to be reconciled belong to well-defined types. We suggest an alternative approach based on two fundamental principles. First, reconciliation does not require that instances be assigned to specific types. Instead, sources can be reconciled by analyzing similarities of properties. Second, properties that appear different may be manifestations of a higher-level property that has the same meaning across sources. We present the fundamental ideas underlying our approach, analyze its potential advantages, suggest how the approach can be formalized, demonstrate with examples the feasibility of using it for semantic reconciliation, and suggest directions for further research.
Access this chapter
Tax calculation will be finalised at checkout
Purchases are for personal use only
Preview
Unable to display preview. Download preview PDF.
Similar content being viewed by others
References
Batini, C, M. Lenzerini, and S.B. Navathe, “A Comparative Analysis of Methodologies for Database Schema Integration,” ACM Computing Surveys, 18(4), 1986, 323–364.
Bunge, M., Treatise on Basic Philosophy (Volume 3), Ontology I: The Furniture of the World, Boston: Reidel, 1977.
Bunge, M., Treatise on Basic Philosophy (Volume 4), Ontology II: A World of Systems, Boston: Reidel, 1979.
Castano, S., V. De Antonellis, M. Fugini, and B. Pernici (1999), “Conceptual Schema Analysis: Techniques and Applications,” ACM Transactions on Database Systems, 23(3), 286–333.
Castano, S., V. De Antonellis, and S. De Capitani di Vimercati (2001), “Global Viewing of Heterogeneous Data Sources,” IEEE Transactions on Knowledge and Data Engineering, 13(2), 277–297.
Clarke, C, G. Cormack, and F. Burkowski (1995), “Schema-Independent Retrieval from Heterogeneous Structured Text,” in Fourth Annual Symposium on Document Analysis and Information Retrieval, 279–289.
Clifton, C, E. Housman, and A. Rosenthal (1997), “Experience with a Combined Approach to Attribute-Matching Across Heterogeneous Databases,” Data Mining and Reverse Engineering. Proceedings ofDS-7, IFIP 1997.
Cohen, W. (1998), “Integration of Heterogeneous Databases Without Common Domains Using Queries Based on Textual Similarity,” Proceedings of the Conference on Information and Knowledge Management (CIKM’98), Seattle, WA, 201–212.
Goble, C, S. Bechhofer, L. Carr, D. De Roure, and W. Hall (2001), “Conceptual Open Hypermedia = The Semantic Web?” Proceedings of the 2001 Semantic Web Workshop, Hong Kong, China, 7 pages.
Hulgeri, A., G. Bhalotia, C. Nakhe, and S. Chakrabarti (2001), “Keyword Search in Databases,” IEEE Data Engineering Bulletin, 24(3), 22–32.
Lee, J.-O. and D.-K. Baik (1999), “SemQL: A Semantic Query Language for Multidatabase Systems,” Proceedings of the Conference on Information and Knowledge Management (CIKM’99), Kansas City, MO, 259–266.
Litwin, W., L. Mark, and N. Roussopoulos (1990), “Interoperability of Multiple Autonomous Databases,” ACM Computing Surveys, 22(3), 267–293.
Maedche, A. and S. Staab (2001), “Learning Ontologies for the Semantic Web,” Proceedings of the 2001 Semantic Web Workshop, Hong Kong, China, 10 pages.
Miller, R., “Using Schematically Heterogeneous Structures,” Proceedings SIGMOD’98, Seattle, WA, 189–200.
Naiman, C. and A. Ouksel, “A Classification of Semantic Conflicts in Heterogeneous Information Systems,” Journal of Organizational Computing, 5(2), 1995, 167–193.
Paepcke, A., C.-C. Chang, H. Garcia-Molina, and T. Winograd (1998), “Interoperability for Digital Libraries Worldwide,” Communications of the ACM, 41(4), 33–43.
Parent, C, S. Spaccapietra (1998), “Database Integration: an Overview of Issues and Approaches,” Communications of the ACM, vol. 41, no 5, pp. 166–178, May 1998.
Parsons, J. and Y. Wand, (2000), “Emancipating Instances from the Tyranny of Classes in Information Modeling,” ACM Transactions on Database Systems, 25(2), 228–268.
Qian, X. (1993), “Semantic Interoperation Via Intelligent Mediation,” Proceedings of the 1993 International Workshop on Research Issues in Data Engineering, 228–231.
Reddy, M.P., B.E. Prasad, P.G. Reddy, and A. Gupta, “A Methodology for Integration of Heterogeneous Databases,” IEEE Transactions on Knowledge and Data Engineering, 6(6), 1994, 920–933.
Sciore, E., M. Siegel, and A. Rosenthal (1994), “Using Semantic Values to Facilitate Interoperability Among Heterogeneous Information Systems,” ACM Transactions on Database Systems, 19(2), 254–290.
Smith, K. and L. Obrst, “Unpacking the Semantics of Source and Usage to Perform Semantic Reconciliation in Large-Scale Information Systems,” ACM SIGMOD Record, 28(1), 1999, 26–31.
Toivonen, S. (2001), “Using RDF(S) to Provide Multiple Views into a Single Ontology,” Proceedings of the 2001 Semantic Web Workshop, Hongkong, China, 6 pages.
Wand, Y., “A Proposal for a Formal Model of Objects,” in W. Kim (ed.) and F. Lochovsky (eds.), Object-oriented Concepts, Databases and Applications, Reading, MA: Addison-Wesley, 1989, 537–559.
Wand, Y., V. Storey, and R. Weber, “An Ontological Analysis of the relationship Construct in Conceptual Modelling”, ACM Transactions on Database Systems, Vol. 24, No. 4, December 1999, pp. 494–528.
Wand, Y. and Weber, R., “An Ontological Model of an Information System,” IEEE Transactions on Software Engineering, 16(11), November 1990, 1282–1292.
Wand, Y. and Weber, R., “On the Ontological Expressiveness of Information Systems Analysis and Design Grammars,” Journal of Information Systems, 1993, 217–237.
Wand, Y. and Weber, R., “Towards a Theory of Deep Structure of Information Systems,” Journal of Information Systems, 1995, 203–223.
Author information
Authors and Affiliations
Editor information
Editors and Affiliations
Rights and permissions
Copyright information
© 2002 Springer-Verlag Berlin Heidelberg
About this paper
Cite this paper
Parsons, J., Wand, Y. (2002). Property-Based Semantic Reconciliation of Heterogeneous Information Sources. In: Spaccapietra, S., March, S.T., Kambayashi, Y. (eds) Conceptual Modeling — ER 2002. ER 2002. Lecture Notes in Computer Science, vol 2503. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45816-6_34
Download citation
DOI: https://doi.org/10.1007/3-540-45816-6_34
Published:
Publisher Name: Springer, Berlin, Heidelberg
Print ISBN: 978-3-540-44277-6
Online ISBN: 978-3-540-45816-6
eBook Packages: Springer Book Archive