Link to original content: https://doi.org/10.1007/3-540-45816-6_20
Representing and Querying Semistructured Web Data Using Nested Tables with Structural Variants

  • Conference paper
Conceptual Modeling — ER 2002 (ER 2002)

Part of the book series: Lecture Notes in Computer Science (LNCS, volume 2503)

Abstract

This paper proposes an approach to representing and querying semistructured Web data. The proposed approach is based on nested tables, which may have internal nested structural variations to accommodate semistructured data. Our motivation is to reduce the complexity found in typical query languages for semistructured data and to provide users with an alternative for quickly querying data obtained from multiple-record Web pages. We show the feasibility of our proposal by developing a prototype for a graphical query interface called QSByE (Querying Semistructured data By Example). For QSByE, we define a particular variation of nested tables and propose a set of QBE-like operations that extends typical nested-relational-algebra operations to handle semistructured data. We show examples of how users can pose interesting queries using QSByE.
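
The abstract does not give the formal definition of the nested tables or of the QSByE operations, so the following is only a minimal sketch of the general idea, under stated assumptions. It models a nested table as a list of rows in which the same column may be atomic in one row, a nested table in another, or missing altogether, and shows a selection that tolerates those variants. The names `atoms`, `select`, and `books`, and the sample data, are hypothetical and do not come from the paper.

```python
from typing import Any, Callable, Dict, List, Union

# Hypothetical representation (an assumption, not the paper's definition):
# a value is either atomic or a nested table, i.e. a list of rows.
Atom = Union[str, int, float]
Row = Dict[str, Any]   # column name -> Atom or Table
Table = List[Row]


def atoms(value: Any, path: List[str]) -> List[Atom]:
    """Collect the atomic values reachable from `value` by following `path`.

    A missing column, or a path that does not fit the variant found in a
    row, contributes nothing instead of raising an error.
    """
    if isinstance(value, list):   # nested table: look inside every row
        found: List[Atom] = []
        for row in value:
            found.extend(atoms(row, path))
        return found
    if isinstance(value, dict):   # a row: follow the next column name
        if not path or path[0] not in value:
            return []
        return atoms(value[path[0]], path[1:])
    # Atomic value: it counts only if the path has been fully consumed.
    return [value] if not path else []


def select(table: Table, path: List[str], pred: Callable[[Atom], bool]) -> Table:
    """Keep the top-level rows in which some value on `path` satisfies `pred`."""
    return [row for row in table if any(pred(v) for v in atoms(row, path))]


books: Table = [
    # Variant 1: Author is a plain atomic value.
    {"Title": "Book A", "Author": "Ann"},
    # Variant 2: Author is a nested table with its own column.
    {"Title": "Book B", "Author": [{"Name": "Bob"}, {"Name": "Cy"}]},
    # Variant 3: the Author column is absent.
    {"Title": "Book C"},
]

by_name = select(books, ["Author", "Name"], lambda v: v == "Cy")
by_atom = select(books, ["Author"], lambda v: v == "Ann")
print([row["Title"] for row in by_name])
print([row["Title"] for row in by_atom])
```

In this sketch a query path simply fails to match rows whose structure differs from it, which is one simple way to let a single query run over structurally varying records; a real system could instead choose to report or reconcile the variants.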

This work was partially supported by Project SIAM (MCT/CNPq/PRONEX grant number 76.97.1016.00) and by CNPq (grant number 467775/00-1). The first and second authors are supported by scholarships from CAPES. The fourth author is supported by NSF (grant number IIS-0083127).

On leave from the University of Amazonas, Brazil.

Copyright information

© 2002 Springer-Verlag Berlin Heidelberg

About this paper

Cite this paper

da Silva, A.S., Evangelista Filha, I.M.R., Laender, A.H.F., Embley, D.W. (2002). Representing and Querying Semistructured Web Data Using Nested Tables with Structural Variants. In: Spaccapietra, S., March, S.T., Kambayashi, Y. (eds) Conceptual Modeling — ER 2002. ER 2002. Lecture Notes in Computer Science, vol 2503. Springer, Berlin, Heidelberg. https://doi.org/10.1007/3-540-45816-6_20

  • DOI: https://doi.org/10.1007/3-540-45816-6_20

  • Publisher Name: Springer, Berlin, Heidelberg

  • Print ISBN: 978-3-540-44277-6

  • Online ISBN: 978-3-540-45816-6

  • eBook Packages: Springer Book Archive
