Abstract
In the Semantic Web, vocabularies are defined and shared among knowledge workers to describe linked data for scientific, industrial or daily life usage. With the rapid growth of online vocabularies, there is an emergent need for approaches helping users understand vocabularies quickly. In this paper, we study the summarization of vocabularies to help users understand vocabularies. Vocabulary summarization is based on the structural analysis and pragmatics statistics in the global Semantic Web. Local Bipartite Model and Expanded Bipartite Model of a vocabulary are proposed to characterize the structure in a vocabulary and links between vocabularies. A structural importance for each RDF sentence in the vocabulary is assessed using link analysis. Meanwhile, pragmatics importance of each RDF sentence is assessed using the statistics of instantiation of its terms in the Semantic Web. Summaries are produced by extracting important RDF sentences in vocabularies under a re-ranking strategy. Preliminary experiments show that it is feasible to help users understand a vocabulary through its summary.
Similar content being viewed by others
Explore related subjects
Discover the latest articles, news and stories from top researchers in related subjects.References
Ding L, Pan R, Finin T, Joshi A, Peng Y, Kolari P. Finding and ranking knowledge on the Semantic Web. In Proc. the 4th International Semantic Web Conference, Galway, Ireland, June 9–12, 2005, pp.156–170.
Mani I. Automatic Summarization. John Benjamins Publishing Company, 2001.
Penin T, Wang H F, Tran T, Yu Y. Snippet generation for Semantic Web search engines. In Proc. the 3rd Asian Semantic Web Conference, Bangkok, Thailand, 2009. (To appear)
Zhang X, Cheng G, Qu Y Z. Ontology summarization based on RDF sentence graph. In Proc. the 16th International World Wide Web Conference, Banff, Canada, May 8–12, 2007, pp.707–715.
Kessler M M. Bibliographic coupling between scientific papers. American Documentation, 1963, 14(1): 10–25.
Lempel R, Moran S. The stochastic approach for link-structure analysis (SALSA) and the TKC effect. In Proc. the 9th International World Wide Web Conference, Amsterdam, Netherlands, May 15–19, 2000, pp.387–401.
Kleinberg J. Authoritative sources in a hyperlinked environment. In Proc. the 9th ACM SIAM Symposium on Discrete Algorithm, San Francisco, California, USA, Jan. 1998, pp.668–677.
Carbonell J, Goldstein J. The use of MMR, diversity-based reranking for reordering documents and producing summaries. In Proc. the 21st International ACM SIGIR Conference on Research and Development in Information Retrieval, Melbourne, Australia, 1998, pp.335–336.
Radev D R, Jing H, Budzikowska M. Centroid-based summarization of multiple documents: Sentence extraction, utility-based evaluation and user studies. In Proc. ANLP/NAACL 2000 Workshop, Seattle, Washington, USA, May 2000, pp.21–29.
Kershenbaum A, Ma L, Schonberg E, Srinivas K, Fokoue A. The summary Abox: Cutting ontologies down to size. In Proc. the 5th International Semantic Web Conference, Athens, GA, USA, Nov. 5–9, 2006, pp.343–356.
Hustadt U, Motik B, Sattler U. Reducing SHIQ description logic to disjunctive datalog programs. In Proc. the 9th International Conference on Knowledge Representation and Reasoning, Whistler, Canada, June 2004, pp.152–162.
Aleman-Meza B, Halaschek-Wiener C, Arpinar I B, Ramakrishnan C, Sheth A P. Ranking complex relationships on the Semantic Web. IEEE Internet Computing, June 2005, 9(3): 37–44.
Anyanwu K, Maduko A, Sheth A. SemRank: Ranking complex relationship search results on the Semantic Web. In Proc. the 14th International Conference on World Wide Web, Chiba, Japan, May 10–14, 2005, pp.117–127.
Alani H, Brewster C. Ontology ranking based on the analysis of concept structures. In Proc. the 3rd International Conference on Knowledge Capture, Banff, Canada, Oct. 23–25, 2005, pp.51–58.
Yu C, Jagadish H V. Schema summarization. In Proc. the 32nd International Conference on Very Large Data Bases, Seoul, Korea, Sept. 12–15, 2006, pp.319–330.
Author information
Authors and Affiliations
Corresponding author
Additional information
The work is supported in part by the National Basic Research 973 Program of China under Grant No. 2003CB317004 and the National Natural Science Foundation of China under Grant No. 60773106.
Electronic Supplementary Material
Below is the link to the electronic supplementary material.
Rights and permissions
About this article
Cite this article
Zhang, X., Cheng, G., Ge, WY. et al. Summarizing Vocabularies in the Global Semantic Web. J. Comput. Sci. Technol. 24, 165–174 (2009). https://doi.org/10.1007/s11390-009-9212-9
Received:
Revised:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s11390-009-9212-9