Abstract
Massive amount of data that are associated with geographic information are generated in Internet. More and more researches focus on how to retrieve geo-textual data effectively. Existing methods mostly allow exact matches for query keywords but fail to support fuzzy preference queries. In this paper, we study the skyline problem of fuzzy preference queries. That is, given a set of geo-textual data, the skyline comprises the objects that are not dominated by others. In this paper, we only consider the problem of two dimensions, the text relevance dimension and the spatial relevance dimension. We introduce two functions to quantify the text relevance and the spatial relevance. We also develop a new index structure to organize the geo-textual data and an algorithm based on it. Theoretical analysis and experimental results show that our method offers scalability and good performance.
Similar content being viewed by others
References
Bao J, Mokbel MF (2013) GeoRank: an efficient location-aware news feed ranking system. In: SIGSPATIA/L GIS 2013. 184–193. ACM, Orlando
Lisi C, Gao C, Xin C (2013) An efficient query indexing mechanism for filtering geo-textual data. In: SIGMOD 2013. 749–760. ACM, NewYork
Long G, Jie S, Htoo Htet A, Kian-Lee T (2015) Efficient continuous top-k spatial keyword queries on road networks. GeoInformatica 19(1):29–60
Huang W, Li G, Tan K-L, Feng J (2012) Efficient safe-region construction for moving top-K spatial keyword queries. In: CIKM 2012. 932–941. ACM, Maui
Chen L, Cong G, Cao X, Tan K-L (2015) Temporal Spatial-Keyword Top-k publish/subscribe. In: ICDE 2015. 255–266. ICDE Press, Seoul
Zheng K, Su H, Zheng B, Shang S, Xu J, Liu J, Zhou X (2015) Interactive Top-k Spatial Keyword queries. In: ICDE 2015. 423–434. ICDE Press, Seoul
Yunjun Gao, Xu Qin, Baihua Zheng, Gang Chen: Efficient Reverse Top-k Boolean Spatial Keyword Queries on Road Networks. IEEE Trans. Knowl. Data Eng. (TKDE) 27(5):1205–1218 (2015)
Zhang D, Chan C-Y, Tan K-L (2014) Processing spatial keyword query as a top-k aggregation query. In: SIGIR 2014. 355–364. ACM, Gold Coast
Chen L, Lin X, Hu H, Jensen CS, Xu J (2015) Answering why-not questions on spatial keyword top-k queries. In: ICDE 2015. 279–290. ICDE Press, Seoul
Tan KL, Eng PK, Ooi BC (2001) Efficient progressive skyline computation. In: VLDB 2001. 301–310. ACM, Roma
Dellis E, Seeger B (2007) Efficient computation of reverse skyline queries. In: VLDB 2007. 291–302. ACM, Vienna
Borzsonyi S, Kossmann D, Stocker K (2001) The skyline operator. In: ICDE 2001. 421–430. ICDE Press, Heidelberg
De Felipe I, Hristidis V, Rishe N (2008) Keyword search on spatial databases. In: ICDE 2008. 656–665. ICDE Press, Washington
Zhang D, Chee YM, Mondal A, Tung AKH, Kitsuregawa M (2009) Keyword search in spatial databases: Towards Searching by Document. In: ICDE 2009. 688–699. ICDE Press, Shanghai
Chen YY, Suel T, Markowetz A (2006) Efficient query processing in geographic web search engines. In: SIGMOD 2006. 277–288. ACM, Chicago
De Felipe I, Hristidis V, Rishe N (2008) Keyword search on spatial databases. In: ICDE 2008. 656–665. ICDE Press, Cancún
Li G, Xu J, Feng J (2012) Keyword-based k-nearest neighbor search in spatial databases. In: CIKM 2012. 2144–2148. ACM, Maui
Xin C, Gao C, Tao G, Jensen CS, Beng Chin O (2015) Efficient processing of spatial group keyword queries. ACM trans. Database Syst (TODS) 40(2):13
Dingming W, Man Lung Y, Jensen CS (2013) Moving spatial keyword queries: Formulation, methods, and analysis. ACM Trans. Database Syst (TODS) 38(1):7
Wang X, Zhang Y, Zhang W, Lin X, Wang W (2015) AP-Tree: Efficiently support continuous spatial-keyword queries over stream. In: ICDE 2015. 1107–1118. ICDE Press, Seoul
Ying L, Jiaheng L, Gao C, Wei W, Cyrus S (2014) Efficient algorithms and cost models for reverse spatial-keyword k-nearest neighbor search. ACM trans. Database Syst (TODS) 39(2):13
Khodaei A, Shahabi C (2012) Chen Li: SKIF-P: a point-based indexing and ranking of web documents for spatial-keyword search. GeoInformatica 16(3):563–596
Jun H, Ju F, Guoliang L, Shanshan C (2012) Top-k Fuzzy Spatial Keyword Search. (in Chinese). Chin J Comput 35(11):2237–2246
Cong G, Jensen CS, Wu D (2009) Efficient Retrieval of the Top-k Most Relevant Spatial Web Objects. In: VLDB 2009. 337–348. ACM, Lyon
Li G, Feng J, Xu J (2012) DESKS: Direction-Aware Spatial Keyword Search. In: ICDE 2012. 474–485. ICDE Press, Washington
Zhang C, Zhang Y, Zhang W, Lin X (2013) Inverted Linear Quadtree: Efficient Top K Spatial Keyword Search. In: ICDE 2013. 901–912. ICDE Press, Brisbane
Yao B, Li F, Hadjieleftheriou M, Hou K (2010) Approximate string search in spatial databases. In: ICDE 2010. 545–556. ICDE Press, Long Beach
Hadjieleftheriou M, Li C (2009) Efficient approximate search on string collections. In: VLDB 2009. 1660–1661. ACM, Lyon
Li C, Lu J, Lu Y (2008) Efficient merging and filtering algorithms for approximate string searches. In: ICDE 2008. 257–266. ICDE Press, Cancún
Xiao C, Wang W, Lin X, Shang H (2009) Top-k set similarity joins. In: ICDE 2009. 916–927. ICDE Press, Shanghai
Kossmann D, Ramsak F, Rost S (2002) Shooting stars in the sky: an online algorithm for skyline queries. In: VLDB 2002. 275–286. ACM, Hong Kong
Papadias D, Fu G, Seeger B, Tao Y (2003) An optimal and progressive algorithm for skyline queries. In: SIGMOD 2003. 467–478. ACM, San Diego
Lee J, Hwang S-W (2014) Toward efficient multidimensional subspace skyline computation. In: VLDB 2014. 129–145. ACM, Hangzhou
Liu B, Chan C-Y (2010) ZINC: Efficient indexing for skyline computation. In: VLDB 2010. 197–207. ACM, Singapore
Acknowledgments
This paper was supported by NGFR 973 grant 2012CB316200, NSFC grant 61472099, 61133002 and National Sci-Tech Support Plan 2015BAH10F01.
Author information
Authors and Affiliations
Corresponding author
Rights and permissions
About this article
Cite this article
Li, J., Wang, H., Li, J. et al. Skyline for geo-textual data. Geoinformatica 20, 453–469 (2016). https://doi.org/10.1007/s10707-015-0243-9
Received:
Revised:
Accepted:
Published:
Issue Date:
DOI: https://doi.org/10.1007/s10707-015-0243-9