iBet uBet web content aggregator. Adding the entire web to your favor.
iBet uBet web content aggregator. Adding the entire web to your favor.



Link to original content: https://dl.acm.org/doi/abs/10.1145/2939672.2939751
Asymmetric Transitivity Preserving Graph Embedding | Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining skip to main content
10.1145/2939672.2939751acmconferencesArticle/Chapter ViewAbstractPublication PageskddConference Proceedingsconference-collections
research-article

Asymmetric Transitivity Preserving Graph Embedding

Published: 13 August 2016 Publication History

Abstract

Graph embedding algorithms embed a graph into a vector space where the structure and the inherent properties of the graph are preserved. The existing graph embedding methods cannot preserve the asymmetric transitivity well, which is a critical property of directed graphs. Asymmetric transitivity depicts the correlation among directed edges, that is, if there is a directed path from u to v, then there is likely a directed edge from u to v. Asymmetric transitivity can help in capturing structures of graphs and recovering from partially observed graphs. To tackle this challenge, we propose the idea of preserving asymmetric transitivity by approximating high-order proximity which are based on asymmetric transitivity. In particular, we develop a novel graph embedding algorithm, High-Order Proximity preserved Embedding (HOPE for short), which is scalable to preserve high-order proximities of large scale graphs and capable of capturing the asymmetric transitivity. More specifically, we first derive a general formulation that cover multiple popular high-order proximity measurements, then propose a scalable embedding algorithm to approximate the high-order proximity measurements based on their general formulation. Moreover, we provide a theoretical upper bound on the RMSE (Root Mean Squared Error) of the approximation. Our empirical experiments on a synthetic dataset and three real-world datasets demonstrate that HOPE can approximate the high-order proximities significantly better than the state-of-art algorithms and outperform the state-of-art algorithms in tasks of reconstruction, link prediction and vertex recommendation.

Supplementary Material

MP4 File (kdd2016_zhang_asymmetric_transitivity_01-acm.mp4)

References

[1]
M. Belkin and P. Niyogi. Laplacian eigenmaps and spectral techniques for embedding and clustering. In NIPS, volume 14, pages 585--591, 2001.
[2]
S. Cao, W. Lu, and Q. Xu. Grarep: Learning graph representations with global structural information. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pages 891--900. ACM, 2015.
[3]
S. Chang, G.-J. Qi, C. C. Aggarwal, J. Zhou, M. Wang, and T. S. Huang. Factorized similarity learning in networks. In Data Mining (ICDM), 2014 IEEE International Conference on, pages 60--69. IEEE, 2014.
[4]
M. Chen, Q. Yang, and X. Tang. Directed graph embedding. In IJCAI, pages 2707--2712, 2007.
[5]
M. De Choudhury, Y.-R. Lin, H. Sundaram, K. S. Candan, L. Xie, A. Kelliher, et al. How does the data sampling strategy impact the discovery of information diffusion in social media? ICWSM, 10:34--41, 2010.
[6]
R. A. Fisher. The use of multiple measurements in taxonomic problems. Annals of eugenics, 7(2):179--188, 1936.
[7]
M. S. Handcock, A. E. Raftery, and J. M. Tantrum. Model-based clustering for social networks. Journal of the Royal Statistical Society: Series A (Statistics in Society), 170(2):301--354, 2007.
[8]
X. He, S. Yan, Y. Hu, P. Niyogi, and H.-J. Zhang. Face recognition using laplacianfaces. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 27(3):328--340, 2005.
[9]
M. Hochstenbach. A jacobi--davidson type method for the generalized singular value problem. Linear Algebra and its Applications, 431(3):471--487, 2009.
[10]
P. D. Hoff. Multiplicative latent factor models for description and prediction of social networks. Computational and Mathematical Organization Theory, 15(4):261--272, 2009.
[11]
P. D. Hoff, A. E. Raftery, and M. S. Handcock. Latent space approaches to social network analysis. Journal of the american Statistical association, 97(460):1090--1098, 2002.
[12]
P. W. Holland and S. Leinhardt. An exponential family of probability distributions for directed graphs. Journal of the american Statistical association, 76(373):33--50, 1981.
[13]
J. Hopcroft and R. Kannan. Foundations of data science. 2014.
[14]
G. Jeh and J. Widom. Simrank: a measure of structural-context similarity. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 538--543. ACM, 2002.
[15]
I. Jolliffe. Principal component analysis. Wiley Online Library, 2002.
[16]
L. Katz. A new status index derived from sociometric analysis. Psychometrika, 18(1):39--43, 1953.
[17]
J. Leskovec, J. Kleinberg, and C. Faloutsos. Graphs over time: densification laws, shrinking diameters and possible explanations. In Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, pages 177--187. ACM, 2005.
[18]
O. Levy and Y. Goldberg. Neural word embedding as implicit matrix factorization. In Advances in Neural Information Processing Systems, pages 2177--2185, 2014.
[19]
T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems, pages 3111--3119, 2013.
[20]
K. Miller, M. I. Jordan, and T. L. Griffiths. Nonparametric latent feature models for link prediction. In Advances in neural information processing systems, pages 1276--1284, 2009.
[21]
S. Mousazadeh and I. Cohen. Embedding and function extension on directed graph. Signal Processing, 111:137--149, 2015.
[22]
C. C. Paige and M. A. Saunders. Towards a generalized singular value decomposition. SIAM Journal on Numerical Analysis, 18(3):398--405, 1981.
[23]
J. Pennington, R. Socher, and C. D. Manning. Glove: Global vectors for word representation. Proceedings of the Empiricial Methods in Natural Language Processing (EMNLP 2014), 12:1532--1543, 2014.
[24]
B. Perozzi, R. Al-Rfou, and S. Skiena. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 701--710. ACM, 2014.
[25]
D. Perrault-Joncas and M. Meila. Estimating vector fields on manifolds and the embedding of directed graphs. arXiv preprint arXiv:1406.0013, 2014.
[26]
D. C. Perrault-Joncas and M. Meila. Directed graph embedding: an algorithm based on continuous limits of laplacian-type operators. In Advances in Neural Information Processing Systems, pages 990--998, 2011.
[27]
S. T. Roweis and L. K. Saul. Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500):2323--2326, 2000.
[28]
B. Scholkopft and K.-R. Mullert. Fisher discriminant analysis with kernels. Neural networks for signal processing IX, 1:1, 1999.
[29]
T. A. Snijders, P. E. Pattison, G. L. Robins, and M. S. Handcock. New specifications for exponential random graph models. Sociological methodology, 36(1):99--153, 2006.
[30]
H. H. Song, T. W. Cho, V. Dave, Y. Zhang, and L. Qiu. Scalable proximity estimation and link prediction in online social networks. In Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference, pages 322--335. ACM, 2009.
[31]
L. Subelj and M. Bajec. Model of complex networks based on citation dynamics. In Proceedings of the 22nd international conference on World Wide Web companion, pages 527--530. International World Wide Web Conferences Steering Committee, 2013.
[32]
J. Tang, M. Qu, M. Wang, M. Zhang, J. Yan, and Q. Mei. Line: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web, pages 1067--1077. International World Wide Web Conferences Steering Committee, 2015.
[33]
J. B. Tenenbaum, V. De Silva, and J. C. Langford. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319--2323, 2000.
[34]
Y. J. Wang and G. Y. Wong. Stochastic blockmodels for directed graphs. Journal of the American Statistical Association, 82(397):8--19, 1987.
[35]
S. Yan, D. Xu, B. Zhang, H.-J. Zhang, Q. Yang, and S. Lin. Graph embedding and extensions: a general framework for dimensionality reduction. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 29(1):40--51, 2007.
[36]
J. Ye, R. Janardan, and Q. Li. Two-dimensional linear discriminant analysis. In Advances in neural information processing systems, pages 1569--1576, 2004.
[37]
S. Zhu, K. Yu, Y. Chi, and Y. Gong. Combining content and link for classification using matrix factorization. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, pages 487--494. ACM, 2007.

Cited By

View all
  • (2024)A Community Detection and Graph-Neural-Network-Based Link Prediction Approach for Scientific LiteratureMathematics10.3390/math1203036912:3(369)Online publication date: 24-Jan-2024
  • (2024)Effective Temporal Graph Learning via Personalized PageRankEntropy10.3390/e2607058826:7(588)Online publication date: 10-Jul-2024
  • (2024)Multi-channel high-order network representation learning researchFrontiers in Neurorobotics10.3389/fnbot.2024.134046218Online publication date: 29-Feb-2024
  • Show More Cited By

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences
KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining
August 2016
2176 pages
ISBN:9781450342322
DOI:10.1145/2939672
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 August 2016

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. asymmetric transitivity
  2. directed graph
  3. graph embedding
  4. high-order proximity

Qualifiers

  • Research-article

Funding Sources

Conference

KDD '16
Sponsor:

Acceptance Rates

KDD '16 Paper Acceptance Rate 66 of 1,115 submissions, 6%;
Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)308
  • Downloads (Last 6 weeks)27
Reflects downloads up to 30 Oct 2024

Other Metrics

Citations

Cited By

View all
  • (2024)A Community Detection and Graph-Neural-Network-Based Link Prediction Approach for Scientific LiteratureMathematics10.3390/math1203036912:3(369)Online publication date: 24-Jan-2024
  • (2024)Effective Temporal Graph Learning via Personalized PageRankEntropy10.3390/e2607058826:7(588)Online publication date: 10-Jul-2024
  • (2024)Multi-channel high-order network representation learning researchFrontiers in Neurorobotics10.3389/fnbot.2024.134046218Online publication date: 29-Feb-2024
  • (2024)Temporally-aware node embeddings for evolving networks topologiesAI Communications10.3233/AIC-23002837:3(411-428)Online publication date: 19-Jun-2024
  • (2024)CatBoost Optimization Using Recursive Feature EliminationJurnal Online Informatika10.15575/join.v9i2.13249:2(169-178)Online publication date: 24-Aug-2024
  • (2024)User tendency-based rating scaling in online trading networksPLOS ONE10.1371/journal.pone.029790319:4(e0297903)Online publication date: 16-Apr-2024
  • (2024)DrugDAGT: a dual-attention graph transformer with contrastive learning improves drug-drug interaction predictionBMC Biology10.1186/s12915-024-02030-922:1Online publication date: 14-Oct-2024
  • (2024)Graph embedding on mass spectrometry- and sequencing-based biomedical dataBMC Bioinformatics10.1186/s12859-023-05612-625:1Online publication date: 2-Jan-2024
  • (2024)Attention-Based Learning for Predicting Drug-Drug Interactions in Knowledge Graph Embedding Based on Multisource Fusion InformationInternational Journal of Intelligent Systems10.1155/2024/51559972024Online publication date: 1-Jan-2024
  • (2024)Neighbor-Enhanced Representation Learning for Link Prediction in Dynamic Heterogeneous Attributed NetworksACM Transactions on Knowledge Discovery from Data10.1145/367655918:8(1-25)Online publication date: 4-Jul-2024
  • Show More Cited By

View Options

Get Access

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media