iBet uBet web content aggregator. Adding the entire web to your favor.

Link to original content: https://dl.acm.org/doi/abs/10.1145/2939672.2939751

Asymmetric Transitivity Preserving Graph Embedding | Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

research-article

Asymmetric Transitivity Preserving Graph Embedding

Authors:

Wenwu ZhuAuthors Info & Claims

KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

Pages 1105 - 1114

https://doi.org/10.1145/2939672.2939751

Published: 13 August 2016 Publication History

Abstract

Graph embedding algorithms embed a graph into a vector space where the structure and the inherent properties of the graph are preserved. The existing graph embedding methods cannot preserve the asymmetric transitivity well, which is a critical property of directed graphs. Asymmetric transitivity depicts the correlation among directed edges, that is, if there is a directed path from u to v, then there is likely a directed edge from u to v. Asymmetric transitivity can help in capturing structures of graphs and recovering from partially observed graphs. To tackle this challenge, we propose the idea of preserving asymmetric transitivity by approximating high-order proximity which are based on asymmetric transitivity. In particular, we develop a novel graph embedding algorithm, High-Order Proximity preserved Embedding (HOPE for short), which is scalable to preserve high-order proximities of large scale graphs and capable of capturing the asymmetric transitivity. More specifically, we first derive a general formulation that cover multiple popular high-order proximity measurements, then propose a scalable embedding algorithm to approximate the high-order proximity measurements based on their general formulation. Moreover, we provide a theoretical upper bound on the RMSE (Root Mean Squared Error) of the approximation. Our empirical experiments on a synthetic dataset and three real-world datasets demonstrate that HOPE can approximate the high-order proximities significantly better than the state-of-art algorithms and outperform the state-of-art algorithms in tasks of reconstruction, link prediction and vertex recommendation.

Supplementary Material

MP4 File (kdd2016_zhang_asymmetric_transitivity_01-acm.mp4)

Download
256.48 MB

References

[1]

M. Belkin and P. Niyogi. Laplacian eigenmaps and spectral techniques for embedding and clustering. In NIPS, volume 14, pages 585--591, 2001.

Digital Library

[2]

S. Cao, W. Lu, and Q. Xu. Grarep: Learning graph representations with global structural information. In Proceedings of the 24th ACM International on Conference on Information and Knowledge Management, pages 891--900. ACM, 2015.

Digital Library

[3]

S. Chang, G.-J. Qi, C. C. Aggarwal, J. Zhou, M. Wang, and T. S. Huang. Factorized similarity learning in networks. In Data Mining (ICDM), 2014 IEEE International Conference on, pages 60--69. IEEE, 2014.

Digital Library

[4]

M. Chen, Q. Yang, and X. Tang. Directed graph embedding. In IJCAI, pages 2707--2712, 2007.

Digital Library

[5]

M. De Choudhury, Y.-R. Lin, H. Sundaram, K. S. Candan, L. Xie, A. Kelliher, et al. How does the data sampling strategy impact the discovery of information diffusion in social media? ICWSM, 10:34--41, 2010.

[6]

R. A. Fisher. The use of multiple measurements in taxonomic problems. Annals of eugenics, 7(2):179--188, 1936.

[7]

M. S. Handcock, A. E. Raftery, and J. M. Tantrum. Model-based clustering for social networks. Journal of the Royal Statistical Society: Series A (Statistics in Society), 170(2):301--354, 2007.

[8]

X. He, S. Yan, Y. Hu, P. Niyogi, and H.-J. Zhang. Face recognition using laplacianfaces. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 27(3):328--340, 2005.

Digital Library

[9]

M. Hochstenbach. A jacobi--davidson type method for the generalized singular value problem. Linear Algebra and its Applications, 431(3):471--487, 2009.

[10]

P. D. Hoff. Multiplicative latent factor models for description and prediction of social networks. Computational and Mathematical Organization Theory, 15(4):261--272, 2009.

Digital Library

[11]

P. D. Hoff, A. E. Raftery, and M. S. Handcock. Latent space approaches to social network analysis. Journal of the american Statistical association, 97(460):1090--1098, 2002.

[12]

P. W. Holland and S. Leinhardt. An exponential family of probability distributions for directed graphs. Journal of the american Statistical association, 76(373):33--50, 1981.

[13]

J. Hopcroft and R. Kannan. Foundations of data science. 2014.

[14]

G. Jeh and J. Widom. Simrank: a measure of structural-context similarity. In Proceedings of the eighth ACM SIGKDD international conference on Knowledge discovery and data mining, pages 538--543. ACM, 2002.

Digital Library

[15]

I. Jolliffe. Principal component analysis. Wiley Online Library, 2002.

[16]

L. Katz. A new status index derived from sociometric analysis. Psychometrika, 18(1):39--43, 1953.

[17]

J. Leskovec, J. Kleinberg, and C. Faloutsos. Graphs over time: densification laws, shrinking diameters and possible explanations. In Proceedings of the eleventh ACM SIGKDD international conference on Knowledge discovery in data mining, pages 177--187. ACM, 2005.

Digital Library

[18]

O. Levy and Y. Goldberg. Neural word embedding as implicit matrix factorization. In Advances in Neural Information Processing Systems, pages 2177--2185, 2014.

Digital Library

[19]

T. Mikolov, I. Sutskever, K. Chen, G. S. Corrado, and J. Dean. Distributed representations of words and phrases and their compositionality. In Advances in neural information processing systems, pages 3111--3119, 2013.

Digital Library

[20]

K. Miller, M. I. Jordan, and T. L. Griffiths. Nonparametric latent feature models for link prediction. In Advances in neural information processing systems, pages 1276--1284, 2009.

Digital Library

[21]

S. Mousazadeh and I. Cohen. Embedding and function extension on directed graph. Signal Processing, 111:137--149, 2015.

Digital Library

[22]

C. C. Paige and M. A. Saunders. Towards a generalized singular value decomposition. SIAM Journal on Numerical Analysis, 18(3):398--405, 1981.

[23]

J. Pennington, R. Socher, and C. D. Manning. Glove: Global vectors for word representation. Proceedings of the Empiricial Methods in Natural Language Processing (EMNLP 2014), 12:1532--1543, 2014.

[24]

B. Perozzi, R. Al-Rfou, and S. Skiena. Deepwalk: Online learning of social representations. In Proceedings of the 20th ACM SIGKDD international conference on Knowledge discovery and data mining, pages 701--710. ACM, 2014.

Digital Library

[25]

D. Perrault-Joncas and M. Meila. Estimating vector fields on manifolds and the embedding of directed graphs. arXiv preprint arXiv:1406.0013, 2014.

[26]

D. C. Perrault-Joncas and M. Meila. Directed graph embedding: an algorithm based on continuous limits of laplacian-type operators. In Advances in Neural Information Processing Systems, pages 990--998, 2011.

[27]

S. T. Roweis and L. K. Saul. Nonlinear dimensionality reduction by locally linear embedding. Science, 290(5500):2323--2326, 2000.

[28]

B. Scholkopft and K.-R. Mullert. Fisher discriminant analysis with kernels. Neural networks for signal processing IX, 1:1, 1999.

[29]

T. A. Snijders, P. E. Pattison, G. L. Robins, and M. S. Handcock. New specifications for exponential random graph models. Sociological methodology, 36(1):99--153, 2006.

[30]

H. H. Song, T. W. Cho, V. Dave, Y. Zhang, and L. Qiu. Scalable proximity estimation and link prediction in online social networks. In Proceedings of the 9th ACM SIGCOMM conference on Internet measurement conference, pages 322--335. ACM, 2009.

Digital Library

[31]

L. Subelj and M. Bajec. Model of complex networks based on citation dynamics. In Proceedings of the 22nd international conference on World Wide Web companion, pages 527--530. International World Wide Web Conferences Steering Committee, 2013.

Digital Library

[32]

J. Tang, M. Qu, M. Wang, M. Zhang, J. Yan, and Q. Mei. Line: Large-scale information network embedding. In Proceedings of the 24th International Conference on World Wide Web, pages 1067--1077. International World Wide Web Conferences Steering Committee, 2015.

Digital Library

[33]

J. B. Tenenbaum, V. De Silva, and J. C. Langford. A global geometric framework for nonlinear dimensionality reduction. Science, 290(5500):2319--2323, 2000.

[34]

Y. J. Wang and G. Y. Wong. Stochastic blockmodels for directed graphs. Journal of the American Statistical Association, 82(397):8--19, 1987.

[35]

S. Yan, D. Xu, B. Zhang, H.-J. Zhang, Q. Yang, and S. Lin. Graph embedding and extensions: a general framework for dimensionality reduction. Pattern Analysis and Machine Intelligence, IEEE Transactions on, 29(1):40--51, 2007.

Digital Library

[36]

J. Ye, R. Janardan, and Q. Li. Two-dimensional linear discriminant analysis. In Advances in neural information processing systems, pages 1569--1576, 2004.

Digital Library

[37]

S. Zhu, K. Yu, Y. Chi, and Y. Gong. Combining content and link for classification using matrix factorization. In Proceedings of the 30th annual international ACM SIGIR conference on Research and development in information retrieval, pages 487--494. ACM, 2007.

Digital Library

Cited By

Liu CHan YXu HYang SWang KSu Y(2024)A Community Detection and Graph-Neural-Network-Based Link Prediction Approach for Scientific LiteratureMathematics10.3390/math1203036912:3(369)Online publication date: 24-Jan-2024
https://doi.org/10.3390/math12030369
Liao ZLiu THe YLin L(2024)Effective Temporal Graph Learning via Personalized PageRankEntropy10.3390/e2607058826:7(588)Online publication date: 10-Jul-2024
https://doi.org/10.3390/e26070588
Ye ZTang YZhao HWang ZJi Y(2024)Multi-channel high-order network representation learning researchFrontiers in Neurorobotics10.3389/fnbot.2024.134046218Online publication date: 29-Feb-2024
https://doi.org/10.3389/fnbot.2024.1340462
Show More Cited By

Index Terms

Asymmetric Transitivity Preserving Graph Embedding
1. Computing methodologies
  1. Machine learning
    1. Learning paradigms
      1. Unsupervised learning
        Dimensionality reduction and manifold learning
    2. Machine learning approaches

Recommendations

Multi-view Clustering with Graph Embedding for Connectome Analysis
CIKM '17: Proceedings of the 2017 ACM on Conference on Information and Knowledge Management

Multi-view clustering has become a widely studied problem in the area of unsupervised learning. It aims to integrate multiple views by taking advantages of the consensus and complimentary information from multiple views. Most of the existing works in ...
Embedding into Bipartite Graphs

The conjecture of Bollobás and Komlós, recently proved by Böttcher, Schacht, and Taraz [Math. Ann., 343 (2009), pp. 175-205], implies that for any $\gamma>0$, every balanced bipartite graph on $2n$ vertices with bounded degree and sublinear bandwidth ...
Graph embedding via multi-scale graph representations
Abstract
Graph embedding provides an effective way to encode graph nodes as continuous vector representations in a low-dimensional space. The high-order proximities based graph embedding methods can preserve global structures, but the high-...

Comments

Information & Contributors

Information

Published In

cover image ACM Conferences

KDD '16: Proceedings of the 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 2016

2176 pages

ISBN:9781450342322

DOI:10.1145/2939672

General Chairs:
Balaji Krishnapuram
IBM
,
Mohak Shah
Bosch
,
Program Chairs:
Alex Smola
Amazon
,
Charu Aggarwal
IBM
,
Dou Shen
Baidu
,
Rajeev Rastogi
Amazon

Copyright © 2016 ACM.

Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

Sponsors

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 13 August 2016

Permissions

Request permissions for this article.

Request Permissions

Check for updates

Author Tags

Qualifiers

Research-article

Funding Sources

National Program on Key Basic Research Project
National Natural Science Foundation of China

Conference

KDD '16

Sponsor:

KDD '16: The 22nd ACM SIGKDD International Conference on Knowledge Discovery and Data Mining

August 13 - 17, 2016

California, San Francisco, USA

Acceptance Rates

KDD '16 Paper Acceptance Rate 66 of 1,115 submissions, 6%;

Overall Acceptance Rate 1,133 of 8,635 submissions, 13%

Contributors

Other Metrics

View Article Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

801
Total Citations
View Citations
3,003
Total Downloads

Downloads (Last 12 months)308
Downloads (Last 6 weeks)27

Reflects downloads up to 30 Oct 2024

Other Metrics

View Author Metrics

Citations

Cited By

Liu CHan YXu HYang SWang KSu Y(2024)A Community Detection and Graph-Neural-Network-Based Link Prediction Approach for Scientific LiteratureMathematics10.3390/math1203036912:3(369)Online publication date: 24-Jan-2024
https://doi.org/10.3390/math12030369
Liao ZLiu THe YLin L(2024)Effective Temporal Graph Learning via Personalized PageRankEntropy10.3390/e2607058826:7(588)Online publication date: 10-Jul-2024
https://doi.org/10.3390/e26070588
Ye ZTang YZhao HWang ZJi Y(2024)Multi-channel high-order network representation learning researchFrontiers in Neurorobotics10.3389/fnbot.2024.134046218Online publication date: 29-Feb-2024
https://doi.org/10.3389/fnbot.2024.1340462
Enes KNunes MMurai FPappa G(2024)Temporally-aware node embeddings for evolving networks topologiesAI Communications10.3233/AIC-23002837:3(411-428)Online publication date: 19-Jun-2024
https://doi.org/10.3233/AIC-230028
Hadianto AUtomo W(2024)CatBoost Optimization Using Recursive Feature EliminationJurnal Online Informatika10.15575/join.v9i2.13249:2(169-178)Online publication date: 24-Aug-2024
https://doi.org/10.15575/join.v9i2.1324
Jeong SKim JLee BLim S(2024)User tendency-based rating scaling in online trading networksPLOS ONE10.1371/journal.pone.029790319:4(e0297903)Online publication date: 16-Apr-2024
https://doi.org/10.1371/journal.pone.0297903
Chen YWang JZou QNiu MDing YSong JWang Y(2024)DrugDAGT: a dual-attention graph transformer with contrastive learning improves drug-drug interaction predictionBMC Biology10.1186/s12915-024-02030-922:1Online publication date: 14-Oct-2024
https://doi.org/10.1186/s12915-024-02030-9
Alvarez-Mamani EDechant RBeltran-Castañón CIbáñez A(2024)Graph embedding on mass spectrometry- and sequencing-based biomedical dataBMC Bioinformatics10.1186/s12859-023-05612-625:1Online publication date: 2-Jan-2024
https://doi.org/10.1186/s12859-023-05612-6
Li YYou ZWang SMi CWang MHuang YYi H(2024)Attention-Based Learning for Predicting Drug-Drug Interactions in Knowledge Graph Embedding Based on Multisource Fusion InformationInternational Journal of Intelligent Systems10.1155/2024/51559972024Online publication date: 1-Jan-2024
https://dl.acm.org/doi/10.1155/2024/5155997
Wei XWang WZhang CDing WWang BQian YHan ZSu C(2024)Neighbor-Enhanced Representation Learning for Link Prediction in Dynamic Heterogeneous Attributed NetworksACM Transactions on Knowledge Discovery from Data10.1145/367655918:8(1-25)Online publication date: 4-Jul-2024
https://dl.acm.org/doi/10.1145/3676559
Show More Cited By

View Options

Get Access

Login options

Check if you have access through your login credentials or your institution to get full access on this article.

Full Access

Get this Publication

View options

PDF

View or Download as a PDF file.

eReader

View online with eReader.

Media

Figures

Other

Tables

View Table of Contents