Random-Walk-Based or Similarity-Based Methods, Which is Better for Directed Graph Embedding?
- Authors
- Reyhani Hamedani, Masoud; Ryu, Jin-Su; Kim, Sang-Wook
- Issue Date
- Feb-2024
- Publisher
- Institute of Electrical and Electronics Engineers Inc.
- Keywords
- directed graph embedding; random walk; similarity measure
- Citation
- Proceedings - 2024 IEEE International Conference on Big Data and Smart Computing, BigComp 2024, pp 83 - 89
- Pages
- 7
- Indexed
- SCOPUS
- Journal Title
- Proceedings - 2024 IEEE International Conference on Big Data and Smart Computing, BigComp 2024
- Start Page
- 83
- End Page
- 89
- URI
- https://scholarworks.bwise.kr/hanyang/handle/2021.sw.hanyang/197720
- DOI
- 10.1109/BigComp60711.2024.00022
- ISSN
- 0000-0000
- Abstract
- Graph embedding methods map nodes in a graph into low-dimensional latent vectors that preserve the underlying semantic information in the graph and can be used for graph inference and analysis. Graph embedding is applicable to both directed and undirected graphs; however, it is more challenging for the former ones due to their asymmetric nature. Single-vector embedding methods have limitations in preserving the asymmet-ric information in directed graphs since they provide a single latent vector for each node in the graph. To alleviate this problem, double-vector embedding methods provide two latent vectors, i.e., source and target, for any nodes; in this paper, we first divide these methods into two following categories based on their graph embedding mechanism: random-walk-based methods (RWM) and similarity-based methods (SBM). Although the effectiveness of the double-vector and single-vector embedding methods are widely studied in the literature, there are no comprehensive studies to evaluate and compare the effectiveness of the state-of-the-art RWM and SBM in directed graph embedding. This paper provides such a study by conducting extensive experiments with seven real-world datasets. Our experimental results demonstrate that 1) the double-vector embedding methods are not always superior to the single-vector ones in directed graph embedding, and 2) superiority of random-walk-based or similarity-based double-vector embedding methods over the others depends on different factors such as the machine learning tasks, datasets, and even evaluation metrics.
- Files in This Item
- There are no files associated with this item.
- Appears in
Collections - 서울 공과대학 > 서울 컴퓨터소프트웨어학부 > 1. Journal Articles

Items in ScholarWorks are protected by copyright, with all rights reserved, unless otherwise indicated.