GCN-MF: Disease-Gene Association Identification By Graph Convolutional Networks and Matrix Factorzation
Peng Han (King Abdullah University of Science and Technology);Peng Yang (King Abdullah University of Science and Technology);Peilin Zhao (King Abdullah University of Science and Technology);Shuo Shang (Inception Institute of Artificial Intelligence);Yong Liu (Alibaba-NTU Singapore Joint Research Institute, Nanyang Technological University);Jiayu Zhou (Michigan State University);Xin Gao (King Abdullah University of Science and Technology);Panos Kalnis (King Abdullah University of Science and Technology);
Discovering disease-gene association is a fundamental and critical biomedical task, which assists biologists and physicians to discover pathogenic mechanism of syndromes. With various clinical biomarkers measuring the similarities among genes and disease phenotypes, network-based semi-supervised learning (NSSL) has been commonly utilized by these studies to address this class-imbalanced large-scale data issue. However, most existing NSSL approaches are based on linear models and suffer from two major limitations: 1) They implicitly consider a local-structure representation for each candidate; 2) They are unable to capture nonlinear associations between diseases and genes. In this paper, we propose a new framework for disease-gene association task by combining Graph Convolutional Network (GCN) and matrix factorization, named GCN-MF. With the help of GCN, we could capture non-linear interactions and exploit measured similarities. Moreover, we define a margin control loss function to reduce the effect of sparsity. Empirical results demonstrate that the proposed deep learning algorithm outperforms all other state-of-the-art methods on most of metrics.
How can we assist you?
We'll be updating the website as information becomes available. If you have a question that requires immediate attention, please feel free to contact us. Thank you!
Please enter the word you see in the image below: