Title:Multi-channel Partial Graph Integration Learning of Partial Multi-omics
Data for Cancer Subtyping
Volume: 18
Issue: 8
Author(s): Qing-Qing Cao, Jian-Ping Zhao*Chun-Hou Zheng*
Affiliation:
- College of Mathematics and System Sciences, Xinjiang University, Urumqi, China
- School of Artificial Intelligence, Anhui University, Hefei, China
Keywords:
Partial multi-omics data, high-order proximity, cancer data, multi-channel, classifier, graph autoencoder.
Abstract:
Background: The appearance of cancer subtypes with different clinical significance fully reflects
the high heterogeneity of cancer. At present, the method of multi-omics integration has become
more and more mature. However, in the practical application of the method, the omics of some samples
are missing.
Objective: The purpose of this study is to establish a depth model that can effectively integrate and express
partial multi-omics data to accurately identify cancer subtypes.
Methods: We proposed a novel partial multi-omics learning model for cancer subtypes, MPGIL (Multichannel
Partial Graph Integration Learning). MPGIL has two main components. Firstly, it obtains more
lateral adjacency information between samples within the omics through the multi-channel graph autoencoders
based on high-order proximity. To reduce the negative impact of missing samples, the
weighted fusion layer is introduced to replace the concatenate layer to learn the consensus representation
across multi-omics. Secondly, a classifier is introduced to ensure that the consensus representation
is representative of clustering. Finally, subtypes were identified by K-means.
Results: This study compared MPGIL with other multi-omics integration methods on 16 datasets. The
clinical and survival results show that MPGIL can effectively identify subtypes. Three ablation experiments
are designed to highlight the importance of each component in MPGIL. A case study of AML
was conducted. The differentially expressed gene profiles among its subtypes fully reveal the high heterogeneity
of cancer.
Conclusion: MPGIL can effectively learn the consistent expression of partial multi-omics datasets and
discover subtypes, and shows more significant performance than the state-of-the-art methods.