Generic placeholder image

Combinatorial Chemistry & High Throughput Screening

Editor-in-Chief

ISSN (Print): 1386-2073
ISSN (Online): 1875-5402

Letter Article

Identification of Hub Genes Associated with Tumor-Infiltrating Immune Cells and ECM Dynamics as the Potential Therapeutic Targets in Gastric Cancer through an Integrated Bioinformatic Analysis and Machine Learning Methods

Author(s): Jie Liu and Zhong Cheng*

Volume 26, Issue 4, 2023

Published on: 27 October, 2022

Page: [653 - 667] Pages: 15

DOI: 10.2174/1386207325666220820163319

Price: $65

conference banner
Abstract

Background: Stomach cancer, also known as gastric adenocarcinoma, remains the most common and deadly cancer worldwide. Its early diagnosis and prevention are effective to improve the 5-year survival rate of the patients. Therefore, it is important to discover specific biomarkers for early diagnosis and drug treatment. This study investigates the potential key genes and signaling pathways involved in gastric cancer.

Methods: The gene expression profiles, GSE63089, GSE33335, and GSE79973, were retrieved for the identification of Differentially Expressed Genes (DEGs) within a total of 80 gastric cancer samples and 80 normal samples. A total of 1423 uP- and 1155 downregulated genes were screened for overlapping DEGs visualized via Venn diagrams along with 58 upregulated and 43 downregulated genes. These overlapping DEGs were evaluated with Gene Ontology (GO) enrichment, Kyoto Encyclopedia of Genes and Genomes (KEGG) enrichment, and Protein-Protein Interaction (PPI) network analysis. Using DAVID software, we identified several genes enriched in both GO and KEGG analyses. PPI analysis was performed with STRING software, and 3 submodules were obtained with Cytoscape software. Then, we used Cytohubba with 12 classification methods to select candidate hub genes. The group 1 genes enriched in GO and KEGG pathway intersected with group 2 genes, which were approved by nine algorithms, and group 3 genes clustered in three submodules. 9 hub genes were intersected from group 1/2/3 genes and the prognostic values were estimated through GEPIA. We found that the LUM and COL1A1 expression levels and survival outcomes displayed a favorable prognostic value (P-value = 0.013 for LUM and P-value =0.042 for COL1A1).

Results: Finally, 5 machine learning methods were employed for the validation of two hub genes (COL1A1, LUM) to distinguish between the cancer samples and non-cancer samples. The accuracy of XGBoost was estimated to be 0.9375, and the precision and specificity as 1.000. The highest recalls of LR and MLP were 1.0000, and the AUC was 1.0000. In the test set GSE65801, the accuracy of all models was greater than 80%, and the XGBoost model obtained the highest prediction accuracy of 0.8906. The precision of 0.9301 and the specificity of 0.9375 were obtained. The highest recall of MLP was 0.8750 and AUC was 0.9082. The correlation of prognostic indicators with the tumor-infiltrating immune cell levels was analyzed using TIMER.

Conclusion: The identified hub genes explored in this study would enhance the understanding of the molecular mechanism of gastric cancer and may be regarded as a potential therapeutic target as assessed by integrating bioinformatics and machine learning methods.

Keywords: Gastric cancer, hub genes, bioinformatics, machine learning, gastric adenocarcinoma, gene ontology.

Next »
Graphical Abstract
[1]
Bray, F.; Ferlay, J.; Soerjomataram, I.; Siegel, R.L.; Torre, L.A.; Jemal, A. Global cancer statistics 2018: GLOBOCAN estimates of incidence and mortality worldwide for 36 cancers in 185 countries. CA Cancer J. Clin., 2018, 68(6), 394-424.
[http://dx.doi.org/10.3322/caac.21492] [PMID: 30207593]
[2]
Chia, N.Y.; Tan, P. Molecular classification of gastric cancer. Ann. Oncol., 2016, 27(5), 763-769.
[http://dx.doi.org/10.1093/annonc/mdw040] [PMID: 26861606]
[3]
Matsuoka, T.; Yashiro, M. Biomarkers of gastric cancer: Current topics and future perspective. World J. Gastroenterol., 2018, 24(26), 2818-2832.
[http://dx.doi.org/10.3748/wjg.v24.i26.2818] [PMID: 30018477]
[4]
Wadhwa, R.; Song, S.; Lee, J.S.; Yao, Y.; Wei, Q.; Ajani, J.A. Gastric cancer-molecular and clinical dimensions. Nat. Rev. Clin. Oncol., 2013, 10(11), 643-655.
[http://dx.doi.org/10.1038/nrclinonc.2013.170] [PMID: 24061039]
[5]
Feng, Y.; Shi, C.; Wang, D.; Wang, X.; Chen, Z. Integrated analysis of DNA copy number changes and gene expression identifies key genes in gastric cancer. J. Comput. Biol., 2020, 27(6), 877-887.
[http://dx.doi.org/10.1089/cmb.2019.0149] [PMID: 31545074]
[6]
Liu, J.; Ma, L.; Chen, Z.; Song, Y.; Gu, T.; Liu, X.; Zhao, H.; Yao, N. Identification of critical genes in gastric cancer to predict prognosis using bioinformatics analysis methods. Ann. Transl. Med., 2020, 8(14), 884.
[http://dx.doi.org/10.21037/atm-20-4427] [PMID: 32793728]
[7]
Yu, C.; Chen, J.; Ma, J.; Zang, L.; Dong, F.; Sun, J.; Zheng, M. Identification of key genes and signaling pathways associated with the progression of gastric cancer. Pathol. Oncol. Res., 2020, 26(3), 1903-1919.
[http://dx.doi.org/10.1007/s12253-019-00781-3] [PMID: 31848941]
[8]
Asadi, F.; Salehnasab, C.; Ajori, L. Supervised algorithms of machine learning for the prediction of cervical cancer. J. Biomed. Phys. Eng., 2020, 10(4), 513-522.
[PMID: 32802799]
[9]
Kim, S. Weighted K-means support vector machine for cancer prediction. Springerplus, 2016, 5(1), 1162.
[http://dx.doi.org/10.1186/s40064-016-2677-4] [PMID: 27512621]
[10]
Libbrecht, M.W.; Noble, W.S. Machine learning applications in genetics and genomics. Nat. Rev. Genet., 2015, 16(6), 321-332.
[http://dx.doi.org/10.1038/nrg3920] [PMID: 25948244]
[11]
Sun, Y.; Zhu, S.; Ma, K.; Liu, W.; Yue, Y.; Hu, G. Identification of 12 cancer types through genome deep learning. Sci. Rep., 2019, 9(1), 17256.
[http://dx.doi.org/10.1038/s41598-019-53989-3]
[12]
Wu, W.; Zhou, H. Data-driven diagnosis of cervical cancer with support vector machine-based approaches. IEEE Access, 2017, PP(99), 1-1.
[http://dx.doi.org/10.1109/ACCESS.2017.2763984]
[13]
Oue, N.; Sentani, K.; Sakamoto, N.; Uraoka, N.; Yasui, W. Molecular carcinogenesis of gastric cancer: Lauren classification, mucin phenotype expression, and cancer stem cells. Int. J. Clin. Oncol., 2019, 24(7), 771-778.
[http://dx.doi.org/10.1007/s10147-019-01443-9] [PMID: 30980196]
[14]
Ao, R.; Guan, L.; Wang, Y.; Wang, J.N. Retracted: Silencing of COL1A2, COL6A3, and THBS2 inhibits gastric cancer cell proliferation, migration, and invasion while promoting apoptosis through the PI3k‐Akt signaling pathway. J. Cell. Biochem., 2018, 119(6), 4420-4434.
[http://dx.doi.org/10.1002/jcb.26524] [PMID: 29143985]
[15]
Chen, X.; Li, X.; Hu, X.; Jiang, F.; Shen, Y.; Xu, R.; Wu, L.; Wei, P.; Shen, X. LUM expression and its prognostic significance in gastric cancer. Front. Oncol., 2020, 10, 605.
[http://dx.doi.org/10.3389/fonc.2020.00605] [PMID: 32500021]
[16]
Van Calster, B.; Verbakel, J.Y.; Christodoulou, E.; Steyerberg, E.W.; Collins, G.S. Statistics versus machine learning: Definitions are interesting (but understanding, methodology, and reporting are more important). J. Clin. Epidemiol., 2019, 116, 137-138.
[http://dx.doi.org/10.1016/j.jclinepi.2019.08.002] [PMID: 31425736]
[17]
Bzdok, D.; Altman, N.; Krzywinski, M. Statistics versus machine learning. Nat. Methods, 2018, 15(4), 233-234.
[http://dx.doi.org/10.1038/nmeth.4642] [PMID: 30100822]

Rights & Permissions Print Cite
© 2024 Bentham Science Publishers | Privacy Policy