An end-to-end framework for gene expression classification by integrating a background knowledge graph: application to cancer prognosis prediction
Inoue, Kazuma, Kojima, Ryosuke, Kamada, Mayumi, Okuno, Yasushi
–arXiv.org Artificial Intelligence
Motivation: Biological data may be separated into primary data, such as gene expression, and secondary data, such as pathways and protein-protein interactions. Methods using secondary data to enhance the analysis of primary data are promising, because secondary data have background information that is not included in primary data. In this study, we proposed an end-to-end framework to integrally handle secondary data to construct a classification model for primary data. We applied this framework to cancer prognosis prediction using gene expression data and a biological network. Results: Cross-validation results indicated that our model achieved higher accuracy compared with a deep neural network model without background biological network information. Experiments conducted in patient groups by cancer type showed improvement in ROC-area under the curve for many groups. Visualizations of high accuracy cancer types identified contributing genes and pathways by enrichment analysis. Known biomarkers and novel biomarker candidates were identified through these experiments.
arXiv.org Artificial Intelligence
Jun-29-2023
- Country:
- Asia > Japan
- Honshū > Kansai > Kyoto Prefecture > Kyoto (0.05)
- Europe
- Spain > Catalonia
- Barcelona Province > Barcelona (0.04)
- Switzerland > Basel-City
- Basel (0.04)
- Spain > Catalonia
- Oceania > Australia
- New South Wales > Sydney (0.04)
- Asia > Japan
- Genre:
- Research Report
- Experimental Study (0.95)
- New Finding (0.88)
- Research Report
- Industry:
- Health & Medicine
- Pharmaceuticals & Biotechnology (1.00)
- Therapeutic Area
- Immunology (1.00)
- Infections and Infectious Diseases (1.00)
- Neurology (1.00)
- Oncology > Carcinoma (1.00)
- Health & Medicine
- Technology: