A Deep Autoencoder System for Differentiation of Cancer Types Based on DNA Methylation State

Khwaja, Mohammed, Kalofonou, Melpomeni, Toumazou, Chris

arXiv.org Machine Learning 

Abstract--A Deep Autoencoder based content retrieval algorithm is proposed for prediction and differentiation of cancer types based on the presence of epigenetic patterns of DNA methylation identified in genetic regions known as CpG islands. The developed deep learning system uses a CpG island state classification subsystem to complete sets of missing/incomplete island data in given human cell lines, and is then pipelined with an intricate set of statistical and signal processing methods to accurately predict the presence of cancer and further differentiate the type and cell of origin in the event of a positive result. The proposed system was trained with previously reported data derived from four case groups of cancer cell lines, achieving overall Sensitivity of 88.24%, Specificity of 83.33%, Accuracy of 84.75% and Matthews Correlation Coefficient of 0.687. The ability to predict and differentiate cancer types using epigenetic events as the identifying patterns was demonstrated in previously reported data sets from breast, lung, lymphoblastic leukemia and urological cancer cell lines, allowing the pipelined system to be robust and adjustable to other cancer cell lines or epigenetic events. Significant progress has been made in understanding crucial regulatory mechanisms responsible for the development and progression of cancer at a cellular and molecular level, through genetic alterations such as DNA mutations and disruptions in epigenetic mechanisms including DNA methylation and histone modifications [1]. Cancer rates have been progressively increasing, with the latest statistics from Cancer Research UK to have reported more than 350,000 new cases diagnosed in the UK [2], of which more than 40% could have been prevented. Cancer research has been significantly progressing with advances in more effective treatments and screening methods, however there is still a pressing need for more targeted methods to be available for monitoring of cancer progression and prevention of treatment resistance that would help control the disease and improve survival rates.

Duplicate Docs Excel Report

Title
None found

Similar Docs  Excel Report  more

TitleSimilaritySource
None found