Journal List > Hanyang Med Rev > v.37(2) > 1044317

Kim: A Review of Deep Genomics Applying Machine Learning in Genomic Medicine

Abstract

Genomic medicine is to determine how an individual's DNA alteration can affect the risk of various diseases and to understand mechanisms and design targeted treatments. Here, we focus on how machine learning helps model the relationship between DNA and molecular phenotypes in a cell. Modern biology enables high throughput measurements of many cellular variables that can be handled as a training target for predictable models, such as gene expression, splicing, and protein binding to DNA or mRNA. With the increasing availability of large datasets and advanced computer skills such as deep learning, researchers have opened a new era in effective genomic medicine.

Notes

Running title Robotic Thyroidectomy

References

1. Watson JD. Molecular structure of nucleic aids: A structure for deoxyribose nucleic acid. Nature. 1953; 171:737–738.
crossref
2. Lander ES, Linton LM, Birren B, Nusbaum C, Zody MC, Baldwin J, et al. Initial sequencing and analysis of the human genome. Nature. 2001; 409:860–921.
3. de Klerk E, 't Hoen PA. Alternative mRNA transcription, processing, and translation: Insights from RNA sequencing. Trends Genet. 2015; 31:128–139.
crossref
4. Harrow J, Frankish A, Gonzalez JM, Tapanari E, Diekhans M, Kokocinski F, et al. GENCODE: The reference human genome annotation for the ENCODE project. Genome Res. 2012; 22:1760–1774.
crossref
5. Lindblad-Toh K, Garber M, Zuk O, Lin MF, Parker BJ, Washietl S, et al. A high-resolution map of human evolutionary constraint using 29 mammals. Nature. 2011; 478:476–482.
crossref
6. Moffat JG, Rudolph J, Bailey D. Phenotypic screening in cancer drug discovery-past, present and future. Nature Rev Drug Discov. 2014; 13:588–602.
7. Leung MK, Delong A, Alipanahi B, Frey BJ. Machine learning in genomic medicine: A review of computational problems and data sets. Proc IEEE. 2016; 104:176–197.
crossref
8. Landt SG, Marinov GK, Kundaje A, Kheradpour P, Pauli F, Batzoglou S, et al. ChIP-seq guidelines and practices of the ENCODE and modENCODE consortia. Genome Res. 2012; 22:1813–1831.
crossref
9. Xiong HY, Alipanahi B, Lee LJ, Bretschneider H, Merico D, Yuen RK, et al. RNA splicing. The human splicing code reveals new insights into the genetic determinants of disease. Science. 2015; 347:Epub 2014 Dec 18.
crossref
10. Singh RK. Pre-mRNA splicing in disease and therapeutics. Trends Mol Med. 2012; 18:472–482.
crossref
11. Pan Q, Shai O, Lee LJ, Frey BJ, Blencowe BJ. Deep surveying of alternative splicing complexity in the human transcriptome by highthroughput sequencing. Nat Genet. 2008; 40:1413–1415.
crossref
12. Barbosa-Morais NL, Irimia M, Pan Q, Xiong HY, Gueroussov S, Lee LJ, et al. The evolutionary landscape of alternative splicing in vertebrate species. Science. 2012; 338:1587–1593.
crossref
13. Voineagu I, Wang X, Johnston P, Lowe JK, Tian Y, Horvath S, et al. Transcriptomic analysis of autistic brain reveals convergent molecular pathology. Nature. 2011; 474:380.
crossref
14. López-Bigas N, Audit B, Ouzounis C, Parra G, Guigó R. Are splicing mutations the most frequent cause of hereditary disease? FEBS Lett. 2005; 579:1900–1903.
crossref
15. Levo M. In pursuit of design principles of regulatory sequences. Nat Rev Genet. 2014; 15:453–468.
crossref
16. Stormo GD. DNA binding sites: Representation and discovery. Bioinformatics. 2000; 16:16–23.
crossref
TOOLS
Similar articles