Preprint / Version 1

Genome Sequencing: Short-Read Sequencing and Computational Algorithms

##article.authors##

  • Sabrina Eng Student

Keywords:

DNA sequencing, k-mer, graph theory

Abstract

Sequencing the human genome provides valuable resources for biomedical research and medicinal practice. Modern technology struggles with whole-sequencing methods; most genome sequence techniques involve a short-read process by breaking apart the genome into small read segments. To reconstruct the entire sequence, each read is overlapped as a search for a Eulerian path. Computer science is thus applied to assemble the genome as the data sets are complex. This paper details one computational approach to genome reconstruction using the short-read sequencing described above.

References or Bibliography

Shanika L Amarasinghe, Shian Su, Xueyi Dong, Luke Zappia, Matthew E Ritchie, and Quentin Gouil. Opportunities and challenges in long-read sequencing data analysis. Genome biology, 21(1):1–16, 2020.

Gary Chartrand. Introductory graph theory. Courier Corporation, 1977.

Kishore R Kumar, Mark J Cowley, and Ryan L Davis. Next-generation sequencing and emerging technologies. In Seminars in thrombosis and hemostasis, volume 45, pages 661–673. Thieme Medical Publishers, 2019.

Mohit K Midha, Mengchu Wu, and Kuo-Ping Chiu. Long-read sequencing in deciphering human genetics to a greater depth. Human genetics, 138(11):1201–1215, 2019.

Dev Patel. Russian bridges, eulerian circuits, and genome assembly?, Jul 2021.

Judes Poirier, P Bertrand, S Kogan, S Gauthier, J Davignon, and D Bouthillier. Apolipoprotein e polymorphism and alzheimer’s disease. The Lancet, 342(8873):697–699, 1993.

Lisa A Urry, Michael Lee Cain, Steven Alexander Wasserman, Peter V Minorsky, and Jane B Reece. Campbell biology in focus, volume 10. Pearson Boston, MA, 2014.

Nava Whiteford, Niall Haslam, Gerald Weber, Adam Pru¨gel-Bennett, Jonathan W Essex, Pe- ter L Roach, Mark Bradley, and Cameron Neylon. An analysis of the feasibility of short read sequencing. Nucleic acids research, 33(19):e171–e171, 2005.

Downloads

Posted

09-30-2021