Predictability of Off-Targets in CRISPR-Cas9 Gene Editing Systems using Convolutional Neural Networks

Authors

  • Parsh Verma
  • Jothsna Kethar Gifted Gabber
  • Dr. Rajagopal Appavu, Ph.D University of South Florida

DOI:

https://doi.org/10.47611/jsrhs.v12i3.5052

Keywords:

Genetics, Artificial Intelligence, CRISPR-Cas9, Convolutional Neural Network, Gene Editing, Biomedical Engineering, Bioinformatics, Biostatistics, CRISPR

Abstract

CRISPR-Cas systems have catalyzed a quantum leap in genomics, providing a remarkable degree of precision in genomic sequence alterations. Despite these advancements, one notable conundrum persists - the prediction and mitigation of off-target effects. These unintended effects occur when the CRISPR apparatus interacts with homologous, albeit non-identical sequences, which could potentially induce unanticipated mutations. This investigation employs a comparative approach, juxtaposing machine-learning models that harness sophisticated Convolutional Neural Networks with different architectures. The standard CNN model utilizes a stratified architecture that meticulously filters pertinent genetic patterns, and identifies salient genetic features using a convolutional layer along with a batch normalization layer and a max pooling layer. This architecture prevents overfitting by applying a dropout layer, and 2 dense layers to categorize the two outputs. In contrast, the AttnToMismatch_CNN model marries the attention mechanism with the convolutional paradigm to encode sgRNA and DNA sequences into vector representations by using embedding and transformer layers. Which is then passed into a convolution layer and then to a dense layer to result in two outputs. Performance appraisal of the models, through the Area Under the Curve (AUC) of the Receiver Operating Characteristic Curve (AUC-ROC) score, indicated the standard CNN model's superior predictive accuracy. This research accentuates the untapped potential of Convolutional Neural Networks in augmenting the predictability of off-target effects in CRISPR-Cas systems, thereby fostering safer and more efficacious applications of this transformative gene editing tool.

Downloads

Download data is not yet available.

References or Bibliography

Almagro Armenteros, J. J., Sønderby, C. K., Sønderby, S. K., Nielsen, H., & Winther, O. (2017). DeepLoc: prediction of protein subcellular localization using deep learning. Bioinformatics, 33(21), 3387–3395. https://doi.org/10.1093/bioinformatics/btx431

Kang, S.-H., Lee, W., An, J.-H., Lee, J.-H., Kim, Y.-H., Kim, H., Oh, Y., Park, Y.-H., Jin, Y. B., Jun, B.-H., Hur, J. K., Kim, S.-U., & Lee, S. H. (2021). Author Correction: Prediction-based highly sensitive CRISPR off-target validation using target-specific DNA enrichment. Nature Communications, 12(1). https://doi.org/10.1038/s41467-020-20559-5

Lin, J., & Wong, K.-C. (2018). Off-target predictions in CRISPR-Cas9 gene editing using deep learning. Bioinformatics, 34(17), i656–i663. https://doi.org/10.1093/bioinformatics/bty554

Lino, C. A., Harper, J. C., Carney, J. P., & Timlin, J. A. (2018). Delivering CRISPR: a review of the challenges and approaches. Drug Delivery, 25(1), 1234–1257. https://doi.org/10.1080/10717544.2018.1474964

Liu, Q., Cheng, X., Liu, G., Li, B., & Liu, X. (2020). Deep learning improves the ability of sgRNA off-target propensity prediction. BMC bioinformatics, 21(1), 51. https://doi.org/10.1186/s12859-020-3395-z

Liu, Q., He, D., & Xie, L. (2019). Prediction of off-target specificity and cell-specific fitness of CRISPR-Cas System using attention boosted deep learning and network-based gene feature. PLoS computational biology, 15(10), e1007480. https://doi.org/10.1371/journal.pcbi.1007480

Ma, Y., Zhang, L., & Huang, X. (2014). Genome modification by CRISPR-Cas9. FEBS Journal, 281(23), 5186–5193. https://doi.org/10.1111/febs.13110

Manghwar, H., Lindsey, K., Zhang, X., & Jin, S. (2019). CRISPR-Cas System: Recent Advances and Future Prospects for Genome Editing. Trends in Plant Science, 24(12), 1102–1125. https://doi.org/10.1016/j.tplants.2019.09.006

Ran, F. Ann, Hsu, Patrick D., Lin, C.-Y., Gootenberg, Jonathan S., Konermann, S., Trevino, A. E., Scott, David A., Inoue, A., Matoba, S., Zhang, Y., & Zhang, F. (2013). Double Nicking by RNA-Guided CRISPR Cas9 for Enhanced Genome Editing Specificity. Cell, 154(6), 1380–1389. https://doi.org/10.1016/j.cell.2013.08.021

Redman, M., King, A., Watson, C., & King, D. (2016). What is CRISPR-Cas9? Archives of Disease in Childhood - Education & Practice Edition, 101(4), 213–215. https://doi.org/10.1136/archdischild-2016-310459

Zhang, F., Wen, Y., & Guo, X. (2014). CRISPR-Cas9 for genome editing: progress, implications and challenges. Human Molecular Genetics, 23(R1), R40–R46. https://doi.org/10.1093/hmg/ddu125

Zhang, X.-H., Tee, L. Y., Wang, X.-G., Huang, Q.-S., & Yang, S.-H. (2015). Off-target Effects in CRISPR-Cas9-mediated Genome Engineering. Molecular Therapy - Nucleic Acids, 4(1), e264. https://doi.org/10.1038/mtna.2015.37

Zhu, L. J. (2015). Overview of guide RNA design tools for CRISPR-Cas9 genome editing technology. Frontiers in Biology, 10(4), 289–296. https://doi.org/10.1007/s11515-015-1366-y

Published

08-31-2023

How to Cite

Verma, P., Kethar, J. ., & Appavu, Ph.D, D. R. (2023). Predictability of Off-Targets in CRISPR-Cas9 Gene Editing Systems using Convolutional Neural Networks. Journal of Student Research, 12(3). https://doi.org/10.47611/jsrhs.v12i3.5052

Issue

Section

HS Research Projects