Deep Learning for MS2 Feature Detection in Liquid Chromatography Mass Spectrometry

Authors

  • Jonathan He Texas Academy of Mathematics and Science
  • Olivia Liu
  • Xuan Guo

DOI:

https://doi.org/10.47611/jsrhs.v11i3.2964

Keywords:

Deep Learning, Feature detection, liquid-chromatography mass spectrometry

Abstract

Accuracy of peptide identification is crucial for LC-MS analysis to reveal information regarding many different aspects of proteins that aid in the discovery of biomarkers and profiling of complex proteomes. Preprocessing steps such as feature detection are crucial yet challenging; current feature detection tools are not robust enough to detect low-abundance, low-peak fragments of peptides found in MS2 data from tandem mass spectrometry. In this study, we developed a deep learning-based model with an innovative sliding window process that enables high-resolution processing of quantitative MS/MS data to conduct accurate feature detection on MS2 data. Experimental results show that our model is able to produce more accurate values and identifications than existing feature detection tools. Therefore, we believe that our model can realize the full potential of neural networks in the field of bioinformatics and yields long-term benefits in the advancement of proteomic inquiry.

Downloads

Download data is not yet available.

References or Bibliography

Ren, Shaoqing, et al. “Faster R-CNN: Towards Real-Time Object Detection with Region Proposal Networks.” IEEE Transactions on Pattern Analysis and Machine Intelligence, vol. 39, no. 6,2016, pp. 1137-1149, doi:10.1109/TPAMI.2016.2577031

Zohora, Fatema Tuz, et al. “DeepIso: A Deep Learning model for Peptide Feature Detection from LC-MS Map.” Scientific Reports, vol. 9, no. 17168, 2019. https://doi.org/10.1038/s41598-019-52954-4.

Sellers, K. & Miecznikowski, J. “Feature Detection Techniques for Preprocessing Proteomic Data.” International Journal of Biomedical Imaging, vol. 2010, 2010, 896718. doi: 10.1155/2010/896718.

Ma, Chunwei, et al. “Improved Peptide Retention Time Prediction in Liquid Chromatography through Deep Learning.” Analytical Chemistry, vol. 90, no. 18, 2018, pp. 10881-10888, https://doi.org/10.1021/acs.analchem.8b02386.

Nicolescu, Teodor Octavian. “Interpretation of Mass Spectra.” Mass Spectrometry, edited by Mahmood Aliofkhazraei, IntechOpen, 2017. doi: 10.5772/intechopen.68595.

Cox, J & Mann, M. “MaxQuant enables high peptide identification rates, individualized p.p.b.-range mass accuracies and proteome-wide quantification.” Nature Biotechnology, vol. 26, 2008, pp. 1367-1372. https://doi.org/10.1038/nbt.1511.

Teleman, John, et al. “Dinosaur: A Refined Open-Source Peptide MS Feature Detector.” Journal of Proteome Research, vol. 15, no. 7, 2016, pp. 2143-2151. https://doi.org/10.1021/acs.jproteome.6b00016.

Cappadona, S., Baker, P. R., Cutillas, P. R., Heck, A. J. & van Breukelen, B. “Current challenges in software solutions for mass spectrometry-based quantitative proteomics.” Amino acids, vol. 43, 2012, pp. 1087–1108. https://doi.org/10.1007/s00726-012-1289-8

Savojardo, Castrense, et al. “DeepSig: deep learning improves signal peptide detection in proteins.” Bioinformatics, vol. 34, no. 10, 2018, pp. 1690-1696. https://doi.org/10.1093/bioinformatics/btx818.

Tran, Ngoc Hieu, et al. “De novo peptide sequencing by deep learning.” Proceedings of the National Academy of Sciences, vol. 114, no. 31, 2017, pp. 8247-8252. https://doi.org/10.1073/pnas.1705691114.

Yang, X., Mochanov, P., Kautz, J. “Multilayer and Multimodal Fusion of Deep Neural Networks for Video Classification.” Proceedings of the 24th ACM International Conference on Multimedia, 2016, pp. 978-987, doi: 10.1145/2964284.2964297.

Rost, H., Sachsenberg, T., Aiche, S. et al. “OpenMS: a flexible open-source software platform for mass spectrometry data analysis.” Nature Methods, vol 13, 2016, pp. 741-748. https://doi.org/10.1038/nmeth.3959.

Hoffman, Melissa A et al. “Comparison of Quantitative Mass Spectrometry Platforms for Monitoring Kinase ATP Probe Uptake in Lung Cancer.” Journal of Proteome Research, vol. 17, no. 1, 2018, pp. 63-75. doi:10.1021/acs.jproteome.7b00329.

McLafferty, F. W. “Tandem mass spectrometry (MS/MS): a promising new analytical technique for specific component determination in complex mixtures.” Accounts of Chemical Research, vol. 13, no. 2, 1980, pp. 33-39. Doi: 10.1021/ar50146a001.

Kessner, D., Chambers, M., Burke, R., Agus, D. & Mallick, P. “ProteoWizard: open source software for rapid proteomics tool development.” Bioinformatics, vol. 24, no. 21, 2008, pp. 2534-2536. https://doi.org/10.1093/bioinformatics/btn323

Adusumilli, R. & Mallick, P. “Data Conversion with ProteoWizard msConvert.” Proteomics, Methods in Molecular Biology, vol. 1550, 2017. https://doi.org/10.1007/978-1-4939-6747-6_23.

Stewart, P. A., Welsh, E. A., Slebos, R. J. C. et al. “Proteogenomic landscape of squamous cell lung cancer.” Nature Communications, vol. 10, no. 3578, 2019. https://doi.org/10.1038/s41467-019-11452-x.

Stewart, P. et al. “Relative protein quantification and accessible biology in lung tumor proteomes from four LC-MS/MS discovery platforms.” Proteomics, vol. 17, no. 6, 2017. Doi: 10.1002/pmic.201600300.

Hu, A. et al. “Technical Advances in proteomics: new developments in data-independent acquisition.” F1000Research, vol. 5, 2016. doi: 10.12688/f1000research.7042.1.

Tada, I. et al. “Correlation-Based Deconvolution (CorrDec) To Generate High-Quality MS2 Spectra from Data-Independent Acquisition in Multisample Studies.” Analytical Chemistry, vol. 92, no. 16, 2020, pp. 11310-11317. Doi: 10.1021/acs.analchem.0c01980.

Virreira Winter, S., Meier, F., Wichmann, C. et al. “EASI-tag enables accurate multiplexed and interference-free MS2-based proteome quantification.” Nature Methods, vol. 15, 2018, pp. 527–530. https://doi.org/10.1038/s41592-018-0037-8.

Weissert, H. & Choudhary, J. S. “Targeted Feature Detection for Data-Dependent Shotgun Proteomics.” Journal of Proteome Research, vol. 16, no. 8, 2017, pp. 2964-2974. doi: 10.1021/acs.jproteome.7b00248.

Wang, X., Shen, S., Rasam, S. S. & Qu, J. “MS1 ion current-based quantitative proteomics: A promising solution for reliable analysis of large biological cohorts.” Mass Spectrometry Reviews, vol. 38, no. 6, 2019, pp. 461-482. doi: 10.1002/mas.21595.

Bodla, N., Singh, B., Chellappa, R. & Daivs, L. “Soft-NMS – Improving Object Detection with One Line of Code.” IEEE International Conference on Computer Vision, 2017, pp. 5562-5570. https://doi.org/10.48550/arXiv.1704.04503.

Zhang, G. et al. “Overview of peptide and protein analysis by mass spectrometry.” Current protocols in protein science, vol. 16, no. 1, 2010. doi:10.1002/0471140864.ps1601s62.

Published

08-31-2022

How to Cite

He, J., Liu, O., & Guo, X. (2022). Deep Learning for MS2 Feature Detection in Liquid Chromatography Mass Spectrometry. Journal of Student Research, 11(3). https://doi.org/10.47611/jsrhs.v11i3.2964

Issue

Section

HS Research Articles