Statistical analysis of methylation reveals epigenetic biomarkers for high- functioning autism spectrum disorder
DOI:
https://doi.org/10.47611/jsrhs.v13i1.6137Keywords:
autism spectrum disorder, ASD, Austism, Machine Learning, PCA, Principal Compoent Analysis, high-function autism, Kolmogorov-Smirnov test, KS-Test, False Discovery Rate, FDR, Cpgs, PvalueAbstract
Autism spectrum disorder (ASD) is a neurological disorder that affects social behavior and learning ability. Although studies have identified certain regions of the genome (22q13.33) linked to the increased risk of ASD, there is no clear etiology, which is why psychologists theorize that ASD is a complex interaction between genetic and environmental factors. The current study was designed to identify genes with a positive correlation toward the diagnosis of high-function autism spectrum disorder (HF-ASD). Samples were obtained from National Center for Biotechnology Information (NCBI) containing 450K CpG methylation from 69 subjects diagnosed as HF-ASD or typically developing (TD). We applied Principal Component Analysis (PCA) and further utilized two sample Kolmogorov-Smirnov test (KS-test) to obtain the p-values for each patient. False Discovery Rate (FDR Correction) is computed by Benjamini-Hochberg Procedure. Based on the output of the FDR method, we identified the top ten genes that have CpGs with statistically differentially methylated. Such potential epigenetic biomarker includes WBP11, LDHB, DHX29, NUB1, C2orf67, SNAI2, MTOR, CLCN3, SYNJ2BP, and TTK. Two sample KS test and FDR Correction code is available at: https://www.kaggle.com/code/aaravsharma123/gse109905-fdr-pvalues/notebook.
Downloads
References or Bibliography
Kimura, Ryo, et al. “An Epigenetic Biomarker for Adult High-Functioning Autism Spectrum Disorder.” Nature News, Nature Publishing Group, 20 Sept.2019, https://www.nature.com/articles/s41598-019-50250-9#data-availability.
https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE109905.
“PCA.” Pca Visualization in Python, https://plotly.com/python/pca-visualization/.
“Scipy.stats.ks_2samp#.” Scipy.stats.ks_2samp - SciPy v1.9.1 Manual, https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.ks_2samp.html.
“Statsmodels.stats.multitest.fdrcorrection” Statsmodels, https://www.statsmodels.org/dev/generated/statsmodels.stats.multitest.fdrcorrection.html.
https://www.kaggle.com/datasets/hayan2021lee/hm450coords
Marks, Katya, et al. “Maternal-Autoantibody-Related (MAR) Autism: Identifying Neuronal Antigens and Approaching Prospects for Intervention.” Journal of Clinical Medicine, MDPI, 7Aug.2020, https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7465310/.
Zhu, Yihui, et al. “Placental Methylome Reveals a 22q13.33 Brain Regulatory Gene Locus Associated with Autism - Genome Biology.” BioMed Central, BioMed Central, 16 Feb. 2022, https://genomebiology.biomedcentral.com/articles/10.1186/s13059-022-02613-1.
Tordjman, Sylvie, et al. “Gene × Environment Interactions in Autism Spectrum Disorders: Role of Epigenetic Mechanisms.” Frontiers, Frontiers, 1 Jan. 1AD, https://www.frontiersin.org/articles/10.3389/fpsyt.2014.00053/full.
Published
How to Cite
Issue
Section
Copyright (c) 2024 Aarav Sharma; Professor Hayan Lee
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distriute & display this article.