Statistical analysis of methylation reveals epigenetic biomarkers for high- functioning autism spectrum disorder

Authors

  • Aarav Sharma Archbishop Mitty High School
  • Professor Hayan Lee

DOI:

https://doi.org/10.47611/jsrhs.v13i1.6137

Keywords:

autism spectrum disorder, ASD, Austism, Machine Learning, PCA, Principal Compoent Analysis, high-function autism, Kolmogorov-Smirnov test, KS-Test, False Discovery Rate, FDR, Cpgs, Pvalue

Abstract

Autism spectrum disorder (ASD) is a neurological disorder that affects social behavior and learning ability. Although studies have identified certain regions of the genome (22q13.33) linked to the increased risk of ASD, there is no clear etiology, which is why psychologists theorize that ASD is a complex interaction between genetic and environmental factors. The current study was designed to identify genes with a positive correlation toward the diagnosis of high-function autism spectrum disorder (HF-ASD). Samples were obtained from National Center for Biotechnology Information (NCBI) containing 450K CpG methylation from 69 subjects diagnosed as HF-ASD or typically developing (TD). We applied Principal Component Analysis (PCA) and further utilized two sample Kolmogorov-Smirnov test (KS-test) to obtain the p-values for each patient. False Discovery Rate (FDR Correction) is computed by Benjamini-Hochberg Procedure. Based on the output of the FDR method, we identified the top ten genes that have CpGs with statistically differentially methylated. Such potential epigenetic biomarker includes WBP11, LDHB, DHX29, NUB1, C2orf67, SNAI2, MTOR, CLCN3, SYNJ2BP, and TTK. Two sample KS test and FDR Correction code is available at: https://www.kaggle.com/code/aaravsharma123/gse109905-fdr-pvalues/notebook.

 

Downloads

Download data is not yet available.

References or Bibliography

Kimura, Ryo, et al. “An Epigenetic Biomarker for Adult High-Functioning Autism Spectrum Disorder.” Nature News, Nature Publishing Group, 20 Sept.2019, https://www.nature.com/articles/s41598-019-50250-9#data-availability.

https://www.ncbi.nlm.nih.gov/geo/query/acc.cgi?acc=GSE109905.

“PCA.” Pca Visualization in Python, https://plotly.com/python/pca-visualization/.

“Scipy.stats.ks_2samp#.” Scipy.stats.ks_2samp - SciPy v1.9.1 Manual, https://docs.scipy.org/doc/scipy/reference/generated/scipy.stats.ks_2samp.html.

“Statsmodels.stats.multitest.fdrcorrection” Statsmodels, https://www.statsmodels.org/dev/generated/statsmodels.stats.multitest.fdrcorrection.html.

https://www.kaggle.com/datasets/hayan2021lee/hm450coords

Marks, Katya, et al. “Maternal-Autoantibody-Related (MAR) Autism: Identifying Neuronal Antigens and Approaching Prospects for Intervention.” Journal of Clinical Medicine, MDPI, 7Aug.2020, https://www.ncbi.nlm.nih.gov/pmc/articles/PMC7465310/.

Zhu, Yihui, et al. “Placental Methylome Reveals a 22q13.33 Brain Regulatory Gene Locus Associated with Autism - Genome Biology.” BioMed Central, BioMed Central, 16 Feb. 2022, https://genomebiology.biomedcentral.com/articles/10.1186/s13059-022-02613-1.

Tordjman, Sylvie, et al. “Gene × Environment Interactions in Autism Spectrum Disorders: Role of Epigenetic Mechanisms.” Frontiers, Frontiers, 1 Jan. 1AD, https://www.frontiersin.org/articles/10.3389/fpsyt.2014.00053/full.

Published

02-28-2024

How to Cite

Sharma, A., & Lee, H. (2024). Statistical analysis of methylation reveals epigenetic biomarkers for high- functioning autism spectrum disorder. Journal of Student Research, 13(1). https://doi.org/10.47611/jsrhs.v13i1.6137

Issue

Section

HS Research Projects