Finding Earthquake Patterns with Time Series and Regression Models

Authors

  • Stavya Gaonkar American High School
  • Aditi Ravindra
  • Apoorva Bathula
  • Rohan Kolala
  • Ansh Bhatia
  • Sam Fendell

DOI:

https://doi.org/10.47611/jsrhs.v13i2.6668

Keywords:

Machine Learning, Earthquakes, Regression, Time Series Forecasting

Abstract

Due to their unpredictability, earthquakes are certainly the most dangerous natural disasters. Scientists have long struggled to understand the movement of tectonic plates, as they occur at depths that cannot be directly seen and occur extremely slowly, at a rate that results in possibly one every thousand years between two adjacent plates. What if we could take a different approach, however? What if we could leverage the recent advances in machine learning to close this gap? That is exactly what we aimed to do in our research, with the question of whether various factors related to earthquakes, such as their magnitude and time of occurrence, could be determined through the analysis of years of historical data. This was conducted through the analysis of a dataset which contained records of all major earthquakes (magnitude 5.5 or higher) from 1900-2023. The library scikit-learn was used for various regression models, such as linear regression, random forest regression, and lasso regression, and the library FB prophet was used for time series forecasting, a method used to analyze patterns in previous earthquakes to predict new ones. Our conclusions were that, with some degree of error, the magnitude of an earthquake can be determined based on location/time and the time of occurrence can be pinpointed to specific days of the week as well as months.

Downloads

Download data is not yet available.

References or Bibliography

Islam, Jahaidul. “Significant Earthquake Dataset 1900-2023.” Kaggle, 18 Feb. 2023, www.kaggle.com/datasets/jahaidulislam/significant-earthquake-dataset-1900-2023.

“Is There Such a Thing as Earthquake Season?: Earth.” EarthSky, 22 Apr. 2021, earthsky.org/earth/earthquake-season-taiwan-dry-monsoon/.

“Can the Position of the Moon or the Planets Affect Seismicity? Are There More Earthquakes in the Morning/in the Evening/at a Certain Time of the Month?” Can the Position of the Moon or the Planets Affect Seismicity? Are There More Earthquakes in the Morning/in the Evening/at a Certain Time of the Month? | U.S. Geological Survey, www.usgs.gov/faqs/can-position-moon-or-planets-affect-seismicity-are-there-more-earthquakes-morningin-eveningat

Prabhakaran, Selva. “Time Series Analysis in Python - a Comprehensive Guide with Examples - Ml+.” Machine Learning Plus, Machine Learning Plus, 8 Sept. 2020, www.machinelearningplus.com/time-series/time-series-analysis-python/.

Brownlee, Jason. “How to Check If Time Series Data Is Stationary with Python.” Machine Learning Mastery, 14 Aug. 2020, machinelearningmastery.com/time-series-data-stationary-python/.

Erica. “Introduction to the Fundamentals of Time Series Data and Analysis.” Aptech, 12 May 2021, www.aptech.com/blog/introduction-to-the-fundamentals-of-time-series-data-and-analysis

“Forecasting at Scale.” Prophet, facebook.github.io/prophet/. Accessed 28 Feb. 2024.

Bakshi, Chaya. “Random Forest Regression.” Medium, Level Up Coding, 9 June 2020, levelup.gitconnected.com/random-forest-regression-209c0f354c84.

Published

05-31-2024

How to Cite

Gaonkar, S., Ravindra, A., Bathula, A. ., Kolala, R., Bhatia, A., & Fendell, S. (2024). Finding Earthquake Patterns with Time Series and Regression Models. Journal of Student Research, 13(2). https://doi.org/10.47611/jsrhs.v13i2.6668

Issue

Section

HS Research Projects