A Machine Learning Approach to Predict the Occurrence of Forest Fires with Meteorological Parameters

Authors

  • Yashnil Mohanty High School
  • Juyon Lee

DOI:

https://doi.org/10.47611/jsrhs.v13i2.6620

Keywords:

Machine Learning, Artificial Intelligence, Forest Fires, Random Forest Classifier, scikit learn

Abstract

Forest fires have emerged as a considerable challenge in the United States, posing substantial societal, economic, and environmental risks. As a result, the early and accurate prediction of these fires is imperative for management efforts. In this study, we used two Kaggle datasets: the “Algerian Forest Fires Dataset” with fire readings from 2012 and the “Forest Fires Data Set” with readings from 2007. However, because the second data set was originally intended for a regression task, providing approximate area values representing the predicted burned area of the forest fire, we phased the data set out while developing our final model. Ultimately, we used the Algerian Forest Fires Dataset, containing 13 attributes and 244 instances of forest fires in two regions of Algeria. To streamline the analysis, we reduced the number of features to 5, namely, month, temperature, humidity, wind, and rain. Moreover, we developed a Random Forest Classifier model to predict the occurrence of a forest fire, using the data set for training and testing. Performance was compared against Decision Tree, Logistic Regression, and Artificial Neural Network models, using cross-validation. The experiment showed a slight superiority to the Random Forest Classifier approach, achieving an accuracy score of 86.486% and an F1 score of 88.889%. Our approach provides a decimal value representing the probability for fire likelihood. Overarchingly, this research contributes to the advancement of forest fire prediction technologies by leveraging meteorological data.

Downloads

Download data is not yet available.

References or Bibliography

Staff. “Forest Fires & Climate Change: Effects of Deforestation on Wildfires: GFW.” Global Forest Watch, 2024, www.globalforestwatch.org/topics/fires/#intro.

Staff. “Wildfire Statistics.” Congressional Research Service, 1 June 2023, sgp.fas.org/crs/misc/IF10244.pdf.

Keeley, Jon, and Alexandra Syphard. “Climate change and future fire regimes: Examples from California.” Geosciences, vol. 6, no. 3, 17 Aug. 2016, p. 37, https://doi.org/10.3390/geosciences6030037.

Staff. “Fire Weather Index (FWI) System.” NWCG, 28 Aug. 2023, www.nwcg.gov/publications/pms437/cffdrs/fire-weather-index-system.

Zaidi, Abdelhamid. “Predicting wildfires in Algerian forests using machine learning models.” Heliyon, vol. 9, no. 7, 10 July 2023, https://doi.org/10.1016/j.heliyon.2023.e18064.

Castelli, Mauro, et al. “Predicting burned areas of forest fires: An Artificial Intelligence Approach.” Fire Ecology, vol. 11, no. 1, 1 Apr. 2015, pp. 106–118, https://doi.org/10.4996/fireecology.1101106.

Staff. “Wildfires.” EPA, Aug. 2016, www.epa.gov/sites/default/files/2016-08/documents/print_wildfires-2016.pdf.

Jaadi, Zakaria. “A Step-by-Step Explanation of Principal Component Analysis (PCA).” Built-In, 29 Mar. 2023, builtin.com/data-science/step-step-explanation-principal-component-analysis.

Staff. “What Is a Decision Tree.” IBM, www.ibm.com/topics/decision-trees#:~:text=A%20decision%20tree%20is%20a,internal%20nodes%20and%20leaf%20nodes. Accessed 17 Feb. 2024.

Staff. “What Is Random Forest?” IBM, www.ibm.com/topics/random-forest. Accessed 17 Feb. 2024.

Staff. “What Is Logistic Regression?” IBM, www.ibm.com/topics/logistic-regression. Accessed 17 Feb. 2024.

Staff. “What Is a Confusion Matrix in Machine Learning?” Simplilearn.Com, Simplilearn, 16 Feb. 2023, www.simplilearn.com/tutorials/machine-learning-tutorial/confusion-matrix-machine-learning.

Published

05-31-2024

How to Cite

Mohanty, Y., & Lee, J. (2024). A Machine Learning Approach to Predict the Occurrence of Forest Fires with Meteorological Parameters. Journal of Student Research, 13(2). https://doi.org/10.47611/jsrhs.v13i2.6620

Issue

Section

HS Research Projects