Preprint / Version 1

Using Machine Learning to Predict Penguin Species

##article.authors##

  • Alex Hua Acton-Boxborough Regional High School

Keywords:

Computer Science, Machine Learning, Cross Entropy Error, Neural Networks, Penguins, Multi Classification

Abstract

Machine learning is used to make a neural network to predict penguin species from physical attributes. The data set is first prepared, with missing and extraneous data deleted. After this, there remains 333 examples consisting of three different penguin species. These remaining examples are fed into a neural network which proceeds to train on them. The neural network consists of an input and output layer with no hidden layers and uses the softmax activation formula. 25% of the data set is used as a validation set with the remaining 75% used as the training set.

References or Bibliography

Müller, A., & Guido, S. (2016). Introduction to Machine Learning with Python: A guide for data scientists. O'Reilly Media

Géron, A. (2019). Hands-On Machine Learning with Scikit-Learn, Keras, and TensorFlow: Concepts, tools, and techniques to build intelligent systems. O’Reilly Media

Bishop, C. M. (2006). Pattern Recognition and Machine Learning (Information Science and Statistics). Springer Publishing

Downloads

Posted

09-16-2022