Analysis on Sales Data Using Hadoop and MapReduce Algorithm
Keywords:
MapReduce, Sales statistics, Big Data Analysis, Decentralized handling, Aligned management, Hadoop System, Trends and PatternsAbstract
For this project, we developed an extremely flexible and efficient MapReduce method to analyze a big sales dataset. The program's objectives were to find interesting correlations plus tendencies within the sales statistics and provide a strong substitute for in-depth analysis of the data. We created Map and Reduce algorithms that processed the information in a networked & simultaneous manner using the MapReduce framework's distributed processing capabilities. We extended the application to efficiently process enormous datasets by deploying it on the Hadoop platform. The results of the investigation demonstrate the MapReduce framework's effectiveness at managing enormous volumes of information through a dispersed, simultaneous way. By employing the networked computing features of the framework, we were able to swiftly and efficiently analyze the sales data and get meaningful insights about it. The program's great scalability allowed us to quickly and effectively examine large amounts of data, making it a powerful tool enabling big information research.
Downloads
Metrics
References or Bibliography
What Is Big Data?. (2022). Retrieved 11 December 2022, from https://www.oracle.com/big-data/what-is-big-data/
Oracle. (n.d.). What is a JAR file? Retrieved January 3, 2023, from https://www.oracle.com/java/technologies/what-is-a-jar-file.html
Mapper class. (n.d.). In Hadoop MapReduce Tutorial. Retrieved January 3, 2023, from https://hadoopmapreducetutorial.com/mapper-class/
Reducer class. (n.d.). In Hadoop MapReduce Tutorial. Retrieved January 3, 2023, from https://hadoopmapreducetutorial.com/reducer-class/
Driver class. (n.d.). In Hadoop MapReduce Tutorial. Retrieved January 3, 2023, from https://hadoopmapreducetutorial.com/driver-class/
Software Testing | Basics - GeeksforGeeks. (2017). Retrieved 11 December 2022, from https://www.geeksforgeeks.org/software-testing-basics/
O'Reilly Media. (n.d.). What is Apache Hadoop? Retrieved January 3, 2023, from https://www.oreilly.com/library/view/hadoop-the-definitive/9781449327891/ch01.html
Published
How to Cite
Issue
Section
Copyright (c) 2023 Abdul Malik Al Jabri, Ibraheem Sayeed; Puttaswamy Malali Rajegowda, Jitendra Pandey
This work is licensed under a Creative Commons Attribution-NonCommercial-ShareAlike 4.0 International License.
Copyright holder(s) granted JSR a perpetual, non-exclusive license to distriute & display this article.