Web Crawling Tripadvisor to Develop a Restaurant Recommendation System for Los Angeles

Authors

  • Jimin Han QSI International School of Shenzhen
  • Sergei Ievlev QSI International School of Shenzhen

DOI:

https://doi.org/10.47611/jsrhs.v13i1.6289

Keywords:

Web Crawling, Recommendation System, Natural Language Processing

Abstract

The widespread use of online review websites has revolutionized how consumers choose restaurants, particularly in popular tourist destinations like Los Angeles, where a vast range of dining options is readily available. However, the sheer abundance of similar cuisine offerings can be overwhelming. To address this challenge, this study used Python Selenium to web crawl Tripadvisor for gathering data about Los Angeles restaurants. Relevant information from user reviews was extracted and analyzed utilizing natural language processing techniques to classify restaurants based on cuisine, price, and customer reviews and ratings. This classification allowed for the identification of distinct dining preferences, providing insights into restaurant selection in tourist-heavy areas. With the application of cosine similarity, the analysis further led to the development of a recommendation system specific to consumers’ needs and preferences. This study thus offers a new approach to improving restaurant discovery and decision-making in busy urban centers in Los Angeles.

Downloads

Download data is not yet available.

References or Bibliography

Gase, L. N., Green, G., Montes, C., & Kuo, T. (2019). Understanding the Density and Distribution of Restaurants in Los Angeles County to Inform Local Public Health Practice. Preventing Chronic Disease, 16. https://doi.org/10.5888/pcd16.180278

Li, H. (2023, May 4). L.A. tourists are (mostly) back — except some big spenders - Los Angeles Times. Los Angeles Times. https://www.latimes.com/business/story/2023-05-04/la-fi-tourism-mostly-back-except-biggest-spenders

Mahajan, K., Joshi, V., Khedkar, M., Galani, J., & Kulkarni, M. (2021). Restaurant Recommendation System using Machine Learning. International Journal of Advanced Trends in Computer Science and Engineering, 10(3), 1671–1675. https://doi.org/10.30534/ijatcse/2021/261032021

Mayorquin, J. (2021, March 23). California Dreaming: Cultural diversity - one of the Golden State’s greatest assets. ABC7 Los Angeles. https://abc7.com/california-dream-solutions-culture/10327371/

Most visited travel and tourism websites worldwide 2023 | Statista. (2023, October 17). Statista. https://www.statista.com/statistics/1215457/most-visited-travel-and-tourism-websites-worldwide/

Munaji, A. A., & Emanuel, A. W. R. (2021). Restaurant Recommendation System Based on User Ratings with Collaborative Filtering. IOP Conference Series: Materials Science and Engineering, 1077(1), 012026. https://doi.org/10.1088/1757-899x/1077/1/012026

Published

02-28-2024

How to Cite

Han, J., & Ievlev, S. (2024). Web Crawling Tripadvisor to Develop a Restaurant Recommendation System for Los Angeles. Journal of Student Research, 13(1). https://doi.org/10.47611/jsrhs.v13i1.6289

Issue

Section

HS Research Articles