Analysing 1 Million European Restaurants from Tripadvisor



Exploratory Data Analysis is an approach of analyzing data sets to summarize their main characteristics, often using statistical graphics and other data visualization methods.Read more

alt
alt

Tripadvisor, Inc. is an American online travel company that operates a website and mobile app with user-generated content and a comparison shopping website. It also offers online hotel reservations and bookings for transportation, lodging, travel experiences, and restaurants.

Tripadvisor helps hundreds of millions of people each month become better travelers, from planning to booking to taking a trip. Travelers across the globe use the Tripadvisor site and app to discover where to stay, what to do and where to eat based on guidance from those who have been there before. Travelers turn to Tripadvisor to find deals on accommodations, book experiences, reserve tables at delicious restaurants and discover great places nearby. As a travel guidance company available in 43 markets and 22 languages, Tripadvisor makes planning easy no matter the trip type. Know more


About Data :
The TripAdvisor dataset includes 1,083,397 restaurants with attributes such as location data, average rating, number of reviews, open hours, cuisine types, awards, etc. total of 42 attributes.

The dataset combines the restaurants from the main European countries.
This data set has been downloaded from Kaggle.


Kaggle allows users to find and publish data sets, explore and build models in a web-based data-science environment, work with other data scientists and machine learning engineers, and enter competitions to solve data science challenges.


Purpose

  • We are using this datasset to provide further information in regards to the restaurants details that make them successful and appreciated by the users, with the possibility to compare the common features of different European countries regarding the average ratings, awards, open hours, reviews count, etc.

  • This information can also be useful to improve the services of the Restaurants and also help if one wants to set up a new Restaurant.

How to Run the Code

You can execute the code by using the "Run" button at the top of this page and selecting "Run on Colab". You can make changes and save your own version of the notebook to Jovian by executing the following cells:

jovian.commit()
[jovian] Detected Colab notebook... [jovian] Uploading colab notebook to Jovian... Committed successfully! https://jovian.ai/gouravitandel1945/eda-european-restaurants

Here’s a step by step outline of this project:

  1. Download the data using opendatasets library from Kaggle.
  2. Perform below actions on the Data.
  • Load the file using pandas.
  • Look at some information about the file.
  • Fix any missing or incorrect values.
  1. Analyse below columns to meangingful get insights out of it.
  • City
  • Top Tags
  • Claimed
  • gluten free
  • terrible
  1. Answer below questions to generate insights from the dataset.
  • Which countries have the maximum number of high rated restaurants?
  • Which restaurants have won the maximum of awards and which countries?
  • Are the popular restaurants affordable?
  • Are there any restaurants that are highly rated in one category and poorly rated in another?
  • Which cusines/dish are commonly are served?
  • Which are the restaurants that one should go if they are looking for veg, vegan and glutan free food ?
  • Show the restaurants for a country based on the popularity score.
  1. Summarise the project and derive conclusion out of it.

Download the Data

In this section, we will download the data using opendatasets library