
An Exploratory Data Analysis on e-Commerce events

This dataset contains 900K detailed user events from an electronics store's e-Commerce website

Image credits - rupixen.com from Unsplash

About the data

In this analysis we will use the "e-Commerce events history in electronics store" data by Michael Kechinov on Kaggle from the REES46 Marketing Platform.

The Kaggle dataset is a .csv file with 900K rows and 9 columns.

Each row in the file represents an event. All events are related to products and users, and each event is a many-to-many relation between products and users.

Below are descriptions of the 9 columns:

  1. event_time: Time when the event occurred (in UTC) in the format "2019-10-01 00:00:04+00:00"
  2. event_type: A single event either "view", "cart" or "purchase"
  3. product_id: A unique code for each product
  4. category_id: A unique code for each product category
  5. category_code: A '.' separated text with names of categories and subcategories "computers.network.router"
  6. brand: Brand name of the product "sony"
  7. price: Price of the product in decimals "31.90"
  8. user_id: A permanent unique ID per shopper "1515915625519388267"
  9. user_session: A temporary ID per session "LJuJVLEjPT"

Events in the column event_type can be:

  1. view - a user viewed a product
  2. cart - a user added a product to shopping cart
  3. remove_from_cart - a user removed a product from shopping cart
  4. purchase - a user purchased a product
What does each column tell us?

At time event_time and during session user_session, user user_id purchased (event_type is "purchase") a product product_id of brand brand in category category_code of price price.

Assumptions and notes for this analysis:

  • Each unique session is a visit
  • There are no remove_from_cart events in this dataset
  • A session may have just one purchase event and no related view or cart events.
  • A session can have multiple purchase events
  • category_code is usually present for meaningful categories and skipped for the rest.
  • Price is assumed to be in US Dollars

8 Questions to be answered

  1. How are different event types distributed?
  2. What is the Cart Abandonment Rate?
  3. How many buyers are making repeat purchases?
  4. How is the Price spread across different event types?
  5. How is the Price distributed within each event type?
  6. How is the Price spread across days and hours for each event type?
  7. Which are the most purchased brands, and which brands are most frequently added to the cart?
  8. What is the share of the top 10 purchased brands by price?

Installing the opendatasets library, which helps download datasets from Kaggle using a Kaggle user ID and API key

In [1]:
# !pip install opendatasets --upgrade --quiet

Importing opendatasets with alias od

In [2]:
# import opendatasets as od
In [3]:
# ecomm_data_url = 'https://www.kaggle.com/mkechinov/ecommerce-events-history-in-electronics-store'

Downloading the data file from Kaggle using the .download method

In [4]:
# od.download(ecomm_data_url)

Checking the size of the downloaded file

In [5]:
# !ls -lh ecommerce-events-history-in-electronics-store
In [6]:
# ecom_data_loc = ('ecommerce-events-history-in-electronics-store/events.csv')

Importing Pandas library

In [2]:
import pandas as pd
In [2]:
# %%time
# ecom_df = pd.read_csv(ecom_data_loc)
In [3]:
# ecom_df.head()

Saving the dataframe in feather format

As CSVs are plain text files with no structure, they often take longer to read than binary formats like feather and parquet, which preserve the tabular structure of the data.
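The speed difference between text and binary formats can be illustrated with a quick timing sketch. Stdlib pickle stands in for feather/parquet here so the snippet runs without pyarrow; the real formats give similar gains for the same reason (no text parsing):

```python
import io
import pickle
import time

import numpy as np
import pandas as pd

# A moderately sized numeric frame
df = pd.DataFrame(np.random.rand(200_000, 4), columns=list("abcd"))

# Time a CSV read (text parsing)
csv_text = df.to_csv(index=False)
t0 = time.perf_counter()
pd.read_csv(io.StringIO(csv_text))
csv_time = time.perf_counter() - t0

# Time a binary read (pickle stands in for feather/parquet)
blob = pickle.dumps(df)
t0 = time.perf_counter()
pickle.loads(blob)
bin_time = time.perf_counter() - t0
```

On numeric data like this, the binary round-trip is typically an order of magnitude faster than parsing CSV text.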

In [4]:
!pip install pyarrow --upgrade --quiet
In [5]:
# %%time
# ecom_df.to_feather('ecom_events_data.feather')
Reading the feather file to restore the dataframe created in the previous session

This will save time by skipping the steps of downloading the dataset from Kaggle and parsing the CSV into a dataframe.

In [8]:
ecom_df_feather = pd.read_feather('ecom_events_data.feather')
ecom_df_feather.head()
Out[8]:
In [9]:
type(ecom_df_feather)
Out[9]:
pandas.core.frame.DataFrame
In [10]:
ecom_df1 = ecom_df_feather.copy()
In [12]:
ecom_df1.shape
Out[12]:
(885129, 9)

There are 885,129 records and 9 columns

In [13]:
ecom_df1.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 885129 entries, 0 to 885128
Data columns (total 9 columns):
 #   Column         Non-Null Count   Dtype  
---  ------         --------------   -----  
 0   event_time     885129 non-null  object 
 1   event_type     885129 non-null  object 
 2   product_id     885129 non-null  int64  
 3   category_id    885129 non-null  int64  
 4   category_code  648910 non-null  object 
 5   brand          672765 non-null  object 
 6   price          885129 non-null  float64
 7   user_id        885129 non-null  int64  
 8   user_session   884964 non-null  object 
dtypes: float64(1), int64(3), object(5)
memory usage: 60.8+ MB
In [14]:
ecom_df1.event_time.head()
Out[14]:
0    2020-09-24 11:57:06 UTC
1    2020-09-24 11:57:26 UTC
2    2020-09-24 11:57:27 UTC
3    2020-09-24 11:57:33 UTC
4    2020-09-24 11:57:36 UTC
Name: event_time, dtype: object

Converting the event_time column, which is of object dtype, to datetime format

In [15]:
ecom_df1.event_time = ecom_df1.event_time.str.replace(' UTC','')

ecom_df1.event_time
Out[15]:
0         2020-09-24 11:57:06
1         2020-09-24 11:57:26
2         2020-09-24 11:57:27
3         2020-09-24 11:57:33
4         2020-09-24 11:57:36
                 ...         
885124    2021-02-28 23:55:01
885125    2021-02-28 23:58:05
885126    2021-02-28 23:58:09
885127    2021-02-28 23:58:14
885128    2021-02-28 23:59:09
Name: event_time, Length: 885129, dtype: object
In [16]:
ecom_df1.event_time = ecom_df1.event_time.astype('datetime64[ns]')

ecom_df1.event_time
# ecom_df1.event_time.dt.dayofweek
Out[16]:
0        2020-09-24 11:57:06
1        2020-09-24 11:57:26
2        2020-09-24 11:57:27
3        2020-09-24 11:57:33
4        2020-09-24 11:57:36
                 ...        
885124   2021-02-28 23:55:01
885125   2021-02-28 23:58:05
885126   2021-02-28 23:58:09
885127   2021-02-28 23:58:14
885128   2021-02-28 23:59:09
Name: event_time, Length: 885129, dtype: datetime64[ns]
In [17]:
ecom_df1.event_time.dt.dayofweek
Out[17]:
0         3
1         3
2         3
3         3
4         3
         ..
885124    6
885125    6
885126    6
885127    6
885128    6
Name: event_time, Length: 885129, dtype: int64
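As a side note, pd.to_datetime can do the replace-and-cast in one step, since it understands the " UTC" suffix directly. A minimal sketch on hypothetical sample strings in the dataset's raw format:

```python
import pandas as pd

# Hypothetical samples in the raw "YYYY-MM-DD HH:MM:SS UTC" format
raw = pd.Series(["2020-09-24 11:57:06 UTC", "2021-02-28 23:59:09 UTC"])

# utc=True parses the timezone suffix; tz_localize(None) then drops it
# to match the naive datetime64[ns] column produced above.
ts = pd.to_datetime(raw, utc=True).dt.tz_localize(None)
```

This avoids the intermediate str.replace pass over the whole column.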
In [18]:
ecom_df1.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 885129 entries, 0 to 885128
Data columns (total 9 columns):
 #   Column         Non-Null Count   Dtype         
---  ------         --------------   -----         
 0   event_time     885129 non-null  datetime64[ns]
 1   event_type     885129 non-null  object        
 2   product_id     885129 non-null  int64         
 3   category_id    885129 non-null  int64         
 4   category_code  648910 non-null  object        
 5   brand          672765 non-null  object        
 6   price          885129 non-null  float64       
 7   user_id        885129 non-null  int64         
 8   user_session   884964 non-null  object        
dtypes: datetime64[ns](1), float64(1), int64(3), object(4)
memory usage: 60.8+ MB
In [19]:
ecom_df1.category_code.nunique()
Out[19]:
107
In [20]:
ecom_df1.brand.nunique()
Out[20]:
999
In [21]:
ecom_df1.category_code.head()
Out[21]:
0            electronics.telephone
1      computers.components.cooler
2                             None
3    computers.peripherals.printer
4                             None
Name: category_code, dtype: object
In [22]:
ecom_df1['category_code'] = ecom_df1['category_code'].fillna('No Category')

ecom_df1['brand'] = ecom_df1['brand'].fillna('No Brand')
In [24]:
ecom_df1.isnull().sum()
Out[24]:
event_time         0
event_type         0
product_id         0
category_id        0
category_code      0
brand              0
price              0
user_id            0
user_session     165
dtype: int64

There are 165 NA values in user_session. Dropping them.

In [25]:
ecom_df1.dropna(inplace=True)

Checking the NA/NULL values using .isnull() and summing them per column using .sum()

In [26]:
ecom_df1.isnull().sum()
Out[26]:
event_time       0
event_type       0
product_id       0
category_id      0
category_code    0
brand            0
price            0
user_id          0
user_session     0
dtype: int64
In [27]:
len(ecom_df1)
Out[27]:
884964
In [28]:
type(ecom_df1.event_time[0])
Out[28]:
pandas._libs.tslibs.timestamps.Timestamp

Question 1. How are different event types distributed?

In [29]:
ecom_df1.event_type.unique()
Out[29]:
array(['view', 'cart', 'purchase'], dtype=object)
In [30]:
y1 = ecom_df1.groupby('event_type',as_index=False).size()
y1
Out[30]:
In [31]:
ecom_df1['event_type'].value_counts()
Out[31]:
view        793589
cart         54029
purchase     37346
Name: event_type, dtype: int64
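The percentage distribution computed below can also be obtained in one step with value_counts(normalize=True). A sketch on a toy event series (hypothetical values, not the real data):

```python
import pandas as pd

# Toy event series standing in for ecom_df1['event_type']
events = pd.Series(["view", "view", "view", "cart", "purchase", "view"])

# normalize=True returns fractions; scale and round for percentages
dist = (events.value_counts(normalize=True) * 100).round(1)
```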

Here y1['size'] needs to be used instead of y1.size, since .size is a built-in DataFrame attribute (the total number of elements) that shadows the column name.

In [32]:
y1['Dist_event_type'] = (y1['size']/y1['size'].sum()*100).round(1)
y1
Out[32]:
In [33]:
y_event_type1= y1.event_type.str.upper()
y_event_type1
Out[33]:
0        CART
1    PURCHASE
2        VIEW
Name: event_type, dtype: object
In [34]:
y2 = ecom_df1.groupby('event_type', as_index=False).sum(numeric_only=True)

y2['Dist_price'] = (y2.price/y2.price.sum()*100).round(1)
y2
Out[34]:
In [35]:
q1_df = y2.merge(y1, on = 'event_type')
q1_df
Out[35]:
In [38]:
!pip install matplotlib seaborn plotly --upgrade --quiet

import matplotlib.pyplot as plt
import seaborn as sns
import plotly.express as px
%matplotlib inline
In [32]:
sns.set_style("darkgrid")
In [33]:
import matplotlib
In [34]:
matplotlib.rcParams['font.size'] = 14
matplotlib.rcParams['figure.figsize'] = (12, 6)
matplotlib.rcParams['figure.facecolor'] = '#00000000'
In [35]:
q1_df 
Out[35]:
In [36]:
import numpy as np

# x-axis positions for the event_type bars
x1_axis = np.arange(len(q1_df.event_type))


plt.bar(x1_axis+0.2, q1_df.Dist_event_type, 0.4, label = 'Distribution of Event Type')
plt.bar(x1_axis-0.2, q1_df.Dist_price, 0.4, label = 'Distribution of Price')

for i in range(len(q1_df.event_type)):
    plt.text(i+0.2, q1_df.Dist_event_type[i], q1_df.Dist_event_type[i], ha='center', va ='bottom')
    plt.text(i-0.2, q1_df.Dist_price[i], q1_df.Dist_price[i], ha ='center', va='bottom')

plt.xticks(x1_axis, q1_df.event_type.str.upper())

plt.xlabel("Event Type")
plt.ylabel("Percentage")

plt.ylim([0,110])

plt.legend(loc='upper left')

plt.show()
Notebook Image

As we can see, view events dominate overwhelmingly, accounting for roughly 90% of both the event count and the total price, while cart and purchase events make up only a few percent each.

Question 2. What is the Cart Abandonment Rate?

In [37]:
ecom_df1.event_type.unique()
Out[37]:
array(['view', 'cart', 'purchase'], dtype=object)
In [38]:
q2_series = ecom_df1.groupby('event_type',as_index=True).size()
q2_series
Out[38]:
event_type
cart         54029
purchase     37346
view        793589
dtype: int64
In [39]:
initiated_transactions = q2_series.loc['cart'] + q2_series.loc['purchase']

completed_transactions = q2_series.loc['purchase']


cart_abandonment_rate = (1 - (completed_transactions / initiated_transactions)) * 100

print('The Cart Abandonment Rate is {0:.1f}%'.format(cart_abandonment_rate))
The Cart Abandonment Rate is 59.1%
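The calculation above can be packaged as a small reusable helper. This is a sketch of the same definition used in this notebook (abandonment = share of initiated transactions, cart + purchase, that never completed):

```python
def cart_abandonment_rate(carts: int, purchases: int) -> float:
    """Share of initiated transactions (cart + purchase) that never completed."""
    initiated = carts + purchases
    return (1 - purchases / initiated) * 100

# Counts taken from the groupby output above
rate = cart_abandonment_rate(54029, 37346)
print('The Cart Abandonment Rate is {0:.1f}%'.format(rate))  # 59.1%
```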

Question 3. How many buyers are making repeat purchases ?

Filtering the events where event_type is purchase

In [40]:
ecom_df1_pur = ecom_df1[ecom_df1['event_type'] == 'purchase']
ecom_df1_pur.head(2)
Out[40]:
In [41]:
# ecom_df1_user_id_pur = ecom_df1[(ecom_df1['event_type'] == 'purchase')][['event_type', 'price']]

user_id_pur = ecom_df1_pur[['event_type','price','user_id']].copy()
user_id_pur.head(2)
Out[41]:
In [42]:
user_id_pur.user_id.duplicated().sum()
Out[42]:
16042
In [43]:
user_id_pur.user_id.nunique()
Out[43]:
21304
In [44]:
user_id_pur.user_id.count()
Out[44]:
37346
In [45]:
user_id_pur['freq'] = user_id_pur['user_id'].map(user_id_pur['user_id'].value_counts())
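The map/value_counts round-trip can also be written with groupby + transform, which is the more idiomatic pandas pattern. A sketch on toy user ids (hypothetical values):

```python
import pandas as pd

# Toy purchase events standing in for user_id_pur
df = pd.DataFrame({"user_id": [1, 1, 2, 3, 3, 3]})

# transform("size") broadcasts each user's purchase count back to every row
df["freq"] = df.groupby("user_id")["user_id"].transform("size")
```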
In [46]:
df2 = user_id_pur.groupby(user_id_pur['freq'] == 1).sum(numeric_only=True)
df2
Out[46]:
In [47]:
repeated_price = df2.loc[False,'price']

trail_price = df2.loc[True, 'price']

print(repeated_price,trail_price)
3269825.170000042 1855570.450000015
In [48]:
trail_buyers = df2.loc[True, 'freq']
trail_buyers
Out[48]:
13598
In [49]:
unique_buyers = user_id_pur.user_id.nunique()
unique_buyers
Out[49]:
21304
In [50]:
repeated_buyers = unique_buyers - trail_buyers
repeated_buyers
Out[50]:
7706
In [51]:
user_id_pur.user_id[user_id_pur['user_id'].duplicated(keep=False)].nunique()
Out[51]:
7706
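The trial/repeat split can also be derived directly from value_counts, without building a freq column. A sketch on toy user ids (hypothetical values):

```python
import pandas as pd

# Toy user_ids of purchase events
purchases = pd.Series([1, 1, 2, 3, 3, 3])

counts = purchases.value_counts()
repeat_buyers = (counts > 1).sum()   # users with more than one purchase
trial_buyers = (counts == 1).sum()   # one-time buyers
```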
In [52]:
fig, axes = plt.subplots(1,2,figsize=(16,8))


data1 = [trail_buyers, repeated_buyers]
label1 = 'Trial Buyers', 'Repeated Buyers'
explode1 = (0, 0.1)


axes[0].set_title('Trial and Repeated Buyers')
axes[0].pie(data1, explode=explode1, labels=label1, autopct='%1.1f%%',
        shadow=True, startangle=45)


data2 = [trail_price, repeated_price]
label2 = 'Trial Price', 'Repeated Price'
explode2 = (0, 0.1)


axes[1].set_title('Trial and Repeated Sales')
axes[1].pie(data2, explode=explode2, labels=label2, autopct='%1.1f%%',
        shadow=True, startangle=90);

Notebook Image

Question 4. How is the Price spread across different event types?

In [53]:
ecom_df1.head(2)
Out[53]:
In [54]:
ecom_df1.price.describe()
Out[54]:
count    884964.000000
mean        146.320109
std         296.793600
min           0.220000
25%          26.460000
50%          65.710000
75%         190.490000
max       64771.060000
Name: price, dtype: float64

The standard deviation is much larger than the mean; let's check how price is distributed.

In [55]:
plt.figure(figsize=(16,2))
plt.boxplot(x=ecom_df1.price,vert= False);
plt.title('Distribution of Price');
Notebook Image

Let's see the distribution across the different event types

In [56]:
display(ecom_df1.price[ecom_df1['event_type']=='view'].describe())

display(ecom_df1.price[ecom_df1['event_type']=='cart'].describe()),

display(ecom_df1.price[ecom_df1['event_type']=='purchase'].describe())
count    793589.000000
mean        145.840013
std         307.234051
min           0.220000
25%          26.190000
50%          64.920000
75%         186.760000
max       64771.060000
Name: price, dtype: float64
count    54029.000000
mean       159.647668
std        190.124995
min          0.220000
25%         30.160000
50%         82.970000
75%        226.840000
max       4071.730000
Name: price, dtype: float64
count    37346.000000
mean       137.240819
std        169.934694
min          0.220000
25%         26.220000
50%         64.480000
75%        202.670000
max       3717.650000
Name: price, dtype: float64

The standard deviation is noticeably lower for the cart and purchase events than for views.
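The three separate describe() calls can be collapsed into one groupby. A sketch on a toy frame (hypothetical prices, not the real data):

```python
import pandas as pd

# Toy events standing in for ecom_df1
df = pd.DataFrame({
    "event_type": ["view", "view", "cart", "purchase"],
    "price": [10.0, 20.0, 30.0, 40.0],
})

# One row of summary statistics per event type
stats = df.groupby("event_type")["price"].describe()
```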

Question 5. How is the Price distributed within each event type?

In [57]:
sns.boxplot(x='event_type', y='price', data=ecom_df1);
Notebook Image

A single shared scale doesn't work well for all the event types; let's use an individual scale for each event type to get a better view

In [58]:
y1
Out[58]:
In [59]:
view_price = ecom_df1.price[ecom_df1['event_type']=='view']
cart_price = ecom_df1.price[ecom_df1['event_type']=='cart']
purchase_price = ecom_df1.price[ecom_df1['event_type']=='purchase']
In [60]:
fig, axes = plt.subplots(3, 1, figsize=(16,10))


axes[0].set_title('Distribution of price for the event View')
sns.boxplot(x=view_price, ax=axes[0])

axes[1].set_title('Distribution of price for the event Cart')
sns.boxplot(x=cart_price, ax=axes[1])

axes[2].set_title('Distribution of price for the event Purchase')
sns.boxplot(x=purchase_price, ax=axes[2])

fig.tight_layout(pad=3.0)
Notebook Image

Question 6. How is the Price spread across days and hours for the View, Cart and Purchase event types?

In [61]:
ecom_df1.head(2)
Out[61]:

Filtering the data by event type

In [62]:
view_day_time = ecom_df1[ecom_df1['event_type']=='view'].copy()
view_day_time.head(2)
Out[62]:
In [63]:
view_day_time['day_of_the_week'] = view_day_time['event_time'].dt.day_name()
view_day_time['time'] = view_day_time['event_time'].dt.hour
view_day_time.head(2)
Out[63]:
In [64]:
view_day_time_df = view_day_time[['day_of_the_week','time','price']]
view_day_time_df.head()
Out[64]:
In [65]:
view_day_time_df.price.sum()
Out[65]:
115737029.73000005
In [66]:
view_pivot = view_day_time_df.pivot_table(values = 'price', 
                                         columns ='time', 
                                         index = 'day_of_the_week',
                                         aggfunc= sum)
view_pivot.shape
Out[66]:
(7, 24)
In [67]:
view_pivot.sum().sum()
Out[67]:
115737029.7299994
In [68]:
view_pivot
Out[68]:
In [69]:
view_reindex = view_pivot.reindex(['Monday', 'Tuesday', 'Wednesday', 'Thursday', 'Friday', 'Saturday', 'Sunday'])
view_reindex
Out[69]:
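The pivot/reindex step used above can be illustrated on a toy frame (hypothetical values): pivot_table sums price into one cell per (day, hour), and reindex restores the natural weekday order for the heatmap.

```python
import pandas as pd

# Toy events standing in for view_day_time_df
df = pd.DataFrame({
    "day_of_the_week": ["Monday", "Monday", "Sunday"],
    "time": [9, 9, 14],
    "price": [10.0, 5.0, 2.0],
})

# Sum price per (day, hour) cell, then fix the row order
pivot = df.pivot_table(values="price", index="day_of_the_week",
                       columns="time", aggfunc="sum")
pivot = pivot.reindex(["Monday", "Sunday"])
```

Cells with no events come out as NaN, which the heatmaps render as blank.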

Similarly creating the pivot tables for Purchase and Cart

In [70]:
purchase_day_time = ecom_df1[ecom_df1['event_type']=='purchase'].copy()

purchase_day_time['day_of_the_week'] = purchase_day_time['event_time'].dt.day_name()
purchase_day_time['time'] = purchase_day_time['event_time'].dt.hour

purchase_day_time_df = purchase_day_time[['day_of_the_week','time','price']]

purchase_pivot = purchase_day_time_df.pivot_table(values = 'price', 
                                         columns ='time', 
                                         index = 'day_of_the_week',
                                         aggfunc= sum)

purchase_reindex = purchase_pivot.reindex(['Monday', 'Tuesday', 'Wednesday', 'Thursday', 'Friday', 'Saturday', 'Sunday'])

purchase_reindex

Out[70]:
In [71]:
cart_day_time = ecom_df1[ecom_df1['event_type']=='cart'].copy()

cart_day_time['day_of_the_week'] = cart_day_time['event_time'].dt.day_name()
cart_day_time['time'] = cart_day_time['event_time'].dt.hour

cart_day_time_df = cart_day_time[['day_of_the_week','time','price']]

cart_pivot = cart_day_time_df.pivot_table(values = 'price', 
                                         columns ='time', 
                                         index = 'day_of_the_week',
                                         aggfunc= sum)

cart_reindex = cart_pivot.reindex(['Monday', 'Tuesday', 'Wednesday', 'Thursday', 'Friday', 'Saturday', 'Sunday'])

cart_reindex

Out[71]:
In [72]:
plt.figure(figsize = (18,6))
plt.title('Viewers')

sns.heatmap(view_reindex)
Out[72]:
<AxesSubplot:title={'center':'Viewers'}, xlabel='time', ylabel='day_of_the_week'>
Notebook Image
In [73]:
plt.figure(figsize = (18,6))
plt.title('Cart')

sns.heatmap(cart_reindex,cmap='Blues')
Out[73]:
<AxesSubplot:title={'center':'Cart'}, xlabel='time', ylabel='day_of_the_week'>
Notebook Image
In [74]:
plt.figure(figsize = (18,6))
plt.title('Purchase')

sns.heatmap(purchase_reindex,cmap='Oranges')
Out[74]:
<AxesSubplot:title={'center':'Purchase'}, xlabel='time', ylabel='day_of_the_week'>
Notebook Image

Question 7. Which are the most purchased brands, and which brands are most frequently added to the cart?

In [75]:
ecom_df1.head(2)
Out[75]:
In [76]:
!pip install wordcloud --upgrade --quiet

from wordcloud import WordCloud

Brands that were added to the cart

In [77]:
ecom_df1_brand_cart = ecom_df1.brand[(ecom_df1['brand'] != 'No Brand') & (ecom_df1['event_type'] == 'cart')].str.upper()

text = ecom_df1_brand_cart.value_counts().to_dict()

wordcloud = WordCloud(background_color="white").generate_from_frequencies(text)
# Generate plot
plt.imshow(wordcloud)
plt.title('Brands most frequently added to the cart')
plt.axis("off")
plt.show()