Learn practical skills, build real-world projects, and advance your career

Dataset Information

The data scientists at BigMart have collected 2013 sales data for 1559 products across 10 stores in different cities. Also, certain attributes of each product and store have been defined. The aim is to build a predictive model and find out the sales of each product at a particular store.

Using this model, BigMart will try to understand the properties of products and stores which play a key role in increasing sales.

VariableDescription
Item_IdentifierUnique product ID
Item_WeightWeight of product
Item_Fat_ContentWhether the product is low fat or not
Item_VisibilityThe % of total display area of all products in a store allocated to the particular product
Item_TypeThe category to which the product belongs
Item_MRPMaximum Retail Price (list price) of the product
Outlet_IdentifierUnique store ID
Outlet_Establishment_YearThe year in which store was established
Outlet_SizeThe size of the store in terms of ground area covered
Outlet_Location_TypeThe type of city in which the store is located
Outlet_TypeWhether the outlet is just a grocery store or some sort of supermarket
Item_Outlet_SalesSales of the product in the particulat store. This is the outcome variable to be predicted.

Import modules

import pandas as pd
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
import warnings
%matplotlib inline
warnings.filterwarnings('ignore')

Loading the dataset