Learn practical skills, build real-world projects, and advance your career

Exploratory Data Analysis of Particulate Matter 2.5 in Beizing


Particulate Matter distribution in the 5 cities of China
The task of this project is to analyse the distribution of particulate matter PM 2.5 in Beijing. The dataset is borrowed from UCI machine learning repository Link. The PM measurements were carried out in 4 checkpoints US Embassy, Nongzhanguan, Dongsihuan, and Dongsi. All the measuring points except US POST were done by China’s Ministry of Environmental Protection (MEP).

The data set contains 52584 rows and 18 columns.

In the map below you can find how distant these measuring points were located.

PM_Beizing.png

from google.colab import drive
drive.mount('/content/drive')
Mounted at /content/drive
!pip install jovian --upgrade -q;
#!pip install numpy
#!pip install pandas
!pip install pandas-profiling;
Requirement already satisfied: pandas-profiling in /usr/local/lib/python3.6/dist-packages (1.4.1) Requirement already satisfied: six>=1.9 in /usr/local/lib/python3.6/dist-packages (from pandas-profiling) (1.15.0) Requirement already satisfied: pandas>=0.19 in /usr/local/lib/python3.6/dist-packages (from pandas-profiling) (1.1.2) Requirement already satisfied: jinja2>=2.8 in /usr/local/lib/python3.6/dist-packages (from pandas-profiling) (2.11.2) Requirement already satisfied: matplotlib>=1.4 in /usr/local/lib/python3.6/dist-packages (from pandas-profiling) (3.2.2) Requirement already satisfied: numpy>=1.15.4 in /usr/local/lib/python3.6/dist-packages (from pandas>=0.19->pandas-profiling) (1.18.5) Requirement already satisfied: python-dateutil>=2.7.3 in /usr/local/lib/python3.6/dist-packages (from pandas>=0.19->pandas-profiling) (2.8.1) Requirement already satisfied: pytz>=2017.2 in /usr/local/lib/python3.6/dist-packages (from pandas>=0.19->pandas-profiling) (2018.9) Requirement already satisfied: MarkupSafe>=0.23 in /usr/local/lib/python3.6/dist-packages (from jinja2>=2.8->pandas-profiling) (1.1.1) Requirement already satisfied: pyparsing!=2.0.4,!=2.1.2,!=2.1.6,>=2.0.1 in /usr/local/lib/python3.6/dist-packages (from matplotlib>=1.4->pandas-profiling) (2.4.7) Requirement already satisfied: cycler>=0.10 in /usr/local/lib/python3.6/dist-packages (from matplotlib>=1.4->pandas-profiling) (0.10.0) Requirement already satisfied: kiwisolver>=1.0.1 in /usr/local/lib/python3.6/dist-packages (from matplotlib>=1.4->pandas-profiling) (1.2.0)
# Importing the necessary libraries
import numpy as np
import pandas as pd
import pandas_profiling
import seaborn as sns
import matplotlib.pyplot as plt

%matplotlib inline