Learn practical skills, build real-world projects, and advance your career

Data for the month of May 2019

Exploratory Data Analysis and perdiction using simple sklearn models are being done

Various Libraries are being imported such as Numpy, Pandas, Matplotlib and Seaborn to extract statistical parameters

import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
%matplotlib inline
sns.set
<function seaborn.rcmod.set(context='notebook', style='darkgrid', palette='deep', font='sans-serif', font_scale=1, color_codes=True, rc=None)>

.JSOLN files are being imported from the system

df_array = []
for i in range (1, 32):
    if i < 10:
        df_array.append(pd.read_json('2019-05-0' + str(i) + '.jsonl', lines = True))
    else:
        df_array.append(pd.read_json('2019-05-' + str(i) + '.jsonl', lines = True))
df = pd.concat(df_array, ignore_index = True, sort = True)

This is to merge all the files into a single .csv file