Data for the month of May 2019

Exploratory data analysis and prediction using simple scikit-learn models are carried out.

Libraries such as NumPy, Pandas, Matplotlib and Seaborn are imported to handle the data and extract statistical parameters.

In [3]:
import numpy as np
import pandas as pd
import matplotlib.pyplot as plt
import seaborn as sns
%matplotlib inline
sns.set()

The daily .jsonl files are read in from the system, one file per day of May 2019.

In [4]:
df_array = []
for i in range(1, 32):
    # zero-pad the day so 1 May reads '2019-05-01.jsonl'
    df_array.append(pd.read_json(f'2019-05-{i:02d}.jsonl', lines=True))
df = pd.concat(df_array, ignore_index=True, sort=True)
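
As an aside, the same loading can be written more compactly with glob (a sketch, assuming the daily .jsonl files sit in the working directory):

# Sketch: load all daily dumps at once via a filename pattern
import glob
files = sorted(glob.glob('2019-05-*.jsonl'))
df = pd.concat((pd.read_json(f, lines=True) for f in files),
               ignore_index=True, sort=True)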

The merged DataFrame can be written out to a single .csv file; df.head() gives a first look at the data.

In [5]:
df.to_csv('may_2019.csv')  # output filename is illustrative
df.head()
Out[5]:

df has 409,083 rows, and none of its columns has missing values. The dataset describes the courses undertaken by users in May 2019: for each entry it records the provider, the course name (spec) and a timestamp.

In [12]:
df.info()
<class 'pandas.core.frame.DataFrame'>
RangeIndex: 409083 entries, 0 to 409082
Data columns (total 6 columns):
provider     409083 non-null object
schema       409083 non-null object
spec         409083 non-null object
status       409083 non-null object
timestamp    409083 non-null datetime64[ns]
version      409083 non-null int64
dtypes: datetime64[ns](1), int64(1), object(4)
memory usage: 18.7+ MB

The total number of unique projects undertaken is 6,180.

In [14]:
df.spec.nunique()
Out[14]:
6180

Finding the most popular project/course: the ipython/ipython-in-depth course was launched 183,362 times, followed by the JupyterLab demo with 34,704 entries.

In [13]:
most_popular_projects = df.groupby(['spec'])['spec'].count()
most_popular_projects.nlargest(10)
Out[13]:
spec
ipython/ipython-in-depth/master              183362
jupyterlab/jupyterlab-demo/master             34704
DS-100/textbook/master                        21773
ines/spacy-io-binder/live                     18498
bokeh/bokeh-notebooks/master                   8916
ines/spacy-course/binder                       6108
binder-examples/r/master                       5703
binder-examples/requirements/master            5402
rationalmatter/juno-demo-notebooks/master      5153
QuantStack/xeus-cling/stable                   4512
Name: spec, dtype: int64
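
The same ranking can be obtained in one step with value_counts, which sorts in descending order by default:

# Equivalent one-liner for ranking projects by launch count
df['spec'].value_counts().head(10)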

Finding the different providers of the courses.

In [43]:
df['provider'].unique()
Out[43]:
array(['GitHub', 'GitLab', 'Git', 'Gist'], dtype=object)

How many projects were done from each particular source?

In [17]:
source = df.groupby(['provider', 'spec'])['spec'].count()
source.head()
Out[17]:
provider  spec                                                  
Gist      AllenDowney/3e0ee50e828cb3a4bc2a720797bb303c/master       1
          AustinRochford/505e6a3647c57dbe4bd55a4c311a2a95/master    2
          AustinRochford/62c283a3f0fae90b5e39/master                1
          BadreeshShetty/bf9cb1dced8263ef997bcb2c3926569b/master    3
          BenLangmead/6513059/master                                1
Name: spec, dtype: int64

GitHub is by far the most common provider, with 404,286 entries; Git is the least used.

In [19]:
popular_source = df.groupby(['provider'])['provider'].count()
df_source = pd.DataFrame(popular_source)
df_source
Out[19]:

Distribution of sources/providers

In [13]:
# data visualization using matplotlib bar graph for the no. of projects from different sources
plt.bar(df_source.index, df_source['provider'], color = 'g')
plt.yscale('log')
plt.title('Distribution of Sources')
plt.xlabel('Source Name')
plt.ylabel('Projects Undertaken (log scale)')
Out[13]:
Text(0,0.5,'Projects Undertaken (log scale)')
Notebook Image

For a more in-depth analysis, the timestamp is split into separate date and time columns.

In [27]:
# splitting timestamp into date and time in different columns
df['new_date'] = [d.date() for d in df['timestamp']]
df['new_time'] = [d.time() for d in df['timestamp']]
df.head()
Out[27]:
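
Since timestamp is a datetime64 column, the same split can also be done with the vectorised .dt accessor, which avoids the Python-level loops:

# Vectorised alternative to the list comprehensions above
df['new_date'] = df['timestamp'].dt.date
df['new_time'] = df['timestamp'].dt.time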

The maximum number of projects was done on the 22nd of May, 2019.

In [44]:
# top 5 days when the maximum no. of projects were done
most_popular_date = df.groupby(['new_date'])['new_date'].count()
most_popular_date.nlargest()
Out[44]:
new_date
2019-05-22    18627
2019-05-08    17393
2019-05-14    17309
2019-05-09    16657
2019-05-13    16635
Name: new_date, dtype: int64

The average number of projects done every day is about 13,196.

In [56]:
df_date = pd.DataFrame(most_popular_date)
df_date.mean()
Out[56]:
new_date    13196.225806
dtype: float64

The most common times at which a project is started fall between 1300 and 1500 hours.

In [48]:
# most common time when the projects start
most_popular_time = df.groupby(['new_time'])['new_time'].count()
most_popular_time.nlargest(5)
Out[48]:
new_time
14:50:00    497
13:41:00    481
14:40:00    479
09:30:00    477
13:42:00    477
Name: new_time, dtype: int64

The average number of projects undertaken every minute on a given day is about 9.

In [59]:
df_time = pd.DataFrame(most_popular_time)
print(df_time.mean())
print(df_time.mean()/31)
new_time    284.085417
dtype: float64
new_time    9.164046
dtype: float64
In [30]:
# splitting day and time further into year, month, date; hour, minute and second
df['time_str'] = df['new_time'].astype(str)
df['date_str'] = df['new_date'].astype(str)
df[['hour', 'minute', 'seconds']] = df.time_str.str.split(':', expand = True).astype(float)
df[['year', 'month', 'date']] = df.date_str.str.split('-', expand = True).astype(float)
df.head()
Out[30]:

Finding the most common hour to start a project: 1300 hours.

In [54]:
# most common hour when the project starts
most_popular_hour = df.groupby(['hour'])['hour'].count()
most_popular_hour.nlargest(5)
Out[54]:
hour
13.0    24603
14.0    24324
12.0    22764
9.0     22713
15.0    22112
Name: hour, dtype: int64

The average number of projects done every hour is around 550.

In [60]:
df_hour = pd.DataFrame(most_popular_hour)
df_hour.mean()/31
Out[60]:
hour    549.842742
dtype: float64
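
As a quick sanity check, the same figure follows directly from the total row count divided by the number of hours in the month:

# 409,083 launches spread over 31 days x 24 hours
len(df) / (31 * 24)   # ≈ 549.84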

Representing the number of projects undertaken in each hour across the whole month.

In [31]:
# data visualisation on the basis of the hour in which the project was done
df.hist(column = 'hour', bins = 24, rwidth = 0.9, color = 'y')
plt.xlabel('Time (in hours)')
plt.ylabel('No. of projects undertaken')
plt.title('Distribution on the basis of time')
Out[31]:
Text(0.5,1,'Distribution on the basis of time')
Notebook Image

The maximum number of projects was done between 1300 and 1400 hours.

In [32]:
# most common hour
df['hour'].mode()
Out[32]:
0    13.0
dtype: float64

The maximum traffic was on the 22nd of May, 2019.

In [33]:
# most popular date of May, 2019
df['date'].mode()
Out[33]:
0    22.0
dtype: float64

Representing the number of projects undertaken on each date across the whole month.

In [34]:
#data visualisation on the basis of the date of May, 2019
df.hist(column = 'date', bins = 30, rwidth = 0.9, color = 'y')
plt.xlabel('Date')
plt.ylabel('No. of Projects undertaken')
plt.title('Distribution on the basis of date')
Out[34]:
Text(0.5,1,'Distribution on the basis of date')
Notebook Image

Another way to represent the number of projects, using the Seaborn library.

In [37]:
# using seaborn tools to visualise the data on the basis of time of the day
sns.distplot(df.hour)
Out[37]:
<matplotlib.axes._subplots.AxesSubplot at 0x2851fb9ae48>
Notebook Image
In [38]:
# using seaborn tools to visualise the data on the basis of the date of the month
sns.distplot(df.date)
Out[38]:
<matplotlib.axes._subplots.AxesSubplot at 0x2852068fd30>
Notebook Image

To extract statistical parameters from the data, the string values of provider and spec need to be converted into integer codes.

In [61]:
# Encode provider as integer codes; keep the original labels for decoding later
df['provider'], provider_levels = pd.factorize(df['provider'])
In [62]:
# Encode spec the same way
df['spec'], spec_levels = pd.factorize(df['spec'])
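
The uniques returned by pd.factorize double as a code-to-label lookup, so integer codes can be translated back to the original strings when reading results later (a minimal sketch):

# Decode an integer code back to its original spec string
spec_levels[df.loc[0, 'spec']]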

There is not much variation in the distribution across particular dates and hours.

In [80]:
pd.crosstab(df.hour, df.date, margins = True).style.background_gradient(cmap='autumn_r')
Out[80]:

Testing simple Decision Tree and Random Forest models from scikit-learn.

In [82]:
from sklearn.ensemble import RandomForestRegressor
from sklearn.metrics import mean_absolute_error
from sklearn.metrics import mean_squared_error
from sklearn.model_selection import train_test_split
from sklearn.tree import DecisionTreeRegressor
from math import sqrt

First we define the target and the features: the hour is predicted, keeping spec, provider and date as features. (Y, which also includes hour, is used later for the correlation matrix.)

In [76]:
y = df.hour
features = ['spec', 'provider', 'date']
X = df[features]
features_1 = ['spec', 'provider', 'date', 'hour']
Y = df[features_1]

We apply a train/test split to get a first overview of model performance; the model predicts the hour at which a project from the test data will be started.

In [77]:
train_X, val_X, train_y, val_y = train_test_split(X, y, random_state = 1)
df_model = RandomForestRegressor(random_state = 1)
df_model.fit(train_X, train_y)
df_predictions = df_model.predict(val_X)
print(df_predictions)
[ 9.33153356 5.15238095 11.48847717 ... 10.41370788 19.045 11.897042 ]

The accuracy of the model is assessed by computing the mean absolute error, which comes out to about 4.73, and the RMSE, about 5.83.

In [84]:
df_mae = mean_absolute_error(df_predictions, val_y)
print(df_mae)
df_mse = mean_squared_error(df_predictions, val_y)
print(df_mse)
rmse = sqrt(df_mse)
print(rmse)
4.727974998281183
33.988401476446334
5.829957244821469
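
To judge whether these errors are meaningful, it helps to compare against a trivial baseline that always predicts the mean training hour; if the forest barely beats this constant predictor, the features carry little signal about the hour (a sketch reusing the variables above):

# Baseline: always predict the mean hour observed in training
baseline = np.full(len(val_y), train_y.mean())
print(mean_absolute_error(val_y, baseline))
print(sqrt(mean_squared_error(val_y, baseline)))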

The correlation matrix confirms that there is not much correlation between the hour and the other parameters of the data.

In [39]:
print(Y.corr())
              spec  provider      date      hour
spec      1.000000  0.098441  0.146765  0.042911
provider  0.098441  1.000000 -0.048344  0.011894
date      0.146765 -0.048344  1.000000 -0.023311
hour      0.042911  0.011894 -0.023311  1.000000

The Decision Tree model predicts the hour with a mean absolute error and RMSE similar to those of the Random Forest model.

In [86]:
train_X, val_X, train_y, val_y = train_test_split(X, y, random_state = 1)
df_DT_model = DecisionTreeRegressor(random_state = 1)
df_DT_model.fit(train_X, train_y)
df_DT_predictions = df_DT_model.predict(val_X)
print(df_DT_predictions)
[ 9.23809524 5. 11.49863636 ... 10.40383704 20. 11.88690023]
In [87]:
df_DT_mae = mean_absolute_error(df_DT_predictions, val_y)
print(df_DT_mae)
df_DT_mse = mean_squared_error(df_DT_predictions, val_y)
print(df_DT_mse)
rmse_DT = sqrt(df_DT_mse)
print(rmse_DT)
4.747081046709708
34.65872756782684
5.887166344501133

Summary of the Data:

  1. Most common provider - GitHub
  2. Most common project - ipython/ipython-in-depth/master
  3. Average no. of projects done every day - about 13,196
  4. Average no. of projects done every hour - about 550
  5. The best time for a source to offer a course, so as to reach the maximum number of users, is between 1300 and 1400 hours
  6. The hour at which a course is started can be predicted with the Random Forest Regressor, with a root mean square error of about 5.83 (5.89 for the Decision Tree)