Learn practical skills, build real-world projects, and advance your career
Created 3 years ago
Let's begin by installing the required libraries.
!pip install opendatasets scikit-learn plotly folium --upgrade --quiet
|████████████████████████████████| 22.3 MB 1.2 MB/s
|████████████████████████████████| 20.6 MB 1.2 MB/s
|████████████████████████████████| 94 kB 4.8 MB/s
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
datascience 0.10.6 requires folium==0.2.1, but you have folium 0.12.1 which is incompatible.
!pip install pandas numpy matplotlib seaborn --quiet
Download and prepare the dataset for training
import os
from zipfile import ZipFile
from urllib.request import urlretrieve
dataset_url = 'https://github.com/JovianML/opendatasets/raw/master/data/house-prices-advanced-regression-techniques.zip'
urlretrieve(dataset_url, 'house-prices.zip')
with ZipFile('house-prices.zip') as f:
f.extractall(path='house-prices')
os.listdir('house-prices')
['test.csv', 'sample_submission.csv', 'train.csv', 'data_description.txt']