
Let's begin by installing the required libraries.

!pip install opendatasets scikit-learn plotly folium --upgrade --quiet
ERROR: pip's dependency resolver does not currently take into account all the packages that are installed. This behaviour is the source of the following dependency conflicts.
datascience 0.10.6 requires folium==0.2.1, but you have folium 0.12.1 which is incompatible.
!pip install pandas numpy matplotlib seaborn --quiet
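Since pip may report dependency conflicts during installation, it can help to confirm which versions actually ended up in the environment. The following is a minimal sketch using the standard library's `importlib.metadata` (Python 3.8+); the helper name `installed_versions` is hypothetical and not part of the original notebook.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_versions(packages):
    """Map each distribution name to its installed version, or None if missing."""
    result = {}
    for name in packages:
        try:
            result[name] = version(name)
        except PackageNotFoundError:
            # The distribution is not installed in this environment.
            result[name] = None
    return result


# Example call with the libraries installed above:
# installed_versions(['opendatasets', 'scikit-learn', 'plotly', 'folium'])
```

A `None` value indicates that the package is not installed, which is a quick signal to rerun the corresponding `pip install` cell.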

Download and prepare the dataset for training

import os
from zipfile import ZipFile
from urllib.request import urlretrieve

dataset_url = 'https://github.com/JovianML/opendatasets/raw/master/data/house-prices-advanced-regression-techniques.zip'
urlretrieve(dataset_url, 'house-prices.zip')
with ZipFile('house-prices.zip') as f:
    f.extractall(path='house-prices')
    
os.listdir('house-prices')
['test.csv', 'sample_submission.csv', 'train.csv', 'data_description.txt']
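Re-running the download cell fetches the archive again every time. As a rough sketch, the same steps can be wrapped in a function that skips the download when the zip file is already on disk. The function name `fetch_and_extract` and its parameters are assumptions made for illustration; only `urlretrieve`, `ZipFile`, and `os.listdir` come from the original cell.

```python
import os
from urllib.request import urlretrieve
from zipfile import ZipFile


def fetch_and_extract(url, zip_path, extract_dir):
    """Download url to zip_path unless it already exists, then extract it.

    Returns the sorted list of file names found in extract_dir.
    """
    # Skip the network call when the archive is already present locally.
    if not os.path.exists(zip_path):
        urlretrieve(url, zip_path)
    with ZipFile(zip_path) as f:
        f.extractall(path=extract_dir)
    return sorted(os.listdir(extract_dir))


# Example call with the URL and paths used above (needs network access on first run):
# fetch_and_extract(dataset_url, 'house-prices.zip', 'house-prices')
```

The result is sorted because `os.listdir` returns entries in arbitrary order, so the listing may otherwise differ between runs or machines.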