Learn practical skills, build real-world projects, and advance your career

Course Project - Real-World Machine Learning Model

This Jupyter notebook was created as part of the course Machine Learning with Python: Zero to GBMs. The problem that we are dealing is prediction of the trip duration of taxi using data sampled for this purpose. The dataset can be found and downloaded from Kaggle.

The objectives of this notebook are:

  • Defining the predicting task we need to solve
  • Splitting a dataset into training, validation & test sets
  • Preparing the data (cleaning, remove null values, scaling, encoding categorical values etc.)
  • Exploring the data set (exploratory data analysis)
  • Pick machine learning models and fitting them using sklearn python library
  • Making predictions on the test set
  • Hyperparameter tuning
  • Save and re-use the trained model

Use the "Run" button to execute the code.

!pip install jovian --upgrade --quiet
WARNING: Running pip as root will break packages and permissions. You should install packages reliably by using venv: https://pip.pypa.io/warnings/venv
import jovian
!pip install plotly numpy pandas matplotlib seaborn opendatasets xgboost graphviz lightgbm scikit-learn xgboost lightgbm --quiet
WARNING: Running pip as root will break packages and permissions. You should install packages reliably by using venv: https://pip.pypa.io/warnings/venv
# Execute this to save new versions of the notebook
#jovian.commit(project="machine-learning-course-project")