Learn practical skills, build real-world projects, and advance your career

New York City Taxi Trip Duration

About Data (Source Kaggle)

The competition dataset is based on the 2016 NYC Yellow Cab trip record data made available in Big Query on Google Cloud Platform. The data was originally published by the NYC Taxi and Limousine Commission (TLC). The data was sampled and cleaned for the purposes of this playground competition. Based on individual trip attributes, participants should predict the duration of each trip in the test set

!pip install jovian opendatasets xgboost graphviz lightgbm scikit-learn xgboost lightgbm --upgrade --quiet
|████████████████████████████████| 166.7 MB 14 kB/s |████████████████████████████████| 2.0 MB 26.1 MB/s |████████████████████████████████| 22.3 MB 26.6 MB/s
!pip install opendatasets --quiet
import opendatasets as od
import pandas as pd 
import numpy as np
url="https://www.kaggle.com/c/nyc-taxi-trip-duration/data"