Learn practical skills, build real-world projects, and advance your career

NYC Taxi Trip Duration Prediction

This Data set Contains fields like:

  1. id - a unique identifier for each trip
    2 . vendor_id - a code indicating the provider associated with the trip record
  2. pickup_datetime - date and time when the meter was engaged
  3. dropoff_datetime - date and time when the meter was disengaged
  4. passenger_count - the number of passengers in the vehicle (driver entered value)
  5. pickup_longitude - the longitude where the meter was engaged
  6. pickup_latitude - the latitude where the meter was engaged
  7. dropoff_longitude - the longitude where the meter was disengaged
  8. dropoff_latitude - the latitude where the meter was disengaged
    10.store_and_fwd_flag - This flag indicates whether the trip record was held in vehicle memory before sending to the vendor because the vehicle did not have a connection to the server - Y=store and forward; N=not a store and forward trip
  9. trip_duration - duration of the trip in seconds

You can download the dataset from https://www.kaggle.com/c/nyc-taxi-trip-duration/data

Downloading the data

import pandas as pd
import numpy as np

import seaborn as sns
import matplotlib.pyplot as plt
import plotly.express as px
import matplotlib

matplotlib.rcParams['figure.figsize'] = 10,10
nyc_df = pd.read_csv('train.csv')
nyc_df