Learn practical skills, build real-world projects, and advance your career
Created 3 years ago
NYC Taxi Trip Duration Prediction
This Data set Contains fields like:
- id - a unique identifier for each trip
2 . vendor_id - a code indicating the provider associated with the trip record - pickup_datetime - date and time when the meter was engaged
- dropoff_datetime - date and time when the meter was disengaged
- passenger_count - the number of passengers in the vehicle (driver entered value)
- pickup_longitude - the longitude where the meter was engaged
- pickup_latitude - the latitude where the meter was engaged
- dropoff_longitude - the longitude where the meter was disengaged
- dropoff_latitude - the latitude where the meter was disengaged
10.store_and_fwd_flag - This flag indicates whether the trip record was held in vehicle memory before sending to the vendor because the vehicle did not have a connection to the server - Y=store and forward; N=not a store and forward trip - trip_duration - duration of the trip in seconds
You can download the dataset from https://www.kaggle.com/c/nyc-taxi-trip-duration/data
Downloading the data
import pandas as pd
import numpy as np
import seaborn as sns
import matplotlib.pyplot as plt
import plotly.express as px
import matplotlib
matplotlib.rcParams['figure.figsize'] = 10,10
nyc_df = pd.read_csv('train.csv')
nyc_df