project NYC Taxi Trip Duration Dataset
Introduction Let us walk through the Exploratory Data Analysis on NYC Taxi Trip Duration Dataset. What is Exploratory Data Analysis? Exploratory Data Analysis is investigating data and drawing out insights from it to study its main characteristics. EDA can be done using statistical and visualization techniques. Why is EDA important? We simply can’t make sense of such huge datasets if we don’t explore the data. Exploring and analyzing the data is important to see how features are contributing to the target variable, identifying anomalies and outliers to treat them lest they affect our model, to study the nature of the features, and be able to perform data cleaning so that our model building process is as efficient as possible s. Importing necessary libraries import pandas as pd #data processing import numpy as np #linear algebra #data visualisation import seaborn as sns sns.set() import matplotlib.pyplot as plt %matplotlib inline import datetime as dt import warnings...