
A categorical analysis of the Udemy courses out there

This project is based on the Udemy Courses dataset on Kaggle. It is an exploratory data analysis (EDA) notebook in which I try to answer some interesting questions about the dataset.

I will use pandas, NumPy, Matplotlib and seaborn for the analysis.

Downloading the Dataset

The dataset can be downloaded from Kaggle at this link.

If you are using Binder on Jovian.ml, you can find the dataset in the attachments category.

import os

# Confirm the dataset exists by asking for its size in bytes
# (os.path.getsize raises an OSError if the file is missing)
CSV_PATH = "udemy_courses.csv"
os.path.getsize(CSV_PATH)
693946
project_name = "zerotopandas-course-project" # change this (use lowercase letters and hyphens only)
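The size check above fails with a bare OSError if the file is not in the working directory. A slightly more defensive variant is sketched below, using only the standard library. The helper name `dataset_size` is hypothetical and not part of the original notebook.

```python
from pathlib import Path


def dataset_size(path: str) -> int:
    """Return the file size in bytes, or raise a descriptive error if it is missing.

    Hypothetical helper for illustration; the notebook itself calls
    os.path.getsize directly.
    """
    file = Path(path)
    if not file.is_file():
        # Include the resolved path so it is obvious where Python looked
        raise FileNotFoundError(f"Dataset not found at {file.resolve()}")
    return file.stat().st_size
```

Calling `dataset_size(CSV_PATH)` should return the same number as `os.path.getsize(CSV_PATH)` when the file is present.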

Data Preparation and Cleaning

We will start by viewing information about the various columns and then clean up the dataset.
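As a rough sketch of what this step can look like, the snippet below builds a per-column summary and drops exact duplicate rows. It runs on a tiny in-memory frame so that it is self-contained. The helper names (`summarize_columns`, `basic_clean`), the column names and the sample values are all assumptions made for illustration and are not taken from the real dataset.

```python
import pandas as pd


def summarize_columns(df: pd.DataFrame) -> pd.DataFrame:
    """Return one row per column with its dtype, non-null count and unique count."""
    return pd.DataFrame({
        "dtype": df.dtypes.astype(str),
        "non_null": df.notna().sum(),
        "n_unique": df.nunique(),  # NaN is not counted as a unique value
    })


def basic_clean(df: pd.DataFrame) -> pd.DataFrame:
    """Drop exact duplicate rows and renumber the index."""
    return df.drop_duplicates().reset_index(drop=True)


# Illustrative stand-in data; with the real file this would be
# pd.read_csv(CSV_PATH) instead.
sample = pd.DataFrame({
    "course_title": ["Intro to Python", "Intro to Python", "Guitar Basics"],
    "price": [20, 20, None],
})

summary = summarize_columns(sample)
cleaned = basic_clean(sample)
print(summary)
print(cleaned)
```

On the real data, `df.info()` and `df.describe()` give a similar overview without a custom helper. The summary table here is just easier to inspect or filter programmatically.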