Learn practical skills, build real-world projects, and advance your career

Indian Crop Production: Exploratory Data Analysis to chart Agriculture highlights using Python

Came across this fascinating and important all emcompassing dataset on State-wise Crop Production in India at kaggle for 1997 to 2015 https://www.kaggle.com/abhinand05/crop-production-in-india.

It also gave information about different seasonal crops at district level and area of cultivation along with total crop production.India being agriculture rich country, this data will have lots of minor and major facts which will help in charting a next successful agriculture revolution after 1965. Few facts of green revolution of India taken from https://en.wikipedia.org/wiki/Green_Revolution_in_India:
"Under premiership of Congress leader Lal Bahadur Shastri, the Green Revolution within India commenced in 1965 that led to an increase in food grain production, especially in Punjab, Haryana, and Uttar Pradesh"

Doing an exploratory data analysis of this dataset would give insights into Indian agriculture status: state-wise, district-wise, crop-wise, area-wise and levels of productions. A complete analysis will paint a beautiful story of this important aspect of India.

Using some tricks and tips from "Data Analysis with Python: Zero to Pandas](zerotopandas.com)" will apply to this rich and diverse dataset to glean important and broad aspects which showcase our agriculture might and how it is natured and enhanced by scientific knowledge combined with traditional cultivation process.

Downloading the Dataset

The Dataset was downloaded using jovian opendatasets with kaggle link in .csv format. Opendatasets is collection or repository of interesting datasets from different domains which will help Data Science enthuiast to experiment and sharpen their skills using different modelling concepts to either to classify or forecast future incoming data.

!pip install jovian opendatasets --upgrade --quiet

Let's begin by downloading the data, and listing the files within the dataset.

# Change this
dataset_url = 'https://www.kaggle.com/abhinand05/crop-production-in-india/crop_production.csv'