Learn practical skills, build real-world projects, and advance your career

Unsupervised Learning using Scikit Learn - Machine Learning with Python

This tutorial is a part of the Machine Learning with Python: Zero to GBMs and Zero to Data Science Bootcamp by Jovian.

alt

The following topics are covered in this tutorial:

  • Overview of unsupervised learning algorithms in Scikit-learn
  • Clustering algorithms: K Means, DBScan, Hierarchical clustering etc.
  • Dimensionality reduction (PCA) and manifold learning (t-SNE)

How to run the code

This tutorial is an executable Jupyter notebook hosted on Jovian. You can run this tutorial and experiment with the code examples in a couple of ways: using free online resources (recommended) or on your computer.

Option 1: Running using free online resources (1-click, recommended)

The easiest way to start executing the code is to click the Run button at the top of this page and select Run on Binder. You can also select "Run on Colab" or "Run on Kaggle", but you'll need to create an account on Google Colab or Kaggle to use these platforms.

Option 2: Running on your computer locally

To run the code on your computer locally, you'll need to set up Python, download the notebook and install the required libraries. We recommend using the Conda distribution of Python. Click the Run button at the top of this page, select the Run Locally option, and follow the instructions.

Jupyter Notebooks: This tutorial is a Jupyter notebook - a document made of cells. Each cell can contain code written in Python or explanations in plain English. You can execute code cells and view the results, e.g., numbers, messages, graphs, tables, files, etc., instantly within the notebook. Jupyter is a powerful platform for experimentation and analysis. Don't be afraid to mess around with the code & break things - you'll learn a lot by encountering and fixing errors. You can use the "Kernel > Restart & Clear Output" menu option to clear all outputs and start again from the top.

Let's install the required libraries.

!pip install numpy pandas matplotlib seaborn --quiet