Advanced Clustering in Python

In this course, learners will prepare data for, implement, and optimize three advanced clustering models in Python while comparing their different use cases. In particular, this course focuses on the suitability of different clustering methods for different kinds of data: numerical, categorical, and mixed. Learners will distinguish between K-modes, mean-shift, and K-prototypes models, developing their understanding of when each model will best meet their needs.

4 hours of instruction

OBJECTIVES

  1. Mine data to find latent patterns and groups in numerical, categorical, and mixed data using K-modes, mean-shift, and K-prototypes
  2. Evaluate the accuracy and effectiveness of the models
  3. Identify the use cases of each clustering method

PREREQUISITES

Learners must be comfortable using Python to manipulate data, must know how to create basic visualizations, and should have a moderate background in how clustering works and its use cases.

SYLLABUS & TOPICS COVERED

  1. K-modes
    • K-modes use cases and theory
    • Implementation of K-modes on a dataset
  2. Mean-shift
    • Mean-shift use cases and theory
    • Implementation of mean-shift on a dataset
  3. K-prototypes
    • K-prototypes use cases and theory
    • Implementation of K-prototypes on a dataset
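
To make the difference between the three methods concrete, the following is a minimal standard-library sketch of the core quantity each one works with. It is not taken from the course materials: the function names, the `gamma` weight, and the flat-kernel simplification for mean-shift are illustrative assumptions. K-modes compares categorical records by counting mismatched attributes and summarizes a cluster by its per-column modes; mean-shift moves a numerical point toward the mean of its neighbors within a bandwidth; K-prototypes combines a numerical distance with a weighted categorical mismatch count.

```python
from collections import Counter


def matching_dissimilarity(a, b):
    """K-modes style: number of categorical attributes on which a and b differ."""
    return sum(x != y for x, y in zip(a, b))


def column_modes(records):
    """K-modes style cluster center: the most frequent value in each column."""
    return tuple(Counter(col).most_common(1)[0][0] for col in zip(*records))


def squared_euclidean(a, b):
    """Squared Euclidean distance between two numerical records."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def mean_shift_step(point, data, bandwidth):
    """One mean-shift update with a flat kernel (a simplifying assumption):
    move the point to the mean of all data points within `bandwidth`."""
    neighbors = [p for p in data if squared_euclidean(point, p) <= bandwidth ** 2]
    if not neighbors:  # no neighbors in range: leave the point where it is
        return tuple(point)
    return tuple(sum(c) / len(neighbors) for c in zip(*neighbors))


def mixed_dissimilarity(num_a, cat_a, num_b, cat_b, gamma=1.0):
    """K-prototypes style: numerical distance plus gamma-weighted mismatches.
    `gamma` (hypothetical default 1.0) balances the two parts."""
    return squared_euclidean(num_a, num_b) + gamma * matching_dissimilarity(cat_a, cat_b)


if __name__ == "__main__":
    print(matching_dissimilarity(("red", "small"), ("red", "large")))
    print(column_modes([("red", "small"), ("red", "large"), ("blue", "large")]))
    print(mean_shift_step((0.0,), [(0.0,), (1.0,), (10.0,)], bandwidth=2.0))
    print(mixed_dissimilarity((0.0,), ("a", "b"), (2.0,), ("a", "c"), gamma=0.5))
```

In practice these models are usually fitted with existing libraries rather than written by hand; the sketch only illustrates why each method suits a different kind of data.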

SOFTWARE REQUIREMENTS

You will have access to a Python-based JupyterHub environment for this course. No additional download or installation is required.

About Instructor

DataSociety

