Introduction to Clustering in Python

Clustering is a machine learning technique for grouping unlabeled data based on shared features. In this course, learners will identify use cases for clustering algorithms and become familiar with the theoretical underpinnings of unsupervised machine learning (working with unlabeled data). In particular, learners will build, evaluate, and interpret a model in Python using K-Means, one of the most commonly used clustering algorithms.

DataSociety

4 hours of instruction

OBJECTIVES

Mine numerical data to find latent patterns and groups using K-Means clustering
Evaluate the accuracy and effectiveness of clustering
Identify use cases where clustering analyses are relevant and where they are not applicable
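
As a rough illustration of what evaluating a clustering can look like, the sketch below computes the mean silhouette score, one common measure of how well separated clusters are. This page does not say which metrics the course uses, so the choice of metric, the function name `silhouette_score`, and the toy data are assumptions made for this example. The function needs at least two clusters, and the points must not all be identical.

```python
import math

def silhouette_score(points, labels):
    """Mean silhouette over all points (illustrative sketch, Euclidean distance)."""
    # Group points by their cluster label.
    clusters = {}
    for p, lab in zip(points, labels):
        clusters.setdefault(lab, []).append(p)

    scores = []
    for p, lab in zip(points, labels):
        own = clusters[lab]
        if len(own) == 1:
            # By convention, a point alone in its cluster scores 0.
            scores.append(0.0)
            continue
        # a: mean distance to the other members of the same cluster
        # (the distance from p to itself is 0, so it does not affect the sum).
        a = sum(math.dist(p, q) for q in own) / (len(own) - 1)
        # b: smallest mean distance to the members of any other cluster.
        b = min(
            sum(math.dist(p, q) for q in other) / len(other)
            for other_lab, other in clusters.items()
            if other_lab != lab
        )
        scores.append((b - a) / max(a, b))
    return sum(scores) / len(scores)

# Toy data (assumed for illustration): two well-separated groups.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 8.0), (8.2, 7.9), (7.8, 8.1)]
good_labels = [0, 0, 0, 1, 1, 1]  # matches the visible grouping
bad_labels = [0, 1, 0, 1, 0, 1]   # mixes the two groups

good = silhouette_score(points, good_labels)
bad = silhouette_score(points, bad_labels)
print(good, bad)
```

Scores close to 1 suggest compact, well-separated clusters, while scores near 0 or below suggest overlapping or misassigned points. Here the labeling that matches the visible groups scores higher than the mixed one.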

PREREQUISITES

Learners must be comfortable using Python to manipulate data and must know how to create basic visualizations.

SYLLABUS & TOPICS COVERED

K-Means
- Unsupervised learning and its use cases
- The theory behind the K-Means algorithm
- Implementation of K-Means on a dataset
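
For orientation, the K-Means procedure named in the syllabus can be sketched in a few lines of plain Python: assign each point to its nearest centroid, move each centroid to the mean of its assigned points, and repeat until the assignments stop changing. This is a minimal sketch and not the course's own material. The function name `kmeans`, the hand-picked starting centroids, and the toy data are assumptions for this example; practical implementations usually choose starting centroids randomly or with a scheme such as k-means++.

```python
def squared_distance(p, q):
    """Squared Euclidean distance between two points of equal dimension."""
    return sum((a - b) ** 2 for a, b in zip(p, q))

def kmeans(points, initial_centroids, n_iter=100):
    """Illustrative K-Means loop; k is the number of initial centroids."""
    centroids = list(initial_centroids)
    k = len(centroids)
    labels = None
    for _ in range(n_iter):
        # Assignment step: label each point with the index of its nearest centroid.
        new_labels = [
            min(range(k), key=lambda j: squared_distance(p, centroids[j]))
            for p in points
        ]
        if new_labels == labels:
            break  # assignments are stable, so the centroids will not move again
        labels = new_labels
        # Update step: move each centroid to the mean of its assigned points.
        for j in range(k):
            members = [p for p, lab in zip(points, labels) if lab == j]
            if members:  # an empty cluster keeps its previous centroid
                centroids[j] = tuple(sum(c) / len(members) for c in zip(*members))
    return labels, centroids

# Toy data (assumed for illustration): two well-separated groups.
points = [(1.0, 1.0), (1.2, 0.8), (0.9, 1.1), (8.0, 8.0), (8.2, 7.9), (7.8, 8.1)]
# Hand-picked starting centroids, one from each group, for reproducibility.
labels, centroids = kmeans(points, initial_centroids=[points[0], points[3]])
print(labels)
print(centroids)
```

With these starting centroids, the first three points end up in one cluster and the last three in the other. A different initialization can lead K-Means to a different, sometimes worse, grouping, which is one reason clustering results are evaluated rather than taken at face value.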

SOFTWARE REQUIREMENTS

You will have access to a Python-based JupyterHub environment for this course. No additional download or installation is required.

About Instructor

DataSociety

148 Courses
