Clustering in NLP

This course covers the clustering concepts of natural language processing, equipping learners with the ability to cluster text data into groups and topics by finding similarities between different documents.

View Course details

OpenTeams

4 hours of instruction

OBJECTIVES

Understand measures of similarity and distance
Learn and implement cosine similarity on text documents
Understand how similar documents can be clustered into topics

PREREQUISITES

Topic Modeling in NLP

SYLLABUS & TOPICS COVERED

Cosine Similarity
- Measures of similarity and distance
- Theory and implementation of cosine similarity find most similar documents
Clustering Documents
- Clustering as an unsupervised method in text analysis
- Hierarchical clustering algorithm in a nutshell
- How to implement clustering on a corpus of documents

SOFTWARE REQUIREMENTS

You will have access to a Python-based JupyterHub environment for this course. No additional download or installation is required.

About Instructor

OpenTeams

56 Courses

Clustering in NLP

About Instructor

OpenTeams

Committed to your success with open source. OpenTeams is your easy point of access to a range of services from our open source expert network, from commercial open source support to open source training, staffing & recruiting services, and more.

Resources

OpenTeams