In this course, learners will prepare data for, implement, and optimize three advanced clustering models in Python while comparing their different use cases. In particular, this course focuses on the suitability of different clustering methods for different kinds of data: numerical, categorical, and mixed. Learners will distinguish between K-modes, mean-shift, and K-prototypes models, developing their understanding of when each model will best meet their needs.
Learn how to apply seasonal analysis and ARIMA models and how to decompose and identify seasonal and non- seasonal factors all while learning the nuances of building sophisticated time series models.
This course covers the clustering concepts of natural language processing, equipping learners with the ability to cluster text data into groups and topics by finding similarities between different documents.
Data is often messy, requiring cleaning and restructuring before it can be reliably used in a program or project. In this course, learners will augment their understanding of Python using two of the most popular libraries for data cleaning and wrangling, NumPy and Pandas. First, learners will practice working with NumPy objects, transforming data into efficient arrays for ease of analysis. Then, learners will clean and structure arrays into readable tabular DataFrames using Pandas, allowing them to profile a dataset for key answers and values.
Decision tree models are classification algorithms that sort novel data into categories based on iterative splitting, like the branches of a tree, according to input parameters. In this course, learners will identify use cases for decision trees in Python. They will wrangle data and implement a decision tree model before attempting to evaluate its effectiveness. Finally, learners will use their knowledge of the mathematics behind decision trees to tune the model and improve its classificatory function.
This course covers an overview of ensemble learning methods like random forest and boosting. At the end of this course, students will be able to implement and compare random forest algorithm and boosting.
This course helps students to identify the most impactful features for your model. It will build upon foundational machine learning techniques to hone predictive skills and discover critical danger points in patterns. By the end of this course, students will be able to determine key features in models.
There was a problem reporting this post.
Please confirm you want to block this member.
You will no longer be able to:
Please note: This action will also remove this member from your connections and send a report to the site admin. Please allow a few minutes for this process to complete.