Data is often messy, requiring cleaning and restructuring before it can be reliably used in a program or project. In this course, learners will augment their understanding of base R using an open-source set of packages intended for data cleaning and wrangling, the tidyverse. After installing this package, learners will practice working with functions that allow data to be selected, filtered, summarized, rearranged, and otherwise transformed according to analyst-vetted best practices.
Decision tree models are classification algorithms that sort novel data into categories based on iterative splitting, like the branches of a tree, according to input parameters. In this course, learners will identify use cases for decision trees in R. They will wrangle data and implement a decision tree model before attempting to evaluate its effectiveness. Finally, learners will use their knowledge of the mathematics behind decision trees to tune the model and improve its classificatory function.
This course covers an overview of ensemble learning methods like random forest and boosting. At the end of this course, students will be able to implement and compare random forest algorithm and boosting.
There was a problem reporting this post.
Please confirm you want to block this member.
You will no longer be able to:
Please note: This action will also remove this member from your connections and send a report to the site admin. Please allow a few minutes for this process to complete.