4 hours of instruction
This course covers the basics of natural language processing, equipping learners with the ability to clean and process large amounts of text data required for text analysis.
OBJECTIVES
- Describe how text mining can be used effectively in commercial applications and industry
- Process and format text data for analysis
- Extract key summary metrics and words from a corpus of documents
PREREQUISITES
Students must be comfortable using Python to manipulate data and must know how to create basic visualizations.
SYLLABUS & TOPICS COVERED
- Basics Of NLP
- Text mining use cases and challenges
- Text analysis terminology
- Text Processing in R
- Text processing steps
- Term Document matrix
- Word distribution in a corpus
SOFTWARE REQUIREMENTS
You will have access to an R-based Posit Cloud environment for this course. No additional download or installation is required.