Processing Text with R Essential Training

Go to class
Write Review

Free Online Course: Processing Text with R Essential Training provided by LinkedIn Learning is a comprehensive online course, which lasts for Less than 1 hour of material. The course is taught in English and is free of charge. Upon completion of the course, you can receive an e-certificate from LinkedIn Learning. Processing Text with R Essential Training is taught by Kumaran Ponnambalam.

Overview
  • Learn key techniques for cleansing and processing text in R, and discover how to convert text to a form that's ready for analytics and predictions.

Syllabus
  • Introduction

    • The emergence of text analytics
    1. Introduction to Text Mining
    • Purpose
    • Document
    • Corpus
    • R text processing libraries
    • Setting up the environment
    2. Corpus in R
    • PCorpus and VCorpus
    • Reading files with CorpusReader
    • Exploring the corpus
    • Persisting the corpus
    3. Text Cleansing and Extraction
    • Setup for processing
    • Cleansing text
    • Stop word removal
    • Stemming
    • Managing metadata
    4. TF-IDF
    • Introduction to tf-idf
    • Generating term frequency matrix
    • Improving term frequency matrix
    • Plotting term frequency
    • Generating tf-idf
    5. N-Grams
    • N-grams concepts
    • Using RWeka NGramTokenizer
    • Creating an n-gram text frequency matrix
    • Extracting n-gram pairs
    6. Best Practices
    • Storing text
    • Processing text data
    • Scalability
    Conclusion
    • Next steps