Data Cleaning

Data Cleaning is an online course provided by Kaggle, with about 4 hours of material. It is taught by Rachael Tatman. Upon completion of the course, you can receive an e-certificate from Kaggle. The course is taught in English and offers a free certificate. Visit the course page at Kaggle for detailed price information.

Overview
  • Master efficient workflows for cleaning real-world, messy data.

    • Drop missing values, or fill them in with an automated workflow.
    • Transform numeric variables to have helpful properties.
    • Help Python recognize dates as composed of day, month, and year.
    • Avoid UnicodeDecodeErrors when loading CSV files.
    • Efficiently fix typos in your data.
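
The goals listed above can be illustrated with a minimal sketch. It is not taken from the course material: it uses only the Python standard library, and the sample data, the column names (city, visited, score), the day-first date format, the list of canonical city names, and the helper names decode_with_fallback and clean are all hypothetical choices made for illustration.

```python
import csv
import io
from datetime import datetime
from difflib import get_close_matches

# Hypothetical raw CSV bytes, deliberately encoded as Latin-1 rather than UTF-8.
RAW = (
    "city,visited,score\n"
    "São Paulo,03/12/2021,10\n"
    "sao paulo ,04/12/2021,\n"
    "Lima,05/12/2021,30\n"
).encode("latin-1")

# Assumed list of correct spellings used to repair inconsistent entries.
CANONICAL_CITIES = ["São Paulo", "Lima"]


def decode_with_fallback(raw, encodings=("utf-8", "latin-1")):
    """Try each candidate encoding instead of crashing on UnicodeDecodeError."""
    for enc in encodings:
        try:
            return raw.decode(enc)
        except UnicodeDecodeError:
            continue
    raise ValueError("none of the candidate encodings worked")


def clean(raw):
    rows = list(csv.DictReader(io.StringIO(decode_with_fallback(raw))))

    # Missing values: fill empty scores with the mean of the observed ones.
    observed = [float(r["score"]) for r in rows if r["score"]]
    mean = sum(observed) / len(observed)
    scores = [float(r["score"]) if r["score"] else mean for r in rows]

    # Scaling: min-max rescale the scores into the range [0, 1].
    lo, hi = min(scores), max(scores)
    scaled = [(s - lo) / (hi - lo) for s in scores]

    cleaned = []
    for row, score in zip(rows, scaled):
        # Dates: parse text into a datetime, assuming day/month/year order.
        visited = datetime.strptime(row["visited"], "%d/%m/%Y")

        # Inconsistent entry: normalize whitespace and case, then map the
        # value to its closest canonical spelling when one is similar enough.
        city = row["city"].strip().title()
        match = get_close_matches(city, CANONICAL_CITIES, n=1, cutoff=0.8)
        cleaned.append(
            {"city": match[0] if match else city, "visited": visited, "score": score}
        )
    return cleaned


if __name__ == "__main__":
    for record in clean(RAW):
        print(record)
```

Mean imputation and min-max scaling are only one option for each step; dropping incomplete rows or applying a different transformation may suit other data better, and the course itself may use different libraries and methods.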

Syllabus
    • Handling Missing Values
    • Scaling and Normalization
    • Parsing Dates
    • Character Encodings
    • Inconsistent Data Entry