Spark for Machine Learning & AI

Spark for Machine Learning & AI is a free online course from LinkedIn Learning with 1-2 hours of material. It is taught in English by Dan Sullivan, and you can receive an e-certificate from LinkedIn Learning upon completion.

Overview
  • Discover the powerful Apache Spark platform for machine learning. Learn about preprocessing data, applying algorithms to a variety of machine learning problems, and more.

Syllabus
    Introduction
    • Welcome
    1. Introduction to Spark and MLlib
    • Introduction to Spark
    • Steps in the machine learning process
    • Install Spark
    • Organizing data in DataFrames
    • Components of Spark MLlib
    2. Data Preparation and Transformation
    • Introduction to preprocessing
    • Normalize numeric data
    • Standardize numeric data
    • Bucketize numeric data
    • Tokenize text data
    • TF-IDF
    • Summary of preprocessing
    3. Clustering
    • Introduction to clustering
    • K-means clustering
    • Hierarchical clustering
    • Summary of clustering techniques
    4. Classification
    • Introduction to classification
    • Preprocessing the Iris data set
    • Naive Bayes classification
    • Multilayer perceptron classification
    • Decision trees classification
    • Summary of classification algorithms
    5. Regression
    • Introduction to regression
    • Preprocessing regression data
    • Linear regression
    • Decision tree regression
    • Gradient-boosted tree regression
    • Summary of regression algorithms
    6. Recommendations
    • Understand recommendation systems
    • Collaborative filtering
    Conclusion
    • Tips for using Spark MLlib