Apache Flink: Batch Mode Data Engineering

Go to class
Write Review

Free Online Course: Apache Flink: Batch Mode Data Engineering provided by LinkedIn Learning is a comprehensive online course, which lasts for 1-2 hours worth of material. The course is taught in English and is free of charge. Upon completion of the course, you can receive an e-certificate from LinkedIn Learning. Apache Flink: Batch Mode Data Engineering is taught by Kumaran Ponnambalam.

Overview
  • Explore how to build batch mode data pipelines with Apache Flink, the powerful and popular stream-processing platform.

Syllabus
  • Introduction

    • Batch mode engineering
    1. Apache Flink
    • What is Apache Flink?
    • Apache Flink features
    • Architecture of Apache Flink
    • Flink program structure
    • Flink execution flow
    2. Setting Up Flink
    • Installing Flink standalone
    • Creating a Flink project
    • Build a sample Flink program
    • Running jobs on the cluster
    • Using the Flink web interface
    • Setting up the exercise files
    3. Dataset API
    • DataSet API concepts
    • Reading a CSV File
    • Using Map
    • Using FlatMap
    • Using filters
    • Using aggregates
    • Using Reduce
    4. Advanced Capabilities
    • Using POJO classes
    • Join operations
    • Using MySQL with Flink
    • Using broadcast variables
    5. Use Case Project
    • Problem definition
    • Computing total score
    • Printing scores for physics
    • Computing average scores across subjects
    • Find the top student for each subject
    Conclusion
    • Next steps