PySpark - Python Spark Hadoop coding framework & testing

PySpark - Python Spark Hadoop coding framework & testing is a comprehensive online course provided by Udemy, with 3-4 hours of material. The course is taught by FutureX Skills. Upon completion of the course, you can receive an e-certificate from Udemy. The course is taught in English and is a paid course. Visit the course page at Udemy for detailed price information.

Overview
  • Big data Python Spark (PySpark) coding framework: logging, error handling, unit testing, PyCharm, PostgreSQL, Hive data pipeline

    What you'll learn:

    • Python Spark (PySpark) industry-standard coding practices: logging, error handling, reading configuration, unit testing
    • Building a data pipeline using Hive, Spark and PostgreSQL
    • Python Spark Hadoop development using PyCharm

    This course will bridge the gap between your academic and real-world knowledge and prepare you for an entry-level Big Data Python Spark developer role. You will learn the following:

    • Python Spark coding best practices

    • Logging

    • Error Handling

    • Reading configuration from properties file

    • Doing development work using PyCharm

    • Using your local environment as a Hadoop Hive environment

    • Reading and writing to a Postgres database using Spark

    • Python unit testing framework

    • Building a data pipeline using Hadoop, Spark and Postgres
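
To give a feel for three of the topics listed above (logging, error handling, and reading configuration from a properties file), here is a minimal sketch using only the Python standard library. It is not taken from the course material: the logger name `pipeline`, the `ConfigError` class, the `read_db_config` helper, and the `[postgres]` section name are all hypothetical, and an actual Spark job would pass the resulting values on to its own reader or writer.

```python
import configparser
import logging

logging.basicConfig(
    level=logging.INFO,
    format="%(asctime)s %(levelname)s %(name)s: %(message)s",
)
logger = logging.getLogger("pipeline")  # hypothetical logger name


class ConfigError(Exception):
    """Raised when the properties file is missing or incomplete (hypothetical)."""


def read_db_config(path, section="postgres"):
    """Return the key/value pairs of one section of a properties file as a dict."""
    parser = configparser.ConfigParser()
    # ConfigParser.read returns the list of files it parsed successfully;
    # an empty list means the file was missing or unreadable.
    if not parser.read(path):
        logger.error("Config file not found: %s", path)
        raise ConfigError(f"cannot read {path}")
    if not parser.has_section(section):
        logger.error("Section [%s] missing in %s", section, path)
        raise ConfigError(f"section [{section}] not found in {path}")
    logger.info("Loaded section [%s] from %s", section, path)
    # All values come back as strings; convert types (e.g. port) at the call site.
    return dict(parser[section])
```

Raising a dedicated exception after logging lets the calling job decide whether to stop or fall back to defaults, and it gives a unit test one well-defined failure to assert on.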

    Prerequisites:

    • Basic programming skills

    • Basic database knowledge

    • Entry-level Hadoop knowledge