Google Cloud Professional Data Engineer Course [2019 Update]

Go to class
Write Review

Google Cloud Professional Data Engineer Course [2019 Update] provided by Udemy is a comprehensive online course, which lasts for 4-5 hours worth of material. Google Cloud Professional Data Engineer Course [2019 Update] is taught by Samuel Lee. Upon completion of the course, you can receive an e-certificate from Udemy. The course is taught in Englishand is Paid Course. Visit the course page at Udemy for detailed price information.

Overview
  • Take this course to prepare for the GCP Data Engineers Exam. Updated to reflect latest exam content.

    What you'll learn:

    • Understand major components of GCP, why and when to use its Products
    • Connect into GCP VMs using SSH
    • Build a dataset using BigQuery
    • Repeat command from BigQuery in Datalab and Plot a Graph
    • Dashboarding in Datastudio
    • Machine Learning and AI Fundamentals
    • Hadoop
    • Application of GCP Products in Real World Applications

    [UPDATED CONTENT 2019 Exam]

    Storage Solutions

    • OLAP vs OLTP databases

    • Consistency concepts.

    • Transactional consistency for various data storage solutions.

    Cloud Storage

    • Gsutil command line interface.

    Datastore

    • Datastore indexing - what is it, how to update, upload.

    BigQuery

    • Update of BigQuery practicals including authorised views in the new BQ UI.

    • Concepts of temporary tables.

    • Types of schemas BQ accepts.

    BigTable

    • BigTable fit for purpose of time-series data.

    • Cbt command line interface for BigTable.

    • BigTable consistency concepts and highly available configuration.

    Dataflow

    • Deploying dataflow jobs and what’s running in the background.

    • Dataflow job monitoring through console -> Cloud Dataflow Monitoring Interface and also gcloud dataflow commands.

    • Updating a dataflow streaming job on the fly.

    • Logging of Cloud Dataflow jobs.

    • Cloud Dataflow Practical - Running job locally and using Dataflow Service

    Hadoop & Dataproc

    • Apache Spark jobs

    Stackdriver

    • Export logs to BigQuery for further analysis, why and how.

    Machine Learning Solutions - New Section

    • Introduction of new GCP ML products and open source products such as Cloud Machine Learning Engine, BigQuery ML, Kubeflow & Spark ML

    • Cloud AutoML -> AutoML Vision, AutoML Vision Edge

    • Dialogflow - GCP’s Chatbot builder

    • Concept of edge computing and distributed computing

    • Google cloud’s TPU (Tensor Processing Unit)

    • Common terms in Machine Learning terminology such as features, labels, models, linear and logistic regression, classification, clustering/networks and supervised/unsupervised learning.

    Migration into GCP - New Section

    • How to migrate data into GCP - Transfer Appliance & Storage Transfer Service

    Dataprep - New Section

    • What is Dataprep?

    • Dataprep practical section, runs Dataflow job in background - nice interface for non-coders

    Security on GCP - New Section

    • Cloud security best practices

    • Securely interacting with Cloud Storage

    • Penetration testing

    • Bastion/Jumphost

    • Encryption

    • Data loss prevention api

    • Live migration

    Cloud Composer - New Section

    • What is cloud composer?

    Hi I’m Sam, a big data engineer, full stack web developer and machine learning/AI Enthusiast teaching you GCP in the most efficient and down to earth approach. I will teach you the core components of GCP required to pass the data engineers exam using a real world applications approach. All the practicals in this course show you techniques used by big data engineers on the GCP.

    Course is streamlined to aim to get you to pass the GCP Data Engineers Certification. Therefore, it is the most time efficient course to learn about GCP if you want to have a good understanding of GCP’s products and have the intention of becoming a certified data engineer in the future. The course is streamlined to under 5 hours! Learn all about GCP over a weekend or in a day !

    Infrastructure solutions will be presented for various use cases as you learn the most when solving real problems! Theory and Practicals will be placed to aim to pass the Data Engineers Exam with the shortest amount of time. In the exam most questions will be targeted on the why and not the how. For example you will be very hard pressed to find a question that asks you to choose the correct code snippet out of the 3 code snippets etc.

    Student Feedback:

    Hi Samuel. Hope this finds you well. I passed the GCP data engineering exam last week and just want to thank you for your Udemy course that summarises the exam materials so well! Have a good week ahead!

    The course is helpful for my preparation of Google Data Engineering Certification Exam. It also gives a good and brief overview of GCP products that is lacking in other courses. The knowledge gained from this course can be applied to using GCP in data scientist and data engineering work.

    I had tried coursera courses from google. It's too longer and has lots of marketing pitches. I like your approach. You should create another course like this for AWS or GCP architect.

    Course is split up into sections as below:

    Introduction - Explore questions, Why Cloud, Why GCP, main differentiators of GCP/main selling points, setup your free GCP account

    Compute Engine - Overview of compute engine and pricing innovations, zones & regions, various machine types and practical to spin up VMs and access them via SSH, Mac and Windows supported

    Storage Solutions - Overview of GCP’s data storage solutions including Cloud Storage, Cloud Datastore, Cloud Spanner, Cloud SQL, BigQuery & BigTable. We will compare these storage solutions with each other and explain use cases where one storage solution will excel over another.

    IAM & Billing - Different member types, roles and permissions, resource hierarchy and billing process

    BigQuery - BigQuery Pricing structure, tips for reducing processing cost, Partitioned & Wildcard tables, Authorised views, Practicals in BigQuery using standard SQL

    Cloud Datalab - How to use Cloud Datalab in a practical with a live feed from BigQuery to explore the dataset.

    Cloud Pub/Sub - Pub/Sub concepts and its components especially decoupling and the uses of Pub/Sub

    Hadoop & Dataproc - Overview of hadoop and major components which will be tested in the exam

    Cloud Dataflow - What is dataflow, the dataflow model, how and why its used with relation to other GCP Products

    Stackdriver - Stackdriver functions such as debugging, error reporting, monitoring, alerting, tracing and logging.

    Tensorflow & AI - Brief overview of machine learning and neural networks, play with neural networks with a playground and understand GCP’s AI products and APIs

    Case Study - Finally, Put your new learnt GCP knowledge to use in a real world application business case. Similar case studies will be present in the exams.