Analyzing Big Data with Hive

Go to class
Write Review

Free Online Course: Analyzing Big Data with Hive provided by LinkedIn Learning is a comprehensive online course, which lasts for 1-2 hours worth of material. The course is taught in English and is free of charge. Upon completion of the course, you can receive an e-certificate from LinkedIn Learning. Analyzing Big Data with Hive is taught by Ben Sullins.

Overview
  • Learn how to use Hive to analyze large datasets and derive information from Hadoop. Learn how to work with tables, structures, aggregations, clauses, functions, and more.

Syllabus
  • Introduction

    • Welcome
    • What you should know before watching this course
    • Using the exercise files
    1. Hive Concepts and Setup
    • Why use Hive
    • How Hive works
    • Setting up our demo environment
    2. Working with Data in Hive
    • Understanding table structures in Hive
    • Creating tables in Hive
    • Handling CSV files in Hive
    • Partitioning tables
    3. Retrieving Data from Hive
    • Simple SELECT statement
    • Retrieving data from complex structures
    4. Aggregating Data
    • Simple aggregations
    • Enhanced aggregations with grouping sets
    • Using CUBE and ROLLUP
    5. Filtering Results
    • Simple filter with the WHERE clause
    • Filtering aggregates with HAVING clause
    • Finding similar values with LIKE
    6. Joining Tables
    • Combining tables with JOIN
    • When to use SEMI JOIN
    • Joining multiple tables together
    7. Manipulating Data
    • Types of data manipulation functions
    • String functions
    • Math functions
    • Date functions
    • Conditional functions
    Conclusion
    • Next steps