Section 1 : Introduction
|
Lecture 1 | Introduction |
Section 2 : Prerequisites
|
Lecture 1 | Cluster Setup on Google Cloud | 00:05:24 Duration |
|
Lecture 2 | Data setup on the cluster | 00:06:38 Duration |
|
Lecture 3 | Data setup in Hive Metastore |
|
Lecture 4 | Understanding datasets provided for practice tests |
Section 3 : Let's Warm Up - Apache Spark Introduction
|
Lecture 1 | Introduction to Apache Spark | 00:06:45 Duration |
|
Lecture 2 | Resilient Distributed Datasets |
Section 4 : How to Transform, Stage, and Store the data
|
Lecture 1 | DataFrame abstraction in Spark | 00:04:26 Duration |
|
Lecture 2 | Working with CSV data | 00:26:29 Duration |
|
Lecture 3 | Working with JSON data | 00:08:43 Duration |
|
Lecture 4 | Working with Parquet data | 00:07:55 Duration |
|
Lecture 5 | Working with Avro data | 00:08:36 Duration |
|
Lecture 6 | Working with ORC data | 00:04:53 Duration |
|
Lecture 7 | Working with DataFrame Columns | 00:11:12 Duration |
|
Lecture 8 | Manipulating dates with Spark DataFrames | 00:10:19 Duration |
|
Lecture 9 | Manipulating strings with Spark DataFrames | 00:09:35 Duration |
|
Lecture 10 | Working with Hive Metastore |
Section 5 : How to do data analysis
|
Lecture 1 | Understanding GROUP BY and ORDER BY | 00:10:29 Duration |
|
Lecture 2 | Understanding Ranking functions |
|
Lecture 3 | Understanding Windowing functions | 00:10:06 Duration |