Modernizing Data Lakes and Data Warehouses with GCP

  • 4.7
Approx. 9 hours to complete

Course Summary

Learn how to design, build, and operate data warehouses and data lakes on Google Cloud Platform (GCP).

Key Learning Points

  • Learn how to design and build data warehouses and data lakes on GCP
  • Explore GCP tools for ingesting, transforming, and querying data
  • Gain hands-on experience with real-world scenarios and case studies

Related Topics for further study


Learning Outcomes

  • Design and build data warehouses and data lakes on GCP
  • Ingest and transform data using GCP tools
  • Query and analyze data in GCP

Prerequisites or good to have knowledge before taking this course

  • Basic understanding of data modeling and SQL
  • Familiarity with cloud computing concepts

Course Difficulty Level

Intermediate

Course Format

  • Online self-paced
  • Video lectures
  • Hands-on labs
  • Real-world scenarios

Similar Courses

  • Data Engineering, Big Data, and Machine Learning on GCP
  • Data Engineering on Google Cloud Platform

Related Education Paths


Notable People in This Field

  • Martin Fowler
  • Ken Collier

Related Books

Description

The two key components of any data pipeline are data lakes and warehouses. This course highlights use-cases for each type of storage and dives into the available data lake and warehouse solutions on Google Cloud Platform in technical detail. Also, this course describes the role of a data engineer, the benefits of a successful data pipeline to business operations, and examines why data engineering should be done in a cloud environment. Learners will get hands-on experience with data lakes and warehouses on Google Cloud Platform using QwikLabs.

Outline

  • Introduction
  • Course Introduction
  • Getting Started with Google Cloud and Qwiklabs
  • Introduction to Data Engineering
  • Explore the role of a data engineer
  • Analyze data engineering challenges
  • Intro to BigQuery
  • Data Lakes and Data Warehouses
  • Demo: Federated Queries with BigQuery
  • Transactional Databases vs Data Warehouses
  • Partner effectively with other data teams
  • Manage data access and governance
  • Demo: Finding PII in your dataset with DLP API
  • Build production-ready pipelines
  • Review GCP customer case study
  • Recap
  • Lab Intro: Using BigQuery to do Analysis
  • Introduction to Data Engineering
  • Building a Data Lake
  • Introduction to Data Lakes
  • Data Storage and ETL options on GCP
  • Building a Data Lake using Cloud Storage
  • Demo: Optimizing cost with Google Cloud Storage classes and Cloud Functions
  • Securing Cloud Storage
  • Storing All Sorts of Data Types
  • Demo: Running federated queries on Parquet and ORC files in BigQuery
  • Storing Relational Data in the Cloud
  • Cloud SQL as a relational Data Lake
  • Lab: Loading Taxi Data into Cloud SQL
  • Building a Data Lake
  • Building a data warehouse
  • The Modern Data Warehouse
  • Intro to BigQuery
  • Demo: Querying TB of Data in seconds
  • Getting Started
  • Loading Data
  • Lab Intro: Loading Data into BigQuery
  • Exploring Schemas
  • Demo: Exploring Schemas
  • Schema Design
  • Nested and Repeated Fields
  • Demo: Nested and Repeated Fields
  • Lab Intro: Working with JSON and Array Data in BigQuery
  • Optimizing with Partitioning and Clustering
  • Demo: Creating Partitioned Tables
  • Demo: Partitioning and Clustering
  • Preview: Transforming Batch and Streaming Data
  • Recap
  • Monitoring BigQuery usage and reservations
  • Intro to BigQuery System Table Reports
  • Demo: BigQuery System Tables with Data Studio
  • BigQuery Pricing
  • Building a Data Warehouse
  • Summary
  • Course Summary

Summary of User Reviews

Learn about data lakes and data warehouses on Google Cloud Platform (GCP) with this comprehensive course on Coursera. Students have praised the course for its practical approach to learning and comprehensive coverage of topics.

Key Aspect Users Liked About This Course

Practical approach to learning

Pros from User Reviews

  • Comprehensive coverage of topics
  • Hands-on exercises help to reinforce learning
  • Great for beginners to data lakes and data warehouses
  • Course materials are well-structured and easy to follow

Cons from User Reviews

  • Some users found the course too basic
  • Lack of advanced topics for experienced professionals
  • Course can be slow-paced at times
  • Not enough emphasis on real-world applications
  • Some users experienced technical difficulties with the platform
English
Available now
Approx. 9 hours to complete
Google Cloud Training
Google Cloud
Coursera

Instructor

Google Cloud Training

  • 4.7 Raiting
Share
Saved Course list
Cancel
Get Course Update
Computer Courses