Introduction to Designing Data Lakes on AWS

  • 4.7
Approx. 14 hours to complete

Course Summary

This course teaches the fundamentals of designing and implementing data lakes on AWS. You will learn how to build scalable and secure data lakes that can handle large amounts of data in various formats.

Key Learning Points

  • Learn how to design and implement data lakes on AWS
  • Understand how to store and manage data in various formats
  • Discover how to secure and manage access to your data lake

Related Topics for further study


Learning Outcomes

  • Design and create a data lake on AWS
  • Implement data management best practices
  • Secure and manage access to your data lake

Prerequisites or good to have knowledge before taking this course

  • Basic understanding of AWS services
  • Familiarity with data management concepts

Course Difficulty Level

Intermediate

Course Format

  • Self-paced
  • Video lectures
  • Hands-on exercises

Similar Courses

  • Data Warehousing for Business Intelligence
  • Data Engineering, Big Data, and Machine Learning on GCP

Related Education Paths


Related Books

Description

In this class, Introduction to Designing Data Lakes on AWS, we will help you understand how to create and operate a data lake in a secure and scalable way, without previous knowledge of data science! Starting with the "WHY" you may want a data lake, we will look at the Data-Lake value proposition, characteristics and components.

Knowledge

  • Where to start with a Data Lake?
  • How to build a secure and scalable Data Lake?
  • What are the common components of a Data Lake?
  • Why do you need a Data Lake and what it's value?

Outline

  • Week 1
  • Introduction to Designing Data Lakes in AWS
  • Meet the Instructors
  • Introduction to Week 1
  • Why Data Lakes
  • Characteristics of Data Lakes
  • Data Lakes Components
  • Comparison of a Data Lake to a Data Warehouse
  • Discussing Sample Data Lake Architectures
  • Course Welcome and Student Information
  • Pre-Course Survey
  • Data Lake Characteristics and Components
  • Data Lakes and Data Warehouses
  • Mid-Course Survey
  • Week 1 Quiz
  • Week 2
  • Introduction to Week 2
  • AWS Data Lake Related Services
  • Amazon S3
  • AWS Glue Data Catalog
  • AWS Services Used for Data Movement
  • AWS Services for Data Processing
  • AWS Services for Analytics
  • AWS Services for Predictive Analytics and Machine Learning
  • Introduction to AWS LakeFormation
  • Amazon S3 and Glue Data Catalog
  • Data Movement
  • EMR, Glue Jobs, Lambda, Kinesis Analytics, RedShift
  • AWS Lake Formation
  • Week 2 Quiz
  • Week 3
  • Introduction to Week 3
  • Use the Right Tool for the Job
  • Understanding Data Structure and When To Process Data
  • Data Streaming Ingestion With Kinesis Services
  • Batch Data Ingestion with AWS Transfer Family
  • Batch Data Ingestion with AWS Snow Family
  • Data Cataloging
  • Using Glue Crawlers
  • Reviewing the Ingestion Part in Data Lake Architectures
  • Diving Deep on Amazon Kinesis
  • Batch Data Ingestion with AWS Services
  • The Importance of Data Cataloging
  • Week 3 Quiz
  • Week 4
  • Introduction to Week 4
  • Data Prep and AWS Glue Jobs
  • File Optimizations
  • Using S3, Glue and Athena to Get Insights about NYC Taxi Data
  • Introduction to Data Lake Security
  • The Power of Data Visualization
  • Introduction to Amazon QuickSight
  • Amazon QuickSight Demo
  • Registry of Open Data on AWS
  • Course Wrap Up
  • Columnar Data Formats and Amazon Athena Optimizations
  • Security and Compliance
  • Data visualization, Amazon QuickSight
  • Registry of Open Data
  • Post-Course Survey
  • Week 4 Quiz
  • Final Assessment

Summary of User Reviews

Introduction to Designing Data Lakes in AWS is a highly recommended course with great content and valuable insights. Many users found the course to be informative and engaging, with clear explanations and practical examples of how to design data lakes in AWS.

Key Aspect Users Liked About This Course

The course provides practical examples and hands-on exercises, which help users to understand the concepts better.

Pros from User Reviews

  • Great content and valuable insights
  • Clear explanations and practical examples
  • Engaging and informative
  • Hands-on exercises for better understanding
  • Flexible schedule and self-paced learning

Cons from User Reviews

  • Some users found the course to be too basic
  • Not suitable for advanced users
  • Limited interaction with the instructor
  • No certification offered
  • Lacks depth in certain areas
English
Available now
Approx. 14 hours to complete
Morgan Willis, Rafael Lopes
Amazon Web Services
Coursera

Instructor

Morgan Willis

  • 4.7 Raiting
Share
Saved Course list
Cancel
Get Course Update
Computer Courses