Real Time Spark Project for Beginners: Hadoop, Spark, Docker

  • 4.5
6.5 hours on-demand video
$9.99

Brief Introduction

Building Real Time Data Pipeline Using Apache Kafka, Apache Spark, Hadoop, PostgreSQL, Django and Flexmonster on Docker

Description

  • In many data centers, servers of different types generate large amounts of data in real time (events; an event here is the status of a server in the data center).

  • This data must be processed in real time to generate insights for the people who monitor the servers and the data center; they track server status regularly and resolve issues as they occur, for better server stability.

  • Because the data is large and arrives in real time, we need to choose the right architecture, with scalable storage and computation frameworks/technologies.

  • Hence we build a real-time data pipeline using Apache Kafka, Apache Spark, Hadoop, PostgreSQL, Django and Flexmonster on Docker to generate insights from this data.

  • The Spark project/data pipeline is built using Apache Spark with Scala and PySpark on an Apache Hadoop cluster running on top of Docker.

  • Data visualization is built using the Django web framework and Flexmonster.
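To give a feel for the kind of per-event insight the pipeline computes, here is a minimal plain-Python sketch. The event schema and function name are assumptions for illustration only, not the course's actual code; in the course itself this aggregation would be done with PySpark or Scala on the Hadoop cluster, reading events from Kafka.

```python
import json
from collections import Counter

# Hypothetical event schema: each server emits its status as a JSON string.
events = [
    json.dumps({"server_type": "web", "server_id": i % 3, "status": s})
    for i, s in enumerate(["UP", "DOWN", "UP", "DOWN", "DOWN"])
]

def down_counts_per_type(raw_events):
    """Count DOWN events per server type -- the kind of aggregation
    a Spark job would perform over a window of the event stream."""
    counts = Counter()
    for raw in raw_events:
        event = json.loads(raw)
        if event["status"] == "DOWN":
            counts[event["server_type"]] += 1
    return dict(counts)

print(down_counts_per_type(events))  # {'web': 3}
```

In the real pipeline this logic runs continuously over the Kafka stream rather than over an in-memory list, and the results are written to PostgreSQL for the Django/Flexmonster dashboard.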

Requirements

  • Basic understanding of a programming language
  • Basic understanding of Apache Hadoop
  • Basic understanding of Apache Spark
English
Available now
Udemy

Instructor

PARI MARGU

  • 4.5 Rating