Big Data for Architects

Do you want a guide that will help you to pick the right Big Data technology for your project? Or do you want to get a solid understanding of the Big Data architecture and pipelines? This course will help you out.

After highlighting the course structure and learning objectives, the course will take you through the steps needed for setting up the environment. Next, you will understand the Big Data logical architecture, study the evolution of Big Data technologies, and explore Big Data pipelines. Moving along, you will become familiar with ingestion frameworks, such as Kafka, Flume, Nifi, and Sqoop. Next, you will learn about key storage frameworks, such as HDFS, HBase, Kudu, and Cassandra. Finally, you will go through the various data formats and uncover key data processing and data analysis frameworks.

By the end of this course, you will have a good understanding of the Big Data architecture and technologies and will have developed the skills to build real-world Big Data pipelines.

All the resources and support files for this course are available at https://github.com/PacktPublishing/Big-Data-for-Architects

Type

video

Category

publication date

2021-01-21

what you will learn

Create a Google account and a Dataproc cluster
Understand the Big Data architecture and pipelines
Study factors to consider while comparing ingestion frameworks
Gain a solid understanding of storage frameworks
Distinguish between text and binary data format
Find the key differences between the Spark, Tez, and Flink frameworks
Build a scalable Extract, Transform, Load (ETL) pipeline with Kafka Connect

duration

499

key features

Get a holistic picture of the Big Data ecosystem * Become an expert in choosing Big Data technology as per the requirements * Get ready to build end-to-end Big Data batch and streaming pipelines

approach

With the help of simple explanations, white-board sessions, and interesting activities, this course will make you familiar with the Big Data architecture and technologies. It will make you confident to design Big Data pipelines using modern frameworks.

audience

If you are a software engineer, who is looking to build Big Data pipelines or planning to appear for certifications such as CCA175 or CCA159, this video course is for you. A basic understanding of Big Data is needed to get started with this course.

meta description

Get to grips with big data technologies and work on real-world big data projects confidently

short description

This course will help you explore the world of Big Data technologies and frameworks. You will develop skills that will help you to pick the right Big Data technology and framework for your job and build the confidence to design robust Big Data pipelines.

subtitle

Learn how to build Big Data pipelines using Hadoop, Spark, NiFi, Kafka and many more

keywords

Big Data, Cloud, Apache Kafka, Spark, Pig, CCA175, CCA159

Product ISBN

9781801075596

ATLAS

ATLAS

Here to assist you. Please feel free to ask!

×

Hello and welcome! I'm ATLAS, your AI Learning Assistant. I’m here to provide expert answers and detailed explanations for your educational queries.