Azure Data Engineering Cookbook

Data engineering is one of the faster growing job areas as Data Engineers are the ones who ensure that the data is extracted, provisioned and the data is of the highest quality for data analysis. This book uses various Azure services to implement and maintain infrastructure to extract data from multiple sources, and then transform and load it for data analysis.
It takes you through different techniques for performing big data engineering using Microsoft Azure Data services. It begins by showing you how Azure Blob storage can be used for storing large amounts of unstructured data and how to use it for orchestrating a data workflow. You'll then work with different Cosmos DB APIs and Azure SQL Database. Moving on, you'll discover how to provision an Azure Synapse database and find out how to ingest and analyze data in Azure Synapse. As you advance, you'll cover the design and implementation of batch processing solutions using Azure Data Factory, and understand how to manage, maintain, and secure Azure Data Factory pipelines. You’ll also design and implement batch processing solutions using Azure Databricks and then manage and secure Azure Databricks clusters and jobs. In the concluding chapters, you'll learn how to process streaming data using Azure Stream Analytics and Data Explorer.
By the end of this Azure book, you'll have gained the knowledge you need to be able to orchestrate batch and real-time ETL workflows in Microsoft Azure.

Type
ebook
Category
publication date
2021-04-05
what you will learn

Use Azure Blob storage for storing large amounts of unstructured data
Perform CRUD operations on the Cosmos Table API
Implement elastic pools and business continuity with Azure SQL Database
Ingest and analyze data using Azure Synapse Analytics
Develop Data Factory data flows to extract data from multiple sources
Manage, maintain, and secure Azure Data Factory pipelines
Process streaming data using Azure Stream Analytics and Data Explorer

no of pages
454
duration
908
key features
Build highly efficient ETL pipelines using the Microsoft Azure Data services * Create and execute real-time processing solutions using Azure Databricks, Azure Stream Analytics, and Azure Data Explorer * Design and execute batch processing solutions using Azure Data Factory
approach
A problem-solution guide covering independent recipes on Data engineering tasks using different SQL and NoSQL databases.
audience
This book is for Data Engineers, Database administrators, Database developers, and extract, load, transform (ETL) developers looking to build expertise in Azure Data engineering using a recipe-based approach. Technical architects and database architects with experience in designing data or ETL applications either on-premise or on any other cloud vendor who wants to learn Azure Data engineering concepts will also find this book useful. Prior knowledge of Azure fundamentals and data engineering concepts is needed.
meta description
Over 90 recipes to help you orchestrate modern ETL/ELT workflows and perform analytics using Azure services more easily
short description
This book will help you design and implement modern ETL workflows along with data management, monitoring, and security aspects to meet the current organization's needs. You will use various services such as Azure Data Factory, Azure Databricks, Azure Stream Analytics, and Azure Data Explorer to design efficient data processing solutions.
subtitle
Design and implement batch and streaming analytics using Azure Cloud Services
keywords
Microsoft Azure, Azure Data Engineering, Data Engineering, Data Engineer, batch processing, Azure Databricks, Azure Stream Analytics, Azure Data Explorer
Product ISBN
9781800206557