top of page
DP-3012 | Implementing a Data Analytics Solution with Azure Synapse Analytics

DP-3012 | Implementing a Data Analytics Solution with Azure Synapse Analytics

 

This is a single day Instructor Lead Course designed to give the learners instruction on the SQL dedicated and serverless Spark pools and providing instruction of data wrangling and the ELT process using Synapse Pipelines which is very similar to those familiar with Azure Data Factory (ADF) to move data into the Synapse dedicated pool database.

 

Audience profile

The Audience should have familiarity with notebooks that use different languages and a Spark engine, such as Databricks, Jupyter Notebooks, Zeppelin notebooks and more. They should also have some experience with SQL, Python, and Azure tools, such as Data Factory.

 

Outline course

Module 1: Introduction to Azure Synapse Analytics

Learn about the features and capabilities of Azure Synapse Analytics - a cloud-based platform for big data processing and analysis.

  • Introduction
  • What is Azure Synapse Analytics
  • How Azure Synapse Analytics works
  • When to use Azure Synapse Analytics
  • Exercise - Explore Azure Synapse Analytics
  • Knowledge check
  • Summary

In this module, you'll learn how to:

  • Identify the business problems that Azure Synapse Analytics addresses.
  • Describe core capabilities of Azure Synapse Analytics.
  • Determine when to use Azure Synapse Analytics.

Module 2: Use Azure Synapse serverless SQL pool to query files in a data lake

With Azure Synapse serverless SQL pool, you can leverage your SQL skills to explore and analyze data in files, without the need to load the data into a relational database.

  • Introduction
  • Understand Azure Synapse serverless SQL pool capabilities and use cases
  • Query files using a serverless SQL pool
  • Create external database objects
  • Exercise - Query files using a serverless SQL pool
  • Knowledge check
  • Summary

After the completion of this module, you will be able to:

  • Identify capabilities and use cases for serverless SQL pools in Azure Synapse Analytics
  • Query CSV, JSON, and Parquet files using a serverless SQL pool
  • Create external database objects in a serverless SQL pool

Module 3: Analyze data with Apache Spark in Azure Synapse Analytics

Apache Spark is a core technology for large-scale data analytics. Learn how to use Spark in Azure Synapse Analytics to analyze and visualize data in a data lake.

  • Introduction
  • Get to know Apache Spark
  • Use Spark in Azure Synapse Analytics
  • Analyze data with Spark
  • Visualize data with Spark
  • Exercise - Analyze data with Spark
  • Knowledge check
  • Summary

After completing this module, you will be able to:

  • Identify core features and capabilities of Apache Spark.
  • Configure a Spark pool in Azure Synapse Analytics.
  • Run code to load, analyze, and visualize data in a Spark notebook.

Module 4: Use Delta Lake in Azure Synapse Analytics

Delta Lake is an open source relational storage area for Spark that you can use to implement a data lakehouse architecture in Azure Synapse Analytics.

  • Introduction
  • Understand Delta Lake
  • Create Delta Lake tables
  • Create catalog tables
  • Use Delta Lake with streaming data
  • Use Delta Lake in a SQL pool
  • Exercise - Use Delta Lake in Azure Synapse Analytics
  • Knowledge check
  • Summary

In this module, you'll learn how to:

  • Describe core features and capabilities of Delta Lake.
  • Create and use Delta Lake tables in a Synapse Analytics Spark pool.
  • Create Spark catalog tables for Delta Lake data.
  • Use Delta Lake tables for streaming data.
  • Query Delta Lake tables from a Synapse Analytics SQL pool.

Module 5: Analyze data in a relational data warehouse

Relational data warehouses are a core element of most enterprise Business Intelligence (BI) solutions, and are used as the basis for data models, reports, and analysis.

  • Introduction
  • Design a data warehouse schema
  • Create data warehouse tables
  • Load data warehouse tables
  • Query a data warehouse
  • Exercise - Explore a data warehouse
  • Knowledge check
  • Summary

In this module, you'll learn how to:

  • Design a schema for a relational data warehouse.
  • Create fact, dimension, and staging tables.
  • Use SQL to load data into data warehouse tables.
  • Use SQL to query relational data warehouse tables.

Module 6: Build a data pipeline in Azure Synapse Analytics

Pipelines are the lifeblood of a data analytics solution. Learn how to use Azure Synapse Analytics pipelines to build integrated data solutions that extract, transform, and load data across diverse systems.

  • Introduction
  • Understand pipelines in Azure Synapse Analytics
  • Create a pipeline in Azure Synapse Studio
  • Define data flows
  • Run a pipeline
  • Exercise - Build a data pipeline in Azure Synapse Analytics
  • Knowledge check
  • Summary

In this module, you will learn how to:

  • Describe core concepts for Azure Synapse Analytics pipelines.
  • Create a pipeline in Azure Synapse Studio.
  • Implement a data flow activity in a pipeline.
  • Initiate and monitor pipeline runs.

 

Descargue el temario para conocer el detalle completo de los contenidos.

 

Debido a las constantes actualizaciones de los contenidos de los cursos por parte del fabricante, el contenido de este temario puede variar con respecto al publicado en el sitio oficial, sin embargo, Netec siempre entregará la versión actualizada de éste.

DP-3012 | Implementing a Data Analytics Solution with Azure Synapse Analytics

SKU: MICROSOFT-DP-3012
bottom of page