Skip to content

Microsoft Azure Data Engineering Associate (DP-203) Study Guide

Menu
  • Contact Us
Menu

AZURE PURVIEW – Data Sources and Ingestion

Posted on 2022-01-252024-08-05 by Benjamin Goodwin

Azure Purview was introduced in Chapter 1. Azure Purview is very useful for governance, data discovery, and exploration. Keeping tabs on the data sources you have and what they contain is key to being able to securely manage them. There will be more on this in Chapter 8, which discusses data security and governance. When you click the Purview link, you are prompted to create an Azure Purview account or connect to one that you already have. You will do this later, at a more appropriate time.

Integration

The Triggers and the Integrated runtime features are directly related to pipelines, which is discussed in the section “Integrate,” which covers the features available when you select the Integrate hub option (refer to Figure 3.27).

You need a few things to make a pipeline work: something to trigger (invoke/execute) the pipeline and some compute to process the pipeline activity. Both of those needs are found in the Integration section on the Manage hub.

TRIGGERS

There are two ways to trigger a pipeline. You can do it manually, or you can schedule it. To manually trigger the pipeline in the Azure Synapse Analytics Studio (shown later), you can use REST API or Azure PowerShell. If you need the pipeline to run at scheduled intervals, for example, every hour, click the Triggers menu item followed by + New. If you have a Copy activity in your pipeline that ingests data from an ADLS container each hour, then you would want to schedule that here. Figure 3.36 shows an example of how that scheduled trigger configuration might look.

FIGUER 3.36 Azure Synapse Analytics integration triggers

There are four different types of triggers:

  • Schedule triggers
  • Tumbling window triggers
  • Storage event triggers
  • Custom event triggers

A tumbling window trigger has a few advantages over a scheduled trigger. For example, when you use a tumbling window trigger, you can run scheduled pipelines for past dates. This is not possible with scheduled triggers, which can only be run currently or at a future time. Failed pipeline runs triggered by a tumbling window trigger will be retried, whereas scheduled will not. Finally, tumbling window triggers can be bound only to a single pipeline, while a single scheduled trigger can invoke multiple pipelines, or multiple pipelines can be invoked by a single scheduled trigger. The storage event trigger is invoked when a blob is either created or deleted in the configured Azure Storage container. A custom event trigger provides the interface to subscribe to an Event Grid topic. An Event Grid is a product that acts as a middle tier between a message producer or a message queue and subscribers who want to receive those messages.

Leave a Reply Cancel reply

Your email address will not be published. Required fields are marked *

Archives

  • August 2024
  • July 2024
  • June 2024
  • May 2024
  • March 2024
  • February 2024
  • January 2024
  • December 2023
  • November 2023
  • October 2023
  • September 2023
  • August 2023
  • June 2023
  • May 2023
  • April 2023
  • March 2023
  • February 2023
  • January 2023
  • December 2022
  • November 2022
  • October 2022
  • July 2022
  • May 2022
  • March 2022
  • February 2022
  • January 2022
  • December 2021
  • November 2021
  • October 2021
  • September 2021
  • May 2021

Categories

  • ARM TEMPLATE
  • Create an Azure Data Factory
  • DATA EXPLORER POOLS
  • Design Analytical Stores
  • MANAGED PRIVATE ENDPOINTS
  • Microsoft DP-203
© 2025 Microsoft Azure Data Engineering Associate (DP-203) Study Guide All Rights Reserved