In this series we look at building a Streaming ETL with Azure Data Factory and CDC – Creating a Data Source Connection in Azure Data Factory. This is Part 5, The rest of the series is below.
- Enabling CDC
- Setting up Audit Tables
- Provisioning Azure Data Factory
- Provisioning Azure Blog Storage
- Create Data Source Connection in ADF
- Create Incremental Pipeline in ADF
- Create a Parameter Driven Pipeline
- Create a Rolling Trigger
This series uses the Adventureworks database. For more information on how to get that set up see my Youtube video for Downloading and Restoring the database.
- Create a new Dataset as a SQL Server.
data:image/s3,"s3://crabby-images/9abd4/9abd4a4aadc1eb3526c02407c5b8453c0b3ea950" alt=""
data:image/s3,"s3://crabby-images/60f99/60f99dce13d79e8bc90868c19e1fe3cce8a00f52" alt=""
- Name it DimProperty and select your Integrated Runtime for the local SQL server. For the table, select the CDC table.
data:image/s3,"s3://crabby-images/b7916/b79166da3120ccf7d903afd1a0367b7a85cd18b4" alt=""
- Create a new Dataset and this time select Azure Blob Storage and a DelimitedText.
data:image/s3,"s3://crabby-images/a781b/a781bda8ad06830382eb3a14c4e1ab764875e9e0" alt=""
data:image/s3,"s3://crabby-images/b1661/b1661211d2087f12351fb855d66350691f3cfe59" alt=""
- Name the csv_DimProperty and select New Linked Service.
data:image/s3,"s3://crabby-images/5d4b0/5d4b09148bde28dbeb10c669f9a8326a329fae81" alt=""
- Name the blob storage “DataLake” to match your storage account and point it to your storage account in the subscription.
data:image/s3,"s3://crabby-images/87aba/87aba6bc50a73bb2236b170f07d1ec3c8f6b46fb" alt=""
- Select the “datalake” from the file folder section or type it in and set first row as header and select ok to complete.
data:image/s3,"s3://crabby-images/b0873/b0873c32dc8d659814fc1f57aa178a7d7676690d" alt=""
- We should now have our two datasets in the resource’s sections for the property CDC transfer. Select Publish All at the top to save your changes
data:image/s3,"s3://crabby-images/373cf/373cf4fb78f005ed50d9f4fb6a8ab8de42085111" alt=""
Streaming ETL with Azure Data Factory and CDC – Creating a Data Source Connection in Azure Data Factory