Category: Big Data
-
Streaming ETL with Azure Data Factory and CDC – Enabling CDC
In this series we look at building a Streaming ETL with Azure Data Factory and CDC – Enabling CDC. This is Part 1, The rest of the series is below. Enabling CDC Setting up Audit Tables Provisioning Azure Data Factory Provisioning Azure Blog Storage Create Data Source Connection in ADF Create Incremental Pipeline in ADF…
-
Download free backgrounds to make your Power BI reports pop!
Download free backgrounds to make your Power BI reports pop! The difference between a good looking Power BI report and a great looking Power BI report is just a short extra mile using a custom background template. It is easy to create custom Backgrounds in Paint or Powerpoint to make your Power BI reports Pop!…
-
Modern Data Architecture – Part 9 – Load Data into Synapse Data Warehouse
Modern Data Architecture – Part 9 – Loading Data into Synapse Data Warehouse Now that we have provisioned a Synapses Data Analytics environment, we are now ready to begin loading data into this environment. In lab 7, we loaded a single table “sales.customer” to our data lake. To complete this lab, you will either need…
-
Modern Data Architecture – Part 8 – Provisioning a Synapsis SQL Data Warehouse
Modern Data Architecture – Part 8 – Provisioning a Synapsis SQL Data Warehouse Azure Synapse is a limitless analytics service that brings together enterprise data warehousing and Big Data analytics. It gives you the freedom to query data on your terms, using either serverless on-demand or provisioned resources—at scale. Azure Synapse brings these two worlds…
-
Modern Data Architecture – Part 7 – Staging Data into Data Lake
Modern Data Architecture – Part 7 – Staging Data into Data Lake In Lab 5, we demonstrated loaded data from our sample SQL server into our data lake. Although this tool is very handy, it creates a new service and set of tools that need to be provisioned and monitored. It is best to use…
-
Modern Data Architecture – Part 6 – Configuring and setting up Data Bricks
Modern Data Architecture – Part 6 – Configuring and setting up Data Bricks Azure Databricks is an Apache Spark-based analytics platform optimized for the Microsoft Azure cloud services platform. Databricks is an industry-leading, cloud-based data engineering tool used for processing and transforming massive quantities of data and exploring the data through machine learning models Azure…
-
Modern Data Architecture – Part 1 – Navigating the Azure Portal
Modern Data Architecture – Part 1 – Navigating the Azure Portal Logging into Azure and the Portal Open up a web browser and navigate to https://azure.microsoft.com/en-us/ and click on “Portal” to log in. You will be navigated to authentication portal to log in. You can either sign up for a new account which comes with…
-
Auto Spin up and Down Azure Virtual Machines (VM)
One of the best perks of moving your infrastructure to the cloud is the ability to only pay for the resources you use. When using Virtual Machines that usage can be even further limited to servers that stay on only when you need them (business hours or nightly data runs) and then shut down to…
-
Create a NodeJS Client Application to Submit Data to Event Hubs
Now that we have provisioned a Event Hub in Azure, let’s Create a NodeJS Client Application to Submit Data to Event Hubs. Prerequisites Visual Studio 2017 Install NodeJS SDK