1. Provisioning Azure Data Factory

The next piece of the puzzle is to fetch the CSV files from the tennis_wta repository on GitHub. For that we need to prepare another Azure resource – a data factory.

In the Azure portal, select + Create a resource in the upper left-hand corner, then search for data factory.

On the Create Data Factory page, in the Basics section, choose the resource group you’ve created in the previous posts, choose the same region as the resource group, and specify a globally unique name for the data factory. I will name it adf-wta. For Version, pick V2 (V1 will soon be deprecated).
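Besides being globally unique, the name has to follow Azure's naming rules for data factories. As a quick local sanity check before typing a name into the portal, here is a minimal Python sketch. The rule set is my paraphrase of the published naming rules (3 to 63 characters, only letters, digits and hyphens, every hyphen surrounded by a letter or digit), and the function name is made up for this post. It cannot tell you whether the name is already taken; only the portal can.

```python
import re

# Assumed rule set, paraphrased from Azure's naming rules for data factories:
# letters, digits and hyphens only; starts and ends with a letter or digit;
# every hyphen sits between two letters/digits (so no "--").
_NAME_PATTERN = re.compile(r"[A-Za-z0-9]+(?:-[A-Za-z0-9]+)*")


def is_plausible_factory_name(name: str) -> bool:
    """Return True if `name` passes the assumed length and character rules.

    This is a local format check only; global uniqueness is not verified.
    """
    return 3 <= len(name) <= 63 and _NAME_PATTERN.fullmatch(name) is not None


if __name__ == "__main__":
    for candidate in ["adf-wta", "adf--wta", "-adf", "adf_wta", "ad"]:
        print(candidate, is_plausible_factory_name(candidate))
```

The name used in this post, adf-wta, passes the check, while names with underscores, doubled hyphens, or a leading hyphen do not.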

Under the Git configuration section, choose Configure Git later. We will set up the code repository later, from the Data Factory portal.

Hit Review + create to review the settings.

Select Create to start provisioning the Azure Data Factory.
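If you would rather keep the provisioning step in code than click through the portal, the same factory can be described as an ARM template. The sketch below only builds the template as a Python dictionary and prints it as JSON; it does not deploy anything. The helper name is hypothetical, the region westeurope is a placeholder for whatever region your resource group uses, and the system-assigned identity block is my assumption about a sensible default rather than something taken from the portal steps above.

```python
import json


def build_factory_template(factory_name: str, location: str) -> dict:
    """Build a minimal ARM template describing one (V2) data factory.

    Hypothetical helper for illustration; it returns plain data and
    performs no calls against Azure.
    """
    return {
        "$schema": "https://schema.management.azure.com/schemas/2019-04-01/deploymentTemplate.json#",
        "contentVersion": "1.0.0.0",
        "resources": [
            {
                # Resource type and API version used for V2 data factories.
                "type": "Microsoft.DataFactory/factories",
                "apiVersion": "2018-06-01",
                "name": factory_name,
                "location": location,
                # Assumption: give the factory a system-assigned managed identity.
                "identity": {"type": "SystemAssigned"},
                "properties": {},
            }
        ],
    }


if __name__ == "__main__":
    # "westeurope" is a placeholder; use the region of your resource group.
    template = build_factory_template("adf-wta", "westeurope")
    print(json.dumps(template, indent=2))
```

Saving the printed JSON to a file gives you something you can put under version control and deploy to the resource group later, instead of repeating the portal steps by hand.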

What’s next
Now that we have provisioned the Azure Data Factory, we are ready to set up the data factory components and build a pipeline to fetch the CSV files. 🐳

Want to read more?
Microsoft learning resources and documentation:
Introduction to Azure Data Factory
Compare Azure Data Factory with Data Factory version 1