Exercise 2: Setup Azure Data Factory

    Synopsis: In this exercise, attendees will create a baseline environment for Azure Data Factory development for further operationalization of data movement and processing. They will create a Data Factory service and then install the Data Management Gateway which is the agent that facilitates data movement from on-premises to Microsoft Azure.

    This exercise has 4 tasks:

    If, for whatever reason, you cannot complete this lab whether due to time contraints or if you are not able to troubleshoot an issue, we have created a “get out of jail free” exercise. If you wish to use this exercise at any time, please proceed to . Please note that using this exercise will let you surpass all of the Azure Data Factory exercises. After completing Appendix B, you can continue to Exercise 5.

    Task 1: Connect to the Lab VM

    1. NOTE: If you are already connected to your Lab VM, skip to Task 2.
    2. From the left side of the Azure portal, click on All resources.
    3. In the Filter items… box, type in lab.
    4. Select your lab VM. Keep in mind the name of the virtual machine will begin with the “app name” you provided when setting up this workshop environment (in the prerequisite deployment).
    5. At the top of the blade for your VM, click on Connect.

    6. Download and open the RDP file.

    7. When the Remote Desktop Connection screen appears, check the Don’t ask me again… box and click on Connect button.

      Screenshot

    1. Once you have logged in to the Lab VM, open a web browser. A shortcut for Chrome is on the Desktop and Internet Explorer can be accessed from the Start Screen.
    2. Download the AdventureWorks sample CSV file. You can download it from and save it to a new folder called C:\Data.

    Task 3: Install and Configure Data Management Gateway on the Lab VM

    1. To download the latest version of Microsoft Data Management Gateway, go to .
    2. Click on the Download button from the middle of the webpage.
    3. Select the 64-bit version and click Next.

    4. Run the installer once downloaded.

    5. When you see the following screen, click on the Next button.

    6. Check the box to accept the terms and click Next button.

    7. Click Next.
    8. Click Install.
    9. Click Finish.
    10. Once the installation is complete, the following screen will appear; keep it open for now. Next, you will provision the Data Factory service in Azure and obtain the gateway key so we can connect Data Factory to this "on-premises" server.

      Screenshot

    1. Launch a new browser session and navigate to https://portal.azure.com. Once prompted, log in with your Microsoft Azure credentials. If prompted, choose whether your account is an organization account or a Microsoft Account. This will be based on which account was used to provision your Azure subscription that are using for these labs.
      • Note : You may need to launch an InPrivate/Incognito session in your browser if you have multiple Microsoft Accounts.

    Task 5: Create new Azure Data Factory Service

    1. From left top corner of the Azure Portal, click on +New.

    2. Select Intelligence + analytics , click on Data Factory.

      Screenshot

    3. Make sure to you have the right subscription selected.
    4. For Resource Group, choose Use Existing and select the Resource Group you created when deploying the workshop prerequisites.
    5. Select East US or West US for the region.
    6. Check the box Pin to dashboard and click on the Create button.

    7. Deployment of the ADF will take couple of minutes.

    8. Once it has completed, you will be taken to the Data Factory blade.
    9. Click on Author and deploy.

      Screenshot

    10. Click on New data gateway.

      Screenshot

    11. Type in a Data gateway name like [insert your initial here]-dmg (example jcho-dmg ).

    12. Click OK button.
    13. Click on Configure and copy out the NEW KEY.

    14. Don’t close the current browser session.

    15. Go back to the Remote Desktop session of the virtual machine.
    16. Paste the Key into the box in the middle of the screen.

      Screenshot

    17. Click on the Register button.

    18. It will take couple of minutes to register. If it takes more than couple minutes and the screen does not respond or get an error message, then close the screen by click on the Cancel button.
    19. You should see below screen with confirmation message.

    Next Exercise: Exercise 3 - Develop Data Factory Pipeline for Data Movement