Exercise 6: Visualizing in Power BI Desktop

    Synopsis: In this exercise, attendees will build a report in Power BI Desktop that uses the map visualization to illustrate the predicted delays. The data was originally scored using Machine Learning and then summarized in a table using Spark on HDInsight.

    This exercise has 4 tasks:

    1. NOTE: If you are already connected to your Lab VM, skip to .
    2. From the left side of the Azure portal, click on All resources.
    3. In the Filter items… box, type in lab.
    4. Select your lab VM. Keep in mind the name of the virtual machine will begin with the “app name” you provided when setting up this workshop environment (in the prerequisite deployment).
    5. At the top of the blade for your VM, click on Connect.

    6. Download and open the RDP file.

    7. When the Remote Desktop Connection screen appears, check the Don’t ask me again… box and click the Connect button.

      Screenshot

    8. Log in with the following credentials:

      • User name: cortana
    1. Launch Power BI Desktop using the shortcut on the Desktop of the Lab VM.
    2. Click on Get Data from the left side of the welcome window.

    3. In the new Get Data window, click Azure on the left and select Azure HDInsight Spark (Beta). Then click the Connect button in the bottom right corner.

      Screenshot

    4. Click Continue in the bottom right corner of the new window.

    5. Type or copy/paste the URL of your Azure HDInsight Spark cluster. You can find it in the browser address bar from when you connected to the cluster in Exercise 5. It should look something like this example: https://jcholab199spark.azurehdinsight.net.
    6. Click the OK button.
    7. When prompted, enter the cluster user name and password and then click Connect.

      • User name: cortana
      • Password: Password.1!!
    8. When the dialog box disappears, look for the flightdelaysummary table in the list and check the box next to it.

    9. Click the Load button in the bottom right corner of the window. It will take a couple of minutes to load the data into the Power BI Desktop client.
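
    An address copied from a browser address bar can carry extra whitespace or a trailing path. As a minimal sketch, and not part of the lab itself, the Python helper below reduces a pasted address to the https://<cluster>.azurehdinsight.net form shown in step 5. The function name normalize_cluster_url is hypothetical, and the sketch assumes the connector expects only the base cluster URL, as in the example above.

```python
from urllib.parse import urlparse

# Host suffix taken from the example URL in step 5.
CLUSTER_SUFFIX = ".azurehdinsight.net"


def normalize_cluster_url(raw: str) -> str:
    """Reduce a pasted address to https://<cluster>.azurehdinsight.net.

    Hypothetical helper for illustration only. Raises ValueError when the
    text does not look like an HDInsight cluster address.
    """
    text = raw.strip()
    if "://" not in text:
        # Allow a bare host name by assuming https.
        text = "https://" + text

    parsed = urlparse(text)
    if parsed.scheme != "https":
        raise ValueError("cluster URL must use https")

    host = (parsed.hostname or "").lower()
    # The host needs a non-empty cluster name in front of the suffix.
    if not host.endswith(CLUSTER_SUFFIX) or len(host) == len(CLUSTER_SUFFIX):
        raise ValueError("not an azurehdinsight.net cluster address")

    # Drop any path, query, or fragment copied from the address bar.
    return "https://" + host


if __name__ == "__main__":
    print(normalize_cluster_url(" https://jcholab199spark.azurehdinsight.net/ "))
```

    Checking the address before pasting it into the connector dialog avoids a failed connection attempt caused by a stray path or a missing scheme.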

    1. Once the data load is complete, you will find flightdelaysummary on the right side of the screen under the Fields area.

    2. From the Visualizations area, which is to the left of the Fields area, click the Globe icon to add a Map visualization to the report design surface.

      Screenshot

    3. With the Map visualization still selected, in the Fields area at right, expand the table called flightdelayssummary.

    4. Click and drag the field labeled OriginLatLong and drop it into the Location field located just below the Visualizations area.

      Screenshot

    5. Your map should look something like the following:

    6. Deselect the Map visual by clicking on the white space on the report page.

    7. From the Visualizations area, which is to the left of the Fields area, click the Stacked Column Chart icon to add a column chart visualization to the report design surface.

      Screenshot

    8. With the Stacked Column Chart visualization still selected, in the Fields area at right, expand the table called flightdelayssummary.

    9. Click and drag the field labeled Day and drop it into the Axis field located just below the Visualizations area.

      Screenshot

    10. Next, drag the field labeled AvgDelayProbability and drop it into the Value field.

    11. Grab the corner of the new Stacked Column Chart visual and drag it out until the visual is as wide as the bottom of your report design surface.

    12. Your report should look something like the following:

    13. Deselect the Stacked Column Chart visual by clicking on the white space on the report page.

    14. From the Visualizations area, which is to the left of the Fields area, click the Treemap icon to add this visualization to the report design surface.

      Screenshot

    15. With the Treemap visualization still selected, in the Fields area at right, expand the table called flightdelayssummary.

    16. Click and drag the field labeled OriginAirportCode and drop it into the Group field located just below the Visualizations area.

      Screenshot

    17. Next, drag the field labeled NumDelays and drop it into the Value field.

    18. Grab the corner of the new Treemap visual and drag it out until the visual is as wide as the top of your report design surface. Your report should look similar to the following:

      Screenshot

    19. You can cross filter the visualizations on the report by clicking one of the other visuals within the report, as shown below.
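
    To make the behavior of these visuals more concrete, the following Python sketch mimics what the column chart and treemap compute, and what selecting one airport does to the column chart. It is an illustration under stated assumptions, not how Power BI is implemented: the field names come from the steps above, the sample rows are invented, the helper names sum_by and cross_filter are hypothetical, and the sketch assumes each visual sums its Value field per category. The map visual is left out.

```python
from collections import defaultdict

# Invented sample rows; only the field names come from the exercise.
rows = [
    {"OriginAirportCode": "SEA", "Day": 1, "AvgDelayProbability": 0.25, "NumDelays": 10},
    {"OriginAirportCode": "SEA", "Day": 2, "AvgDelayProbability": 0.125, "NumDelays": 4},
    {"OriginAirportCode": "LAX", "Day": 1, "AvgDelayProbability": 0.5, "NumDelays": 20},
    {"OriginAirportCode": "LAX", "Day": 2, "AvgDelayProbability": 0.25, "NumDelays": 6},
]


def sum_by(data, category, value):
    """Sum the value field for each distinct category (assumed Sum aggregation)."""
    totals = defaultdict(float)
    for row in data:
        totals[row[category]] += row[value]
    return dict(totals)


def cross_filter(data, airport_code):
    """Keep only rows for the airport selected in another visual."""
    return [row for row in data if row["OriginAirportCode"] == airport_code]


# Column chart: Day on the axis, AvgDelayProbability as the value.
by_day = sum_by(rows, "Day", "AvgDelayProbability")

# Treemap: OriginAirportCode as the group, NumDelays as the value.
by_airport = sum_by(rows, "OriginAirportCode", "NumDelays")

# Selecting SEA in the treemap narrows the rows behind the column chart.
by_day_sea = sum_by(cross_filter(rows, "SEA"), "Day", "AvgDelayProbability")

if __name__ == "__main__":
    print(by_day)
    print(by_airport)
    print(by_day_sea)
```

    The point of the sketch is that every visual reads from the same table, so a selection in one visual simply narrows the rows that the others aggregate.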
