Microsoft Power BI is a business analytics service that provides interactive visualizations with self-service business intelligence capabilities, enabling end users to create reports and dashboards by themselves without having to depend on information technology staff or database administrators.
When you use Databricks as a data source with Power BI, you can bring the advantages of Databricks performance and technology beyond data scientists and data engineers to all business users.
You can connect Power BI Desktop to your Databricks clusters and Databricks SQL warehouses by using the built-in Azure Databricks connector. (Although the connector is labelled as Azure Databricks, it works with Databricks on Google Cloud as well.)
SQL warehouses are in Public Preview.
Power BI Desktop 2.85.681.0 or above. Download the latest version. To use data managed by Unity Catalog with Power BI, you must use Power BI Desktop 2.98.683.0 or above (October 2021 release).
Power BI Desktop requires Windows. An alternative for other operating systems is to run Power BI Desktop on a physical host or a Windows-based virtual machine and then connect to it from your operating system.
If you use a version of Power BI Desktop below 2.85.681.0, you also need to install the Databricks ODBC driver in the same environment as Power BI Desktop.
Get the Server Hostname and HTTP Path.
Start Power BI Desktop.
Click Get data or File > Get data.
Click Get data to get started.
Search for Databricks, choose the Azure Databricks connector, and click Connect.
Although the connector is named Azure Databricks, it works with Databricks on Google Cloud.
Enter the Server Hostname and HTTP Path.
Optionally, enter the default database and catalog to use for the connection.
Select your Data Connectivity mode. For information about the difference between Import and DirectQuery, see Use DirectQuery in Power BI Desktop.
Enter your authentication credentials:
Personal Access Token: Enter your Databricks personal access token from the Requirements.
Username / Password: Not applicable.
Azure Active Directory: Not applicable.
Select the Databricks data to query from the Power BI Navigator.
The Databricks connector provides the
Databricks.Query data source that allows a user to provide a custom SQL query. This feature will be available in the Power BI February 2022 release.
Follow the steps described in Connect with Power BI Desktop to create a connection, using Import as the data connectivity mode.
In the Navigator, right click the top-most item containing the selected host name and HTTP path and click Transform Data to open the Power Query Editor.
In the function bar, replace the function name
Databricks.Queryand apply the change. This creates a Power Query function that takes a SQL query as parameter.
Enter the desired SQL query in the parameter field and click Invoke. This executes the query and a new table is created with the query results as its contents.
Power BI Desktop version 2.104.941.0 and above (May 2022 release) has built-in support for detecting Windows system-wide HTTP proxy configuration. Download the latest version.
Power BI Desktop can automatically detect and use your Windows system-wide HTTP proxy configuration.
If the proxy server does not provide a CRL distribution point (CDP), Power BI may show the error message
Details: "ODBC: ERROR [HY000] [Microsoft][DriverSupport] (1200)
-The revocation status of the certificate or one of the certificates in the certificate chain is unknown.". To fix this error, complete the following steps:
Create the file
C:\Program Files\Microsoft Power BI Desktop\bin\ODBC Drivers\Simba Spark ODBC Driver\microsoft.sparkodbc.iniif it does not exist.
Add the following config to your
The Power BI Delta Sharing Connector allows users to discover, analyze and visualize datasets shared with them through the Delta Sharing open protocol. The protocol enables secure exchange of datasets across products and platforms by leveraging REST and cloud storage.
Power BI Desktop 2.99.621.0 or above. Download the latest version.
After you download the credentials file, open it with a text editor to retrieve the warehouse URL and the token.
To connect to Databricks by using the Delta Sharing Connector, complete the following steps:
Open Power BI Desktop.
On the Get Data menu, search for Delta Sharing.
Select the connector and click Connect.
Enter the warehouse URL that you copied from the credentials file into the Delta Sharing Server URL field.
Optionally, in the Advanced Options tab, set a Row Limit for the maximum number of rows that you can download. This is set to 1 million rows by default.
For Authentication, copy the token that you retrieved from the credentials file into Bearer Token.
The Databricks connector does not support web proxy.
In the Databricks connector, the
Databricks.Query data source is not supported in combination with DirectQuery mode.
The Delta Sharing Connector is not yet available in the Power BI service.
The data that the Delta Sharing Connector loads must fit into the memory of your machine. To ensure this, the connector limits the number of imported rows to the Row Limit that was set earlier.