The Query Data Source function allows you to bring your data into Explore Desktop from Databricks, Snowflake, a SQL database, a URL, or any data source with an ODBC (Open Database Connectivity) driver.
Configuring Data Source Connections
In order to load remote data from a connected data source or ODBC, you'll first need to set up your data source connections.
To manage your data source connections:
- Navigate to Settings on the top right of Explore Desktop.
- Click the Data Sources tab.
- Follow the instructions below based on your remote data source.
- Click Add New Connection from the Data Sources tab in Settings.
-
- Rename the Connection, if desired.
- Choose MySQL from the Connection Type drop-down.
- Enter the Port Number. (3306 is the default)
- Enter the Database Name.
- Enter the Database URI.
- Enter your MySQL Username.
- Enter your MySQL Password.
- Click the Test Connection button.
When your connection is created, you will see it listed as an option in the drop-down menu when loading data as a query data source.
- Click Add New Connection from the Data Sources tab in Settings.
-
- Rename the Connection, if desired.
- Choose SQLServer from the Connection Type drop-down.
- Enter the Port Number. (1433 is the default)
- Enter the Database Name.
- Enter the Database URI.
- Enter your SQLServer Username.
- Enter your SQLServer Password.
- Click the Test Connection button.
When your connection is created, you will see it listed as an option in the drop-down menu when loading data as a query data source.
- Click Add New Connection from the Data Sources tab in Settings.
-
- Rename the Connection, if desired.
- Choose Redshift from the Connection Type drop-down.
- Enter the Port Number. (5439 is the default)
- Enter the Database Name.
- Enter the Database URI.
- Enter your Redshift Username.
- Enter your Redshift Password.
- Click the Test Connection button.
When your connection is created, you will see it listed as an option in the drop-down menu when loading data as a query data source.
- Click Add New Connection from the Data Sources tab in Settings.
-
- Rename the Connection, if desired.
- Choose Postgres from the Connection Type drop-down.
- Enter the Port Number. (5432 is the default)
- Enter the Database Name.
- Enter the Database URI.
- Enter your Postgres Username.
- Enter your Postgres Password.
- Click the Test Connection button.
When your connection is created, you will see it listed as an option in the drop-down menu when loading data as a query data source.
- Click Add New Connection from the Data Sources tab in Settings.
-
- Rename the Connection, if desired.
- Choose Hadoop (Hive) from the Connection Type drop-down.
- Enter the Port Number. (10000 is the default)
- Enter the Database Name.
- Enter the Database URI.
- Enter your Hadoop (Hive) Username.
- Enter your Hadoop (Hive) Password.
- Click the Test Connection button.
When your connection is created, you will see it listed as an option in the drop-down menu when loading data as a query data source.
- Click Add New Connection from the Data Sources tab in Settings.
-
- Rename the Connection, if desired.
- Choose Hadoop (Impala) from the Connection Type drop-down.
- Enter the Port Number. (21050 is the default)
- Enter the Database Name.
- Enter the Database URI.
- Enter your Hadoop (Impala) Username.
- Enter your Hadoop (Impala) Password.
- Click the Test Connection button.
When your connection is created, you will see it listed as an option in the drop-down menu when loading data as a query data source.
Connecting directly to a Snowflake instance allows users to push down queries and transformations into Snowflake where the data resides, eliminating the need to extract data. This leverages the in-database data processing capabilities while maintaining compliance, security, and governance.
To add a Snowflake connection:
- Click Add New Connection from the Data Sources tab in Settings.
-
- Rename the Connection, if desired.
-
Select Snowflake from the Connection Type drop-down.
- Enter a Port Number. (443 is the default)
-
Enter a Role.
-
Enter a Warehouse, if desired.
-
Enter a Database Name.
-
Enter a Database URI.
- Choose whether to use SSO.
-
Enter your Snowflake Username
-
Enter your Snowflake Password.
-
Click the Test Connection button.
When your connection is created, you will see it listed as an option in the drop-down menu when loading data as a query data source.
Connecting directly to a Databricks instance allows users to push down queries and transformations into Databricks where the data resides, eliminating the need to extract data. This leverages the in-database data processing capabilities, including Spark, while maintaining compliance, security, and governance.
To add a Databricks connection:
- Click Add New Connection from the Data Sources tab in Settings.
-
- Rename the Connection, if desired.
-
Select Databricks from the Connection Type drop-down.
- Enter a Port Number. (443 is the default)
- Enter an HTTP Path.
- Enter a Schema.
- Enter a Unity Catalog, if desired.
- Note: If you are connecting to Unity Catalog, enter "token" as the Username and your Personal Access Token (PAT) as the Password.
- Enter a Database URI.
-
Enter your Databricks Username
-
Enter your Databricks Password.
-
Click the Test Connection button.
When your connection is created, you will see it listed as an option in the drop-down menu when loading data as a query data source.
- Click Add New Connection from the Data Sources tab in Settings.
-
- Rename the Connection, if desired.
- Choose BigQuery from the Connection Type drop-down.
- Enter the Project ID.
- Select whether or not to use ADC (Application Default Credentials).
- Choose the Query Priority.
- Enter the Client ID.
- Enter the Client Secret.
- Click the Test Connection button.
When your connection is created, you will see it listed as an option in the drop-down menu when loading data as a query data source.
You can also connect to other ODBC Databases that are not listed above. To do so:
- In the Connection Type dropdown, select ODBC.
- In the JSON field, enter any valid ODBC connection options. At a minimum, it must contain the “DSN” option, referencing a configured ODBC Data Source.
- Enter a username/password as applicable.
- Click Test Connection.
Creating an ODBC Driver Connection Using an ODCB Configuration File
- Install and configure the ODBC Driver for the data source to which you want to connect.
- Open the Microsoft ODBC Data Sources (64-bit) control panel. In this panel you can add a data source and specify a (DSN) Data Source Name in addition to various authentication options.
- Create the file C:\Users\<user>\Documents\Virtualitics\Settings\odbc.json. The JSON key/value pairs in this file can contain any valid ODBC connection options and can overwrite and/or supplement the configuration from the ODBC Data Sources (64-bit)panel. At a minimum, it must contain the DSN option, referencing a configured ODBC Data Source.
Loading Your Data
- Once you've configured your data source connection, in the left-side navigation bar under Files, click Remote to load your data.
- In the Query Data Source window that appears, choose from where you'd like to load your data in the Load From dropdown.
- Data Source Connection: Choose the connection that you created earlier in the Data Sources menu.
- URL: Enter the URL for the data you'd like to load.
- ODBC Configuration File: The JSON file that you created earlier will be used to connect and load your data.
- Click Connect.
The Dataset Preview window will appear with your remotely-loaded data.
Previous Article |
Next Article |