Skip to main content

Query Presto Cluster with Apache Superset

Each Ahana Compute Plane includes an instance of Apache Superset. For more information about Apache Superset's provisioning in the Ahana Compute Plane, see Apache Superset Information.

To configure Apache Superset to query your Ahana-managed Presto cluster, follow these steps.

Configure Apache Superset

Create a Presto User

  1. In the Ahana SaaS Console, create a new Presto user. Remember the username and password. See Create a Presto User.
  2. Add the new Presto user to the Ahana-managed Presto cluster. See Managing Presto Users.

Find Apache Superset Connection information

In the Ahana SaaS Console, select Clusters, select Manage for the cluster, then select Superset.

Apache Superset endpoints

Select the copy icon to copy the text string in Apache Superset connection.

Sign In to Apache Superset

In Ahana SaaS Console, select the link in step 1 adding a new database.

adding a new database

In Apache Superset, select Sign In.

Apache Superset Login

If an Ahana Sign In window is displayed, enter your Ahana credentials then select Sign In.

Enter the connection string in Apache Superset

In Apache Superset, select Data, then select Databases from the drop-down.

Select + Database to open the Add Database window.

In Add Database, configure Superset to connect to your Presto cluster.

Superset Add Database Name and SQLAlchemy URI

In the Connection tab, enter values for Database Name and SQLAlchemy URI as described.

Database Name: Enter a descriptive name. This name is a label used only in Superset. It does not need to match any other name.

SQLAlchemy URI: Enter the connection string that you copied from Apache Superset connection in your Ahana SaaS Console, then edit it for your Presto cluster.

The format of the SQLAlchemy URI is

presto://<username>:<password>@<host>/<catalog_name>

Edit the connection string copied from Ahana SaaS Console:

  • <username> and <password> - Replace <username> and <password> with the name and the password of the Presto user account you created in Create a Presto User.
  • <host> - The <host> part of the connection string copied from Apache Superset connection should not need editing. <host> includes the hostname and a port number separated by a colon. The default port in the connection string copied from Apache Superset connection is 443.
  • <catalog_name> - A catalog name is required. To connect to the Ahana-managed Hive Metastore of your Presto cluster, add /ahana_hive at the end of the connection string.

Enter the Apache Superset JSON configuration for the Presto cluster

Ahana provides cluster-specific configuration in JSON format for Apache Superset.

In Ahana SaaS Console, in Apache Superset extra configuration, select Copy Config.

Copy Config

In the Apache Superset Add Database window, select the Extra tab.

Delete the text in Extra, then paste into Extra the copied configuration from Ahana.

Apache Superset Extra

Test the Connection

In the Apache Superset Add Database window, select the Connection tab, then select Test Connection to test if Superset can connect to and get a response from the Presto cluster.

Enable SQL Lab and Performance Features

Select the SQL Lab Settings tab.

Select the following checkboxes in SQL Lab Settings to enable these features:

  • Expose in SQL Lab - Makes the new database visible in SQL Lab of Apache Superset.

  • Allow CREATE TABLE AS - Allow creation of new tables by user queries in SQL Lab.

  • Allow CREATE VIEW AS - Allow creation of new views by user queries in SQL Lab.

  • Allow DML - Allow users to run non-SELECT statements such as UPDATE, DELETE, or CREATE in SQL Lab.

Select the Performance tab, then select the Asynchronous query execution checkbox.

Save the Database Configuration

Select Add to save the database configuration.

Query Presto with Apache Superset

Use SQL Lab in Apache Superset to query your Ahana-managed Presto cluster.

Open SQL Editor

In Apache Superset, select SQL Lab, then select SQL Editor.

Apache Superset SQL Editor

Select the Presto cluster

In the left of the window, use Database to select the database that you configured.

Select Presto cluster in SQL Editor

Query the Presto cluster

In the left of the window, select the schema and the table to query, or in the query editor field on the right enter a query, then select Run.

Select Presto cluster in SQL Editor