Getting started with Hopsworks.ai (Azure)

This guide describes in detail each step of creating a cluster in Hopsworks.ai.

Step 1 starting to create a cluster

In Hopsworks.ai, select Create cluster:

Create a Hopsworks cluster

Step 2 setting the General information

Select the Resource Group (1) in which you created your storage account and user assigned managed identity (see above).

Note

If the Resource Group does not appear in the drop-down, make sure that you properly created and set the custom role for this resource group.

Name your cluster (2). Your cluster will be deployed in the Location of your Resource Group (3).

Select the Instance type (4) and Local storage (5) size for the cluster Head node.

Select the number of workers you want to start the cluster with (6). Select the Instance type (7) and Local storage size (8) for the worker nodes.

Note

It is possible to add or remove workers once the cluster is running.

To make it possible to add and remove workers on demand, Hopsworks clusters deployed by Hopsworks.ai store their data in an Azure storage container. In this step, you select which storage account and container to use for this purpose. Select the storage account (9) you want to use in Azure Storage account name. The name of the container in which the data will be stored is displayed in Azure Container name (10). You can change this name. For more details on how to create and configure storage in Azure, refer to Creating and configuring a storage.

Note

You can use a container that already exists in your storage account by entering its name, but you must first make sure that this container is empty.
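If you change the container name, it has to follow Azure's container naming rules. The sketch below is an illustration that is not part of Hopsworks.ai: it checks a candidate name locally, assuming the rules Azure documents for blob containers (3 to 63 characters; lowercase letters, digits, and hyphens only; a letter or digit at each end; no consecutive hyphens). The function name `is_valid_container_name` is hypothetical, and the check does not verify that an existing container is empty.

```python
import re

# Assumed rules, taken from Azure's documented container naming constraints:
# 3-63 characters, lowercase letters, digits, and hyphens only, a letter or
# digit at both ends, and never two hyphens in a row.
_CONTAINER_NAME = re.compile(r"[a-z0-9](?:-?[a-z0-9])*")


def is_valid_container_name(name: str) -> bool:
    """Return True if `name` satisfies the naming rules listed above."""
    return 3 <= len(name) <= 63 and _CONTAINER_NAME.fullmatch(name) is not None


if __name__ == "__main__":
    # Hypothetical candidate names, for illustration only.
    for candidate in ["hopsworks-data", "Hopsworks", "a--b", "ab"]:
        print(candidate, is_valid_container_name(candidate))
```

Running such a check before pressing Next can catch a rejected name early; the Hopsworks.ai form may apply its own validation as well.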

Press Next (11):

General configuration

Step 3 select an SSH key

When deploying clusters, Hopsworks.ai installs an SSH key on the cluster's instances so that you can access them if necessary. Select the SSH key that you want to use to access cluster instances. For more details on how to add an SSH key in Azure, refer to Adding a ssh key to your resource group.

Choose SSH key

Step 4 select the User assigned managed identity

To let the cluster instances access the Azure storage, a User assigned managed identity must be attached to the virtual machines. In this step, you choose which identity to use. This identity needs to have access rights to the storage account you selected in Step 2. For more information about how to create this identity and give it access to the storage account, refer to Creating and configuring a storage:

Choose the User assigned managed identity

Step 5 set the backup retention policy

Note

This step is only accessible to enterprise users.

To back up the Azure blob storage data when taking a cluster backup, a retention policy must be set for the blob storage. In this step, you choose the retention period in days. You can deactivate the retention policy by setting this value to 0, but this will prevent you from taking any backup of your cluster.

Choose the backup retention policy

Step 6 Virtual network selection

In this step, you can select the virtual network which will be used by the Hopsworks cluster. You can either select an existing virtual network or let Hopsworks.ai create one for you. If you decide to let Hopsworks.ai create the virtual network for you, you can choose the CIDR block for this virtual network. Refer to Create a virtual network and subnet for more details on how to create your own virtual network in Azure.
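When you let Hopsworks.ai create the virtual network, it can help to inspect the CIDR block before entering it. The following is a minimal sketch using Python's standard `ipaddress` module; the helper name `describe_cidr` and the block `10.0.0.0/16` are illustrative assumptions, not values required by Hopsworks.ai.

```python
import ipaddress


def describe_cidr(cidr: str) -> dict:
    """Parse a CIDR block and report its basic properties.

    ip_network() is strict by default, so a block with host bits set
    (for example 10.0.0.1/16) raises ValueError instead of being accepted.
    """
    network = ipaddress.ip_network(cidr)
    return {
        "network": str(network),
        "prefix_length": network.prefixlen,
        "total_addresses": network.num_addresses,
        "is_private": network.is_private,
    }


if __name__ == "__main__":
    # Hypothetical example block for a new virtual network.
    print(describe_cidr("10.0.0.0/16"))
```

A block taken from a private range that does not overlap with networks you may later peer with is usually the safer choice.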

Choose virtual network

Step 7 Subnet selection

If you selected an existing virtual network in the previous step, this step lets you select which subnet of this virtual network to use. For more information about creating your own subnet refer to Create a virtual network and subnet.

If you did not select an existing virtual network in the previous step, Hopsworks.ai will create the subnet for you. You can choose the CIDR block this subnet will use.
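The subnet's CIDR block has to fall inside the virtual network's CIDR block. The sketch below checks this with the standard `ipaddress` module and estimates how many addresses remain for instances, assuming Azure's documented behavior of reserving five addresses in every subnet. The function name `usable_subnet_addresses` and the example blocks are hypothetical.

```python
import ipaddress

# Assumption: Azure reserves 5 addresses in each subnet
# (the first four and the last one).
AZURE_RESERVED_PER_SUBNET = 5


def usable_subnet_addresses(vnet_cidr: str, subnet_cidr: str) -> int:
    """Check that the subnet lies inside the virtual network and
    return the number of addresses left after Azure's reservation."""
    vnet = ipaddress.ip_network(vnet_cidr)
    subnet = ipaddress.ip_network(subnet_cidr)
    if not subnet.subnet_of(vnet):
        raise ValueError(f"{subnet} is not contained in {vnet}")
    return max(subnet.num_addresses - AZURE_RESERVED_PER_SUBNET, 0)


if __name__ == "__main__":
    # Hypothetical blocks: a /24 subnet inside a /16 virtual network.
    print(usable_subnet_addresses("10.0.0.0/16", "10.0.1.0/24"))  # 251
```

Choose a subnet large enough for the head node plus the maximum number of workers you expect to add later.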

Choose subnet

Step 8 Network Security group selection

In this step, you can select the network security group you want to use to manage the inbound and outbound network rules. You can either let Hopsworks.ai create a network security group for you or select an existing security group. For more information about how to create your own network security group in Azure refer to Create a network security group.

Choose security group

Step 9 User management selection

In this step, you can choose which user management system to use. You have three choices:

  • Managed: Hopsworks.ai automatically adds and removes users from the Hopsworks cluster when you add and remove users from your organization.
  • LDAP: integrate the cluster with your organization's LDAP/ActiveDirectory server.
  • Disabled: lets you manage users manually from within Hopsworks.

Choose user management type

Step 10 add tags to your instances

In this step, you can define tags that will be added to the cluster virtual machines.
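Tags are name/value pairs, and Azure places limits on them. The sketch below checks a set of tags before you type them into the form. The limits it uses (at most 50 tags per resource, names up to 512 characters, values up to 256 characters, and no `<`, `>`, `%`, `&`, `\`, `?`, or `/` in names) are assumptions based on Azure's general tagging documentation; verify them against the current Azure documentation. The function name `tag_problems` and the example tags are hypothetical.

```python
from typing import Dict, List

# Assumed general Azure tag limits; verify against current Azure docs.
MAX_TAGS_PER_RESOURCE = 50
MAX_NAME_LENGTH = 512
MAX_VALUE_LENGTH = 256
FORBIDDEN_NAME_CHARS = set("<>%&\\?/")


def tag_problems(tags: Dict[str, str]) -> List[str]:
    """Return a list of human-readable problems; an empty list means
    the tags pass every check in this sketch."""
    problems = []
    if len(tags) > MAX_TAGS_PER_RESOURCE:
        problems.append(f"too many tags: {len(tags)}")
    for name, value in tags.items():
        if not name:
            problems.append("empty tag name")
        if len(name) > MAX_NAME_LENGTH:
            problems.append(f"tag name too long: {name[:20]}...")
        bad = sorted(FORBIDDEN_NAME_CHARS & set(name))
        if bad:
            problems.append(f"tag name {name!r} contains {''.join(bad)}")
        if len(value) > MAX_VALUE_LENGTH:
            problems.append(f"value of tag {name!r} is too long")
    return problems


if __name__ == "__main__":
    # Hypothetical tags for the cluster virtual machines.
    print(tag_problems({"project": "feature-store", "owner": "data-team"}))
```

An empty result means the tags passed these checks; it does not guarantee that Azure or Hopsworks.ai will accept them.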

Add tags

Step 11 Review and create

Review all information and select Create:

Review cluster information

The cluster will start. This will take a few minutes:

Booting Hopsworks cluster

As soon as the cluster has started, you will be able to log in to your new Hopsworks cluster with the username and password provided. You will also be able to stop, restart, or terminate the cluster.

Running Hopsworks cluster