Plan for Data Science Experience Local

Before you install DSX Local, you must set up the hardware and software for its private cloud architecture.

Decide your node configuration

Before you install DSX Local, decide how many nodes to use in the cluster. Base the choice on the type of workloads that will run and the number of users who will use the cluster. Clusters scale out easily because you can add compute and deployment (production compute) nodes during or after installation. Deployment nodes are the production versions of the compute nodes and have identical requirements.

Restriction: You cannot add control nodes to any cluster type after installation, so size your control nodes to support projected growth.
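The scaling rule and the restriction above can be sketched as a small Python model. This is only an illustration and not part of DSX Local: the `Cluster` class, its fields, and the `add_nodes` method are hypothetical names invented for this example, and the model counts nodes by role without representing shared control/compute nodes.

```python
from dataclasses import dataclass


@dataclass
class Cluster:
    """Hypothetical model of a cluster layout (not a DSX Local API)."""
    control: int
    compute: int
    deploy: int = 0
    installed: bool = False  # True once installation has finished

    def add_nodes(self, role: str, count: int = 1) -> None:
        """Add `count` nodes of `role`, enforcing the documented restriction."""
        if role not in ("control", "compute", "deploy"):
            raise ValueError(f"unknown role: {role}")
        if count < 1:
            raise ValueError("count must be at least 1")
        # Control nodes cannot be added once installation is complete.
        if role == "control" and self.installed:
            raise ValueError("control nodes cannot be added after installation")
        # Compute and deployment nodes can be added during or after installation.
        setattr(self, role, getattr(self, role) + count)
```

For example, calling `add_nodes("compute", 2)` on an installed cluster succeeds, while `add_nodes("control")` on the same cluster raises `ValueError`.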

The most basic configuration is a three-node installation in which every node acts as both a control node and a compute node. You can scale the three-node cluster by adding compute or deployment nodes.

Five node diagram

For a larger production cluster (seven or more nodes), the recommended configuration is three control nodes, three compute nodes, and one or two deployment nodes. You can add compute and deployment nodes to scale out the cluster during or after installation.

Eight node diagram

Common configuration examples

The following table shows examples of common configurations. Choose the node breakdown that fits your requirements.

Cluster type   Node breakdown                        Notes
3 nodes        3 shared control/compute              Unable to deploy assets.
4 nodes        3 shared control/compute + 1 deploy
7 nodes        3 control + 3 compute + 1 deploy
8 nodes        3 control + 3 compute + 2 deploy
11 nodes       3 control + 6 compute + 2 deploy

Note that asset deployment requires at least one deployment node.
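As a planning aid, the table above can be written down as data and checked against the deployment rule. The sketch below is illustrative only: the dictionary layout and the helper names `total_nodes` and `can_deploy_assets` are assumptions made for this example, not anything provided by DSX Local.

```python
# Node breakdowns copied from the table above, keyed by cluster size.
# "shared" counts nodes that act as both control and compute nodes.
COMMON_CONFIGS = {
    3: {"shared": 3, "control": 0, "compute": 0, "deploy": 0},
    4: {"shared": 3, "control": 0, "compute": 0, "deploy": 1},
    7: {"shared": 0, "control": 3, "compute": 3, "deploy": 1},
    8: {"shared": 0, "control": 3, "compute": 3, "deploy": 2},
    11: {"shared": 0, "control": 3, "compute": 6, "deploy": 2},
}


def total_nodes(config: dict) -> int:
    """Return the number of physical nodes in a breakdown."""
    return sum(config.values())


def can_deploy_assets(config: dict) -> bool:
    """Asset deployment requires at least one deployment node."""
    return config["deploy"] >= 1


# Sanity check: each breakdown adds up to its cluster size.
for size, config in COMMON_CONFIGS.items():
    assert total_nodes(config) == size
```

With this data, `can_deploy_assets(COMMON_CONFIGS[3])` is `False`, which matches the note on the three-node row, and it is `True` for every other listed configuration.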