Install JobSet
This page describes how to install Alauda Build of JobSet on Alauda Container Platform through the Operator Hub. JobSet is shipped as a Helm-based operator: installing it from the Operator Hub deploys the jobset-operator controller; creating a JobSetOperatorCtl instance then deploys the JobSet controller and webhook that reconcile JobSet custom resources.
Upload the Operator Package
Download the JobSet Operator package, for example jobset-operator.ALL.xxxx.tgz.
Use violet to upload the package to the platform repository:
Install the Operator
In Administrator view:
- Go to
Marketplace/Operator Hub. - Select the destination cluster.
- Search for Alauda Build of JobSet (the package name in the marketplace is
jobset-operator). - Click Install.
- Keep the default installation settings. The operator installs into the
jobset-systemnamespace by default. - Complete the installation.
Confirm the operator controller is running:
The jobset-operator pod should be in Running state, for example:
Create a JobSetOperatorCtl Instance
Installing the operator alone does not deploy the JobSet controller — the operator reconciles JobSetOperatorCtl resources and only deploys the JobSet controller and webhook when one exists.
Save the following as jobsetoperatorctl.yaml:
Apply it:
After the operator reconciles the resource, the JobSet controller pod is deployed in the same namespace:
Expected output (the operator pod and the JobSet controller pod side by side):
The JobSet CRD is registered by the operator and can be verified:
Next Steps
Once the JobSet controller is running, JobSet custom resources can be submitted to run distributed AI/ML and HPC workloads.
Continue with the Quickstart to run a simple JobSet demo.