Install KServe
TOC
PrerequisitesRequired DependenciesInstallation NotesUpload OperatorInstall OperatorVerificationCreate KServe InstanceVerificationEnvoy Gateway ConfigurationEnvoy AI Gateway ConfigurationKServe Gateway ConfigurationGIE (gateway-api-inference-extension) ConfigurationUpgrading Alauda Build of KServeVerificationPrerequisites
Before installing Alauda Build of KServe, you need to ensure the following dependencies are installed:
Required Dependencies
Alauda build of Envoy Gateway is natively integrated into ACP 4.2. For environments running earlier versions (including ACP 4.0 and 4.1), please contact Customer Support for compatibility and installation guidance.
Installation Notes
- Required Dependencies: All required dependencies must be installed before installing Alauda Build of KServe.
- GIE Integration: GIE is bundled and enabled by default. If your environment already has GIE installed separately, set
gie.builtIntofalsein the operator configuration to disable the built-in installation.
Upload Operator
Download the Alauda Build of KServe Operator installation file (e.g., kserve-operator.ALL.xxxx.tgz).
Use the violet command to publish it to the platform repository:
Install Operator
In Administrator view:
-
Click Marketplace / OperatorHub.
-
At the top of the console, from the Cluster dropdown list, select the destination cluster where you want to install the KServe Operator.
-
Search for and select Alauda Build of KServe, then click Install.
Install Alauda Build of KServe window will pop up.
-
Leave Channel unchanged.
-
Check whether the Version matches the Alauda Build of KServe version you want to install.
-
Leave Installation Location unchanged, it should be
kserve-operatorby default. -
Select Manual for Upgrade Strategy.
-
Click Install.
Verification
Confirm that the Alauda Build of KServe tile shows one of the following states:
Installing: installation is in progress; wait for this to change toInstalled.Installed: installation is complete.
Create KServe Instance
After the operator is installed, create a KServe custom resource to deploy the KServe instance.
Switch to YAML view and apply the following configuration, then adjust the callout fields for your environment:
spec.namespace— Kubernetes namespace where KServe components are deployed. Default:kserve.global.clusterName— Cluster name as registered in the platform. Example:business-1.global.deployFlavor—single-nodefor non-HA,ha-clusterfor production HA.global.platformAddress— Alauda Container Platform management endpoint address. Example:https://192.168.131.112.preset.GIE— Built-in Gateway API Inference Extension for enhanced AI capabilities. See GIE Configuration.preset.envoy_ai_gateway— AI-specific gateway for intelligent routing and policy enforcement. See Envoy AI Gateway Configuration.preset.envoy_gateway— Underlying Envoy-based gateway infrastructure. See Envoy Gateway Configuration.preset.kserve_gateway— Ingress gateway for KServe inference services. See KServe Gateway Configuration.global.registry.address— The container registry endpoint used by the target cluster (global.clusterName) to pull KServe infrastructure and runtime images. Example:registry.alauda.cn:60070.kserve.controller.deploymentMode— Set toKnativefor serverless features like scale-to-zero, orStandardfor native Kubernetes deploymentskserve.controller.gateway.domain— Domain for the ingress gateway to expose inference service endpoints. Use a wildcard domain, e.g.,*.example.com.kserve.storage.caBundleConfigMapName— ConfigMap name containing the CA bundle for storage connections.
Verification
Check the status of the KServe resource:
The instance is ready when the status shows DEPLOYED: True.
Envoy Gateway Configuration
Envoy AI Gateway Configuration
KServe Gateway Configuration
GIE (gateway-api-inference-extension) Configuration
Upgrading Alauda Build of KServe
- Upload the new version of the Alauda Build of KServe operator package using the
violettool. - Go to the
Administrator->Marketplace->OperatorHubpage, find Alauda Build of KServe, and click Confirm to apply the new version.
Verification
After upgrading, confirm that the Alauda Build of KServe tile shows Installed and verify the KServe instance status: