A RayService runs a Ray Serve application on a managed RayCluster. The KubeRay operator handles rolling upgrades, health checks, and zero-downtime updates.
Edit serveConfigV2 (for example, increase num_replicas) and re-apply the manifest. KubeRay performs a zero-downtime rolling update: it brings up new replicas, drains the old ones, and switches traffic only once the new version is healthy.
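The edit step above can be scripted. The following is a minimal sketch, not an official KubeRay tool: it uses only the Python standard library and assumes the serveConfigV2 string is stored as JSON (which is also valid YAML). The names `my-rayservice`, `my_app`, `my_module:app`, and `MyDeployment` are hypothetical placeholders, and the manifest omits rayClusterConfig for brevity, so it is not complete enough to apply as-is.

```python
import copy
import json

# Hypothetical, incomplete RayService manifest (rayClusterConfig omitted).
# serveConfigV2 is a string field; here it holds JSON, which is valid YAML.
manifest = {
    "apiVersion": "ray.io/v1",
    "kind": "RayService",
    "metadata": {"name": "my-rayservice"},
    "spec": {
        "serveConfigV2": json.dumps({
            "applications": [{
                "name": "my_app",
                "import_path": "my_module:app",
                "route_prefix": "/",
                "deployments": [{"name": "MyDeployment", "num_replicas": 1}],
            }]
        }),
    },
}


def bump_replicas(manifest, app_name, deployment_name, num_replicas):
    """Return a copy of the manifest with num_replicas set for one deployment."""
    updated = copy.deepcopy(manifest)
    serve_config = json.loads(updated["spec"]["serveConfigV2"])
    for app in serve_config["applications"]:
        if app["name"] != app_name:
            continue
        for deployment in app["deployments"]:
            if deployment["name"] == deployment_name:
                deployment["num_replicas"] = num_replicas
                # Write the edited config back into the string field.
                updated["spec"]["serveConfigV2"] = json.dumps(serve_config, indent=2)
                return updated
    raise KeyError(f"{app_name}/{deployment_name} not found in serveConfigV2")


new_manifest = bump_replicas(manifest, "my_app", "MyDeployment", 3)
```

Writing `json.dumps(new_manifest)` to a file and re-applying it with kubectl would then correspond to the re-apply step described above.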
The operator polls the Serve health endpoint. If a deployment stays unhealthy for longer than deploymentUnhealthySecondThreshold seconds, the operator triggers a rollback to the previous stable version.
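The threshold check described above can be illustrated with a small state tracker. This is a sketch of the general idea only, not KubeRay's actual implementation: the class `UnhealthyTracker`, its `observe` method, and the 300-second value are hypothetical and chosen for illustration.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass
class UnhealthyTracker:
    """Tracks how long a deployment has been continuously unhealthy."""

    threshold_seconds: float
    unhealthy_since: Optional[float] = None

    def observe(self, healthy: bool, now: float) -> bool:
        """Record one poll result.

        Returns True once the current unhealthy streak has lasted longer
        than the threshold, i.e. when corrective action would be triggered.
        """
        if healthy:
            # Any healthy poll resets the streak.
            self.unhealthy_since = None
            return False
        if self.unhealthy_since is None:
            self.unhealthy_since = now
        return (now - self.unhealthy_since) > self.threshold_seconds


# Hypothetical threshold of 300 seconds; timestamps are seconds on a shared clock.
tracker = UnhealthyTracker(threshold_seconds=300)
polls = [(False, 0), (False, 200), (True, 250), (False, 260), (False, 561)]
results = [tracker.observe(healthy, now) for healthy, now in polls]
```

In this run only the last poll returns True: the healthy result at 250 resets the streak, so the unhealthy period is measured from 260 and first exceeds 300 seconds at 561.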