The autoscaler runs on the head node and provisions or terminates worker VMs based on resource demand.
Bounds
Within each node type, min_workers is the floor and max_workers is the cap for that type. The cluster-wide max_workers is an absolute cap on the total number of workers across all node types.
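As a minimal sketch of how these bounds fit together in a cluster YAML, assuming two hypothetical worker node types named cpu_worker and gpu_worker (the names and all numbers are illustrative, not taken from this page):

```yaml
# Cluster-wide cap: the total number of workers never exceeds this value,
# regardless of the per-type limits below.
max_workers: 20

available_node_types:
  cpu_worker:              # hypothetical node type name
    min_workers: 2         # floor: at least 2 workers of this type stay up
    max_workers: 16        # per-type cap
    resources: {"CPU": 8}
    node_config: {}        # provider-specific settings omitted
  gpu_worker:              # hypothetical node type name
    min_workers: 0         # this type may scale to zero
    max_workers: 8         # per-type cap
    resources: {"CPU": 8, "GPU": 1}
    node_config: {}        # provider-specific settings omitted
```

In this sketch the per-type caps add up to 24, but the cluster-wide max_workers of 20 still limits the total.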
Idle timeout
Upscaling speed
An upscaling speed of 1.0 doubles the cluster on every check; 0.5 adds half of the current node count. Higher values reduce time-to-capacity at the cost of burstier bills.
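A small sketch of the setting, assuming the top-level upscaling_speed key of the cluster YAML. The growth figures in the comments only restate the arithmetic described above, assume enough pending demand at every check, and are not measured behavior:

```yaml
# Illustrative value; tune it for your workload and budget.
upscaling_speed: 1.0

# Starting from 4 running nodes:
#   1.0 -> roughly 4, 8, 16 nodes over successive checks
#   0.5 -> roughly 4, 6, 9 nodes over successive checks
# Growth always stops at the max_workers caps.
```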
Resource matching
The autoscaler picks the cheapest node type that satisfies a pending resource request. Use custom resources to differentiate node classes.
Spot instances
For preemptible workers, configure the provider’s spot options (AWS InstanceMarketOptions, GCP preemptible: true, Azure priority: Spot).
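The custom-resource and spot settings described above might be combined as in the following sketch for AWS. The node type name spot_gpu_worker, the custom resource spot_gpu, and the instance type are assumptions made for illustration:

```yaml
available_node_types:
  spot_gpu_worker:                  # hypothetical node type name
    min_workers: 0
    max_workers: 4
    # "spot_gpu" is a made-up custom resource. If only this node type
    # advertises it, requests for it can only be satisfied by this type.
    resources: {"CPU": 8, "GPU": 1, "spot_gpu": 1}
    node_config:
      InstanceType: g4dn.2xlarge    # illustrative instance type
      InstanceMarketOptions:
        MarketType: spot            # request AWS spot capacity
```

A task or actor that requests the spot_gpu resource would then land on this node class, while work that must not be preempted can simply avoid requesting it.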
Verifying
Next steps
Best practices
Run >100 nodes reliably.
Logging
Aggregate logs across nodes.