Tune trials can save checkpoints to durable storage. Use them for fault tolerance, PBT (which copies checkpoints between trials), and to recover the best trial after a run finishes.
Save a checkpoint
Calling train.report with a checkpoint persists that checkpoint to the trial’s storage directory.
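A minimal sketch of saving a checkpoint from a function trainable, assuming the ray.train API that this page refers to (train.report and train.Checkpoint.from_directory). The helper write_state, the file name state.json, the "steps" config key, and the toy loss are hypothetical names chosen for illustration. Depending on the Ray version, the same calls may be exposed under ray.tune instead.

```python
import json
import os
import tempfile


def write_state(directory, state):
    """Write the training state as JSON inside a checkpoint directory."""
    path = os.path.join(directory, "state.json")
    with open(path, "w") as f:
        json.dump(state, f)
    return path


def train_fn(config):
    # Imported inside the function so write_state stays usable without Ray.
    from ray import train

    for step in range(config["steps"]):
        loss = 1.0 / (step + 1)  # stand-in for a real training step
        with tempfile.TemporaryDirectory() as tmp:
            write_state(tmp, {"step": step, "loss": loss})
            # The directory is handed to Ray together with the metrics.
            train.report(
                {"loss": loss},
                checkpoint=train.Checkpoint.from_directory(tmp),
            )
```

Writing into a temporary directory keeps the trial's working directory clean; the checkpoint that matters is the copy Ray stores under the trial's storage path.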
Resume a trial
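Resuming has two parts: restoring the experiment from its storage path, and having the training function pick up from the last checkpoint. The sketch below assumes tune.Tuner.restore, train.get_checkpoint, and Checkpoint.as_directory from Ray; read_start_step, resume_experiment, state.json, and the "steps" config key are hypothetical names used only for this example.

```python
import json
import os
import tempfile


def read_start_step(directory):
    """Return the step to resume from, given a checkpoint directory."""
    with open(os.path.join(directory, "state.json")) as f:
        return json.load(f)["step"] + 1


def train_fn(config):
    from ray import train  # imported lazily; requires Ray at run time

    start = 0
    checkpoint = train.get_checkpoint()  # None when the trial starts fresh
    if checkpoint is not None:
        with checkpoint.as_directory() as ckpt_dir:
            start = read_start_step(ckpt_dir)

    for step in range(start, config["steps"]):
        loss = 1.0 / (step + 1)  # stand-in for a real training step
        with tempfile.TemporaryDirectory() as tmp:
            with open(os.path.join(tmp, "state.json"), "w") as f:
                json.dump({"step": step}, f)
            train.report(
                {"loss": loss},
                checkpoint=train.Checkpoint.from_directory(tmp),
            )


def resume_experiment(experiment_path):
    """Restore an interrupted experiment from its storage path and continue."""
    from ray import tune

    tuner = tune.Tuner.restore(experiment_path, trainable=train_fn)
    return tuner.fit()
```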
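Inside the training function, call train.get_checkpoint() at startup. It returns the checkpoint to resume from, or None when the trial starts fresh.
Best checkpoint of a result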
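A minimal sketch for recovering the best checkpoint once Tuner.fit() has returned, assuming the ResultGrid.get_best_result and Result.get_best_checkpoint methods from Ray. The function name best_checkpoint and the metric name "loss" are hypothetical; use whatever metric the training function reports.

```python
def best_checkpoint(results, metric="loss", mode="min"):
    """Pick the best trial, then the best checkpoint that trial retained.

    `results` is assumed to be the ResultGrid returned by Tuner.fit().
    """
    best_result = results.get_best_result(metric=metric, mode=mode)
    # best_result.checkpoint would give the trial's latest checkpoint;
    # get_best_checkpoint ranks the retained checkpoints by the metric.
    return best_result.get_best_checkpoint(metric=metric, mode=mode)
```

Only checkpoints that survived the retention policy can be returned here, so the result depends on how retention is configured.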
Configure retention
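Retention is typically set through a checkpoint configuration on the run configuration. The sketch below assumes train.CheckpointConfig (with num_to_keep, checkpoint_score_attribute, and checkpoint_score_order) and train.RunConfig from Ray; retention_settings, build_run_config, and the metric name "loss" are hypothetical.

```python
def retention_settings(num_to_keep=3, metric="loss", order="min"):
    """Keyword arguments for CheckpointConfig: keep the top-scoring checkpoints."""
    return {
        "num_to_keep": num_to_keep,
        "checkpoint_score_attribute": metric,
        "checkpoint_score_order": order,
    }


def build_run_config(settings):
    """Wrap the retention settings in a RunConfig to pass to tune.Tuner."""
    from ray import train  # imported lazily; requires Ray at run time

    return train.RunConfig(
        checkpoint_config=train.CheckpointConfig(**settings),
    )
```

With these settings, each trial keeps at most three checkpoints, ranked by the lowest reported "loss"; pass the result as run_config when constructing the Tuner.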
PBT and checkpoints
PBT copies checkpoints between trials when promoting good performers. Your training function must:
- Save a checkpoint at every reporting step.
- Restore from a checkpoint at the start of each step (or at the start of the function, when applicable).
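The two requirements above can be sketched as a single training function plus a scheduler, assuming PopulationBasedTraining from ray.tune.schedulers and the ray.train reporting API. The names pbt_train_fn, make_pbt_scheduler, save_state, load_state, the "lr" and "max_steps" config keys, the "score" metric, and the toy update rule are all hypothetical.

```python
import json
import os
import tempfile


def save_state(directory, state):
    with open(os.path.join(directory, "state.json"), "w") as f:
        json.dump(state, f)


def load_state(directory):
    with open(os.path.join(directory, "state.json")) as f:
        return json.load(f)


def pbt_train_fn(config):
    from ray import train  # imported lazily; requires Ray at run time

    state = {"step": 0, "weight": 0.0}
    checkpoint = train.get_checkpoint()
    if checkpoint is not None:
        # Under PBT this may be a checkpoint copied from another trial.
        with checkpoint.as_directory() as ckpt_dir:
            state = load_state(ckpt_dir)

    while state["step"] < config["max_steps"]:
        # Toy update; config["lr"] may have been perturbed by PBT.
        state["weight"] += config["lr"] * (1.0 - state["weight"])
        state["step"] += 1
        with tempfile.TemporaryDirectory() as tmp:
            save_state(tmp, state)
            # Checkpoint on every report so any step can be exploited.
            train.report(
                {"score": state["weight"]},
                checkpoint=train.Checkpoint.from_directory(tmp),
            )


def make_pbt_scheduler():
    from ray import tune
    from ray.tune.schedulers import PopulationBasedTraining

    return PopulationBasedTraining(
        time_attr="training_iteration",
        perturbation_interval=4,
        hyperparam_mutations={"lr": tune.uniform(1e-4, 1e-1)},
    )
```

Keeping the step counter inside the checkpoint matters here: a trial that receives another trial's checkpoint continues from that trial's progress rather than from zero.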
Next steps
- Analyzing results: Inspect, compare, and export trial results.
- Distributed tuning: Sync checkpoints between nodes.