UseDocumentation Index
Fetch the complete documentation index at: https://ray-preview.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
write_* operators to persist a dataset to disk or cloud storage. Writes execute in parallel — each block becomes one or more output files.
File formats
Partitioning
Write a partitioned dataset by passing a partitioning column:region=us/day=2025-01-01/...).
Compression
Most writers accept acompression argument:
Custom writers
SubclassDatasink for systems Ray Data doesn’t ship with:
Concurrency and resources
Passnum_rows_per_file, concurrency, num_cpus, etc. to control writer behavior:
Next steps
Loading data
Read the data back in.
Working with tensors
Save and reload tensor columns.