Use these methods to understand the shape and contents of a dataset before you process it.
Schema
Count
count() triggers full execution of any pending transforms, but it streams blocks, so it doesn't load the whole dataset into memory at once.
Sample rows
Statistics
Plan
Iterate without consuming
To inspect without modifying the dataset, prefer take or iter_batches(prefetch_batches=0).
Type-check rows
numpy, pandas, and pyarrow produce different Python types for the same logical data.
Next steps
Iterating
Consume datasets in training and inference loops.
Performance tips
Diagnose slow pipelines.