ADocumentation Index
Fetch the complete documentation index at: https://ray-preview.mintlify.app/llms.txt
Use this file to discover all available pages before exploring further.
Preprocessor encapsulates a fit/transform pipeline. It wraps a stateful transformation that can be fit on training data and applied to validation, test, and inference data.
Built-in preprocessors
| Preprocessor | What it does |
|---|---|
StandardScaler | Subtract mean, divide by std. |
MinMaxScaler | Scale to a specified range. |
Categorizer | Convert string columns to integer codes. |
OneHotEncoder | One-hot encode a categorical column. |
LabelEncoder | Map labels to integers. |
Tokenizer | Tokenize text columns. |
HashingVectorizer | Hash tokens to feature buckets. |
Concatenator | Concatenate columns into a tensor column. |
Chain | Compose multiple preprocessors. |
Fit and transform
fit computes the per-column mean and std; transform applies them.
Compose with Chain
Custom preprocessors
SubclassPreprocessor:
Save and load
Next steps
Train integration
Use preprocessors in Ray Train.
Transforming data
Lower-level UDF transformations.