getml.preprocessors
Contains routines for preprocessing data frames.
Classes
Reduces the cardinality of high-cardinality categorical columns.
The EmailDomain preprocessor extracts the domain from e-mail addresses.
The Imputation preprocessor replaces all NULL values in a numerical column with the mean of the remaining (non-NULL) values in that column.
A mapping preprocessor maps categorical values, discrete values and individual words in a text field to numerical values. |
The Seasonal preprocessor extracts seasonal data from time stamps.
The Substring preprocessor extracts substrings from categorical columns and unused string columns. |
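To make some of the transformations described above more concrete, here is a minimal plain-Python sketch of domain extraction, mean imputation, seasonal components and substring extraction. It is an illustration only and does not use or reproduce the getml implementation: the function names (`email_domain`, `impute_mean`, `seasonal_components`, `substring`), their parameters and the choice of calendar components are all assumptions made for this example.

```python
from datetime import datetime
from statistics import mean


def email_domain(address):
    """Return the part after the last '@', or None if there is no '@'."""
    _, sep, domain = address.rpartition("@")
    return domain if sep else None


def impute_mean(values):
    """Replace None entries with the mean of the non-None entries."""
    present = [v for v in values if v is not None]
    if not present:
        # Nothing to compute a mean from; return the values unchanged.
        return list(values)
    fill = mean(present)
    return [fill if v is None else v for v in values]


def seasonal_components(stamp):
    """Split a time stamp into a few calendar components (assumed set)."""
    return {
        "year": stamp.year,
        "month": stamp.month,
        "weekday": stamp.weekday(),  # Monday is 0
        "hour": stamp.hour,
    }


def substring(value, begin, length):
    """Return `length` characters of `value`, starting at index `begin`."""
    return value[begin:begin + length]
```

For instance, `email_domain("jane.doe@example.com")` yields `"example.com"`, and `impute_mean([1.0, None, 3.0])` fills the gap with `2.0`. In getml itself the preprocessors operate on data frame columns rather than on individual Python values.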
A TextFieldSplitter splits columns with role |