com.microsoft.ml.spark.featurize
Custom value for imputation, supports numeric, string and boolean types.
Custom value for imputation, supports numeric, string and boolean types. Date and Timestamp currently not supported.
Fits the dataset, prepares the transformation function.
Fits the dataset, prepares the transformation function.
The input dataset.
The model for removing missings.
The names of the inputColumns
The names of the inputColumns
The names of the output columns
The names of the output columns
Removes missing values from input dataset. The following modes are supported: Mean - replaces missings with mean of fit column Median - replaces missings with approximate median of fit column Custom - replaces missings with custom value specified by user For mean and median modes, only numeric column types are supported, specifically:
Int
,Long
,Float
,Double
For custom mode, the types above are supported and additionally:String
,Boolean