com.microsoft.ml.spark.stages
TextPreprocessor takes a DataFrame and a dictionary that maps text to replacement text, scans each cell in the input column, and replaces every substring match with the corresponding value. When several keys could match, longer keys take priority and matching proceeds from left to right.
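The matching rule can be illustrated on a single string with a minimal plain-Python sketch. This is not the library's implementation. It assumes one reading of the rule: at each position the longest matching key wins, the scan then resumes after the matched text, and replacement text is not scanned again. The function name `replace_longest_first` is hypothetical.

```python
def replace_longest_first(text, mapping):
    """Illustrative only: replace substring matches in `text` using `mapping`.

    Assumed semantics (not confirmed by the library documentation):
    scan left to right, and at each position prefer the longest key.
    """
    # Longer keys are tried first; empty keys are skipped because they
    # would match everywhere without consuming any input.
    keys = sorted((k for k in mapping if k), key=len, reverse=True)
    out = []
    i = 0
    while i < len(text):
        for key in keys:
            if text.startswith(key, i):
                out.append(mapping[key])  # emit the replacement text
                i += len(key)             # resume after the matched key
                break
        else:
            out.append(text[i])           # no key matches here; copy the character
            i += 1
    return "".join(out)


if __name__ == "__main__":
    # "abc" beats "ab" at position 0 because it is the longer key.
    print(replace_longest_first("abcab c", {"ab": "X", "abc": "Y", "c": "Z"}))
```

Under these assumptions, a replacement is never itself replaced: with the mapping `{"a": "b", "b": "c"}`, the input `"ab"` becomes `"bc"`, not `"cc"`.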
TextPreprocessor
The name of the input column
The name of the output column
The input dataset to be transformed
The DataFrame that results from the transformation, with the replaced text in the output column