Class containing the lists of column names that need special featurization steps.
colNamesToHash - List of column names to hash.
colNamesToDuplicateForMissings - List of column names containing doubles to duplicate
so we can remove missing values from them.
colNamesToTypes - Map of column names to their types.
colNamesToCleanMissings - List of column names to clean missing values from (ignore).
colNamesToVectorize - List of column names to vectorize using FastVectorAssembler.
categoricalColumns - List of categorical columns to pass through or turn into an indicator array.
conversionColumnNamesMap - Map from old column names to new column names.
addedColumnNamesMap - Map from old columns to newly generated columns for featurization.
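
To make the roles of these fields concrete, here is a minimal sketch of how such a container might look and be filled in. The field names follow the description above; everything else is an assumption made for illustration: the name FeaturizeColumnsSketch, the field types (column types are plain strings here rather than Spark data types), and the routing rules in the hypothetical classify helper.

```scala
import scala.collection.mutable

// Hypothetical stand-in for the documented class. Field names come from the
// description above; all field types are assumptions.
case class FeaturizeColumnsSketch(
  colNamesToHash: mutable.ListBuffer[String] = mutable.ListBuffer.empty[String],
  colNamesToDuplicateForMissings: mutable.ListBuffer[String] = mutable.ListBuffer.empty[String],
  colNamesToTypes: mutable.Map[String, String] = mutable.Map.empty[String, String],
  colNamesToCleanMissings: mutable.ListBuffer[String] = mutable.ListBuffer.empty[String],
  colNamesToVectorize: mutable.ListBuffer[String] = mutable.ListBuffer.empty[String],
  categoricalColumns: mutable.ListBuffer[String] = mutable.ListBuffer.empty[String],
  conversionColumnNamesMap: mutable.Map[String, String] = mutable.Map.empty[String, String],
  addedColumnNamesMap: mutable.Map[String, String] = mutable.Map.empty[String, String]
)

object FeaturizeColumnsSketchDemo {
  // Hypothetical helper: routes (columnName, typeName) pairs into the lists.
  // The routing rules are illustrative assumptions, not the library's logic.
  def classify(schema: Seq[(String, String)]): FeaturizeColumnsSketch = {
    val cols = FeaturizeColumnsSketch()
    schema.foreach { case (name, typeName) =>
      // Record every column's type.
      cols.colNamesToTypes(name) = typeName
      typeName match {
        case "string" =>
          // Assumption: free-text columns are hashed.
          cols.colNamesToHash += name
        case "double" =>
          // Assumption: double columns are duplicated, cleaned of missing
          // values, and then vectorized.
          cols.colNamesToDuplicateForMissings += name
          cols.colNamesToCleanMissings += name
          cols.colNamesToVectorize += name
        case _ =>
          // Assumption: everything else goes straight to vectorization.
          cols.colNamesToVectorize += name
      }
    }
    cols
  }
}
```

For example, FeaturizeColumnsSketchDemo.classify(Seq("city" -> "string", "price" -> "double")) would place "city" in colNamesToHash and "price" in the duplicate, clean and vectorize lists, while recording both types in colNamesToTypes.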