Packages

class CleanMissingData extends Estimator[CleanMissingDataModel] with HasInputCols with HasOutputCols with Wrappable with DefaultParamsWritable with BasicLogging

Removes missing values from input dataset. The following modes are supported: Mean - replaces missings with mean of fit column Median - replaces missings with approximate median of fit column Custom - replaces missings with custom value specified by user For mean and median modes, only numeric column types are supported, specifically: Int, Long, Float, Double For custom mode, the types above are supported and additionally: String, Boolean

Linear Supertypes
BasicLogging, DefaultParamsWritable, MLWritable, Wrappable, RWrappable, PythonWrappable, BaseWrappable, HasOutputCols, HasInputCols, Estimator[CleanMissingDataModel], PipelineStage, Logging, Params, Serializable, Serializable, Identifiable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. CleanMissingData
  2. BasicLogging
  3. DefaultParamsWritable
  4. MLWritable
  5. Wrappable
  6. RWrappable
  7. PythonWrappable
  8. BaseWrappable
  9. HasOutputCols
  10. HasInputCols
  11. Estimator
  12. PipelineStage
  13. Logging
  14. Params
  15. Serializable
  16. Serializable
  17. Identifiable
  18. AnyRef
  19. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new CleanMissingData()
  2. new CleanMissingData(uid: String)

Value Members

  1. val cleaningMode: Param[String]
  2. final def clear(param: Param[_]): CleanMissingData.this.type
    Definition Classes
    Params
  3. def copy(extra: ParamMap): Estimator[CleanMissingDataModel]
    Definition Classes
    CleanMissingData → Estimator → PipelineStage → Params
  4. val customValue: Param[String]

    Custom value for imputation, supports numeric, string and boolean types.

    Custom value for imputation, supports numeric, string and boolean types. Date and Timestamp currently not supported.

  5. def explainParam(param: Param[_]): String
    Definition Classes
    Params
  6. def explainParams(): String
    Definition Classes
    Params
  7. final def extractParamMap(): ParamMap
    Definition Classes
    Params
  8. final def extractParamMap(extra: ParamMap): ParamMap
    Definition Classes
    Params
  9. def fit(dataset: Dataset[_]): CleanMissingDataModel

    Fits the dataset, prepares the transformation function.

    Fits the dataset, prepares the transformation function.

    dataset

    The input dataset.

    returns

    The model for removing missings.

    Definition Classes
    CleanMissingData → Estimator
  10. def fit(dataset: Dataset[_], paramMaps: Seq[ParamMap]): Seq[CleanMissingDataModel]
    Definition Classes
    Estimator
    Annotations
    @Since( "2.0.0" )
  11. def fit(dataset: Dataset[_], paramMap: ParamMap): CleanMissingDataModel
    Definition Classes
    Estimator
    Annotations
    @Since( "2.0.0" )
  12. def fit(dataset: Dataset[_], firstParamPair: ParamPair[_], otherParamPairs: ParamPair[_]*): CleanMissingDataModel
    Definition Classes
    Estimator
    Annotations
    @Since( "2.0.0" ) @varargs()
  13. final def get[T](param: Param[T]): Option[T]
    Definition Classes
    Params
  14. def getCleaningMode: String
  15. def getCustomValue: String
  16. final def getDefault[T](param: Param[T]): Option[T]
    Definition Classes
    Params
  17. def getInputCols: Array[String]

    Definition Classes
    HasInputCols
  18. final def getOrDefault[T](param: Param[T]): T
    Definition Classes
    Params
  19. def getOutputCols: Array[String]

    Definition Classes
    HasOutputCols
  20. def getParam(paramName: String): Param[Any]
    Definition Classes
    Params
  21. final def hasDefault[T](param: Param[T]): Boolean
    Definition Classes
    Params
  22. def hasParam(paramName: String): Boolean
    Definition Classes
    Params
  23. val inputCols: StringArrayParam

    The names of the inputColumns

    The names of the inputColumns

    Definition Classes
    HasInputCols
  24. final def isDefined(param: Param[_]): Boolean
    Definition Classes
    Params
  25. final def isSet(param: Param[_]): Boolean
    Definition Classes
    Params
  26. def logClass(): Unit
    Definition Classes
    BasicLogging
  27. def logFit[T](f: ⇒ T): T
    Definition Classes
    BasicLogging
  28. def logPredict[T](f: ⇒ T): T
    Definition Classes
    BasicLogging
  29. def logTrain[T](f: ⇒ T): T
    Definition Classes
    BasicLogging
  30. def logTransform[T](f: ⇒ T): T
    Definition Classes
    BasicLogging
  31. def logVerb[T](verb: String, f: ⇒ T): T
    Definition Classes
    BasicLogging
  32. def makePyFile(conf: CodegenConfig): Unit
    Definition Classes
    PythonWrappable
  33. def makeRFile(conf: CodegenConfig): Unit
    Definition Classes
    RWrappable
  34. val outputCols: StringArrayParam

    The names of the output columns

    The names of the output columns

    Definition Classes
    HasOutputCols
  35. lazy val params: Array[Param[_]]
    Definition Classes
    Params
  36. def pyAdditionalMethods: String
    Definition Classes
    PythonWrappable
  37. def pyInitFunc(): String
    Definition Classes
    PythonWrappable
  38. def save(path: String): Unit
    Definition Classes
    MLWritable
    Annotations
    @Since( "1.6.0" ) @throws( ... )
  39. final def set[T](param: Param[T], value: T): CleanMissingData.this.type
    Definition Classes
    Params
  40. def setCleaningMode(value: String): CleanMissingData.this.type
  41. def setCustomValue(value: String): CleanMissingData.this.type
  42. def setInputCols(value: Array[String]): CleanMissingData.this.type

    Definition Classes
    HasInputCols
  43. def setOutputCols(value: Array[String]): CleanMissingData.this.type

    Definition Classes
    HasOutputCols
  44. def toString(): String
    Definition Classes
    Identifiable → AnyRef → Any
  45. def transformSchema(schema: StructType): StructType
    Definition Classes
    CleanMissingData → PipelineStage
    Annotations
    @DeveloperApi()
  46. val uid: String
    Definition Classes
    CleanMissingDataBasicLogging → Identifiable
  47. val ver: String
    Definition Classes
    BasicLogging
  48. def write: MLWriter
    Definition Classes
    DefaultParamsWritable → MLWritable