Class/Object

com.microsoft.ml.spark.featurize

CleanMissingData

Related Docs: object CleanMissingData | package featurize

Permalink

class CleanMissingData extends Estimator[CleanMissingDataModel] with HasInputCols with HasOutputCols with Wrappable with DefaultParamsWritable

Removes missing values from input dataset. The following modes are supported: Mean - replaces missings with mean of fit column Median - replaces missings with approximate median of fit column Custom - replaces missings with custom value specified by user For mean and median modes, only numeric column types are supported, specifically: Int, Long, Float, Double For custom mode, the types above are supported and additionally: String, Boolean

Linear Supertypes
DefaultParamsWritable, MLWritable, HasOutputCols, HasInputCols, Wrappable, Estimator[CleanMissingDataModel], PipelineStage, Logging, Params, Serializable, Serializable, Identifiable, AnyRef, Any
Ordering
  1. Alphabetic
  2. By Inheritance
Inherited
  1. CleanMissingData
  2. DefaultParamsWritable
  3. MLWritable
  4. HasOutputCols
  5. HasInputCols
  6. Wrappable
  7. Estimator
  8. PipelineStage
  9. Logging
  10. Params
  11. Serializable
  12. Serializable
  13. Identifiable
  14. AnyRef
  15. Any
  1. Hide All
  2. Show All
Visibility
  1. Public
  2. All

Instance Constructors

  1. new CleanMissingData()

    Permalink
  2. new CleanMissingData(uid: String)

    Permalink

Value Members

  1. final def !=(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  2. final def ##(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  3. final def $[T](param: Param[T]): T

    Permalink
    Attributes
    protected
    Definition Classes
    Params
  4. final def ==(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  5. def additionalPythonMethods(): String

    Permalink
    Definition Classes
    Wrappable
  6. final def asInstanceOf[T0]: T0

    Permalink
    Definition Classes
    Any
  7. val cleaningMode: Param[String]

    Permalink
  8. final def clear(param: Param[_]): CleanMissingData.this.type

    Permalink
    Definition Classes
    Params
  9. def clone(): AnyRef

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  10. def copy(extra: ParamMap): Estimator[CleanMissingDataModel]

    Permalink
    Definition Classes
    CleanMissingData → Estimator → PipelineStage → Params
  11. def copyValues[T <: Params](to: T, extra: ParamMap): T

    Permalink
    Attributes
    protected
    Definition Classes
    Params
  12. val customValue: Param[String]

    Permalink

    Custom value for imputation, supports numeric, string and boolean types.

    Custom value for imputation, supports numeric, string and boolean types. Date and Timestamp currently not supported.

  13. final def defaultCopy[T <: Params](extra: ParamMap): T

    Permalink
    Attributes
    protected
    Definition Classes
    Params
  14. final def eq(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  15. def equals(arg0: Any): Boolean

    Permalink
    Definition Classes
    AnyRef → Any
  16. def explainParam(param: Param[_]): String

    Permalink
    Definition Classes
    Params
  17. def explainParams(): String

    Permalink
    Definition Classes
    Params
  18. final def extractParamMap(): ParamMap

    Permalink
    Definition Classes
    Params
  19. final def extractParamMap(extra: ParamMap): ParamMap

    Permalink
    Definition Classes
    Params
  20. def finalize(): Unit

    Permalink
    Attributes
    protected[java.lang]
    Definition Classes
    AnyRef
    Annotations
    @throws( classOf[java.lang.Throwable] )
  21. def fit(dataset: Dataset[_]): CleanMissingDataModel

    Permalink

    Fits the dataset, prepares the transformation function.

    Fits the dataset, prepares the transformation function.

    dataset

    The input dataset.

    returns

    The model for removing missings.

    Definition Classes
    CleanMissingData → Estimator
  22. def fit(dataset: Dataset[_], paramMaps: Array[ParamMap]): Seq[CleanMissingDataModel]

    Permalink
    Definition Classes
    Estimator
    Annotations
    @Since( "2.0.0" )
  23. def fit(dataset: Dataset[_], paramMap: ParamMap): CleanMissingDataModel

    Permalink
    Definition Classes
    Estimator
    Annotations
    @Since( "2.0.0" )
  24. def fit(dataset: Dataset[_], firstParamPair: ParamPair[_], otherParamPairs: ParamPair[_]*): CleanMissingDataModel

    Permalink
    Definition Classes
    Estimator
    Annotations
    @Since( "2.0.0" ) @varargs()
  25. final def get[T](param: Param[T]): Option[T]

    Permalink
    Definition Classes
    Params
  26. final def getClass(): Class[_]

    Permalink
    Definition Classes
    AnyRef → Any
  27. def getCleaningMode: String

    Permalink
  28. def getCustomValue: String

    Permalink
  29. final def getDefault[T](param: Param[T]): Option[T]

    Permalink
    Definition Classes
    Params
  30. def getInputCols: Array[String]

    Permalink

    Definition Classes
    HasInputCols
  31. final def getOrDefault[T](param: Param[T]): T

    Permalink
    Definition Classes
    Params
  32. def getOutputCols: Array[String]

    Permalink

    Definition Classes
    HasOutputCols
  33. def getParam(paramName: String): Param[Any]

    Permalink
    Definition Classes
    Params
  34. final def hasDefault[T](param: Param[T]): Boolean

    Permalink
    Definition Classes
    Params
  35. def hasParam(paramName: String): Boolean

    Permalink
    Definition Classes
    Params
  36. def hashCode(): Int

    Permalink
    Definition Classes
    AnyRef → Any
  37. def initializeLogIfNecessary(isInterpreter: Boolean, silent: Boolean): Boolean

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  38. def initializeLogIfNecessary(isInterpreter: Boolean): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  39. val inputCols: StringArrayParam

    Permalink

    The names of the inputColumns

    The names of the inputColumns

    Definition Classes
    HasInputCols
  40. final def isDefined(param: Param[_]): Boolean

    Permalink
    Definition Classes
    Params
  41. final def isInstanceOf[T0]: Boolean

    Permalink
    Definition Classes
    Any
  42. final def isSet(param: Param[_]): Boolean

    Permalink
    Definition Classes
    Params
  43. def isTraceEnabled(): Boolean

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  44. def log: Logger

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  45. def logDebug(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  46. def logDebug(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  47. def logError(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  48. def logError(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  49. def logInfo(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  50. def logInfo(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  51. def logName: String

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  52. def logTrace(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  53. def logTrace(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  54. def logWarning(msg: ⇒ String, throwable: Throwable): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  55. def logWarning(msg: ⇒ String): Unit

    Permalink
    Attributes
    protected
    Definition Classes
    Logging
  56. final def ne(arg0: AnyRef): Boolean

    Permalink
    Definition Classes
    AnyRef
  57. final def notify(): Unit

    Permalink
    Definition Classes
    AnyRef
  58. final def notifyAll(): Unit

    Permalink
    Definition Classes
    AnyRef
  59. val outputCols: StringArrayParam

    Permalink

    The names of the output columns

    The names of the output columns

    Definition Classes
    HasOutputCols
  60. lazy val params: Array[Param[_]]

    Permalink
    Definition Classes
    Params
  61. def save(path: String): Unit

    Permalink
    Definition Classes
    MLWritable
    Annotations
    @Since( "1.6.0" ) @throws( ... )
  62. final def set(paramPair: ParamPair[_]): CleanMissingData.this.type

    Permalink
    Attributes
    protected
    Definition Classes
    Params
  63. final def set(param: String, value: Any): CleanMissingData.this.type

    Permalink
    Attributes
    protected
    Definition Classes
    Params
  64. final def set[T](param: Param[T], value: T): CleanMissingData.this.type

    Permalink
    Definition Classes
    Params
  65. def setCleaningMode(value: String): CleanMissingData.this.type

    Permalink
  66. def setCustomValue(value: String): CleanMissingData.this.type

    Permalink
  67. final def setDefault(paramPairs: ParamPair[_]*): CleanMissingData.this.type

    Permalink
    Attributes
    protected
    Definition Classes
    Params
  68. final def setDefault[T](param: Param[T], value: T): CleanMissingData.this.type

    Permalink
    Attributes
    protected
    Definition Classes
    Params
  69. def setInputCols(value: Array[String]): CleanMissingData.this.type

    Permalink

    Definition Classes
    HasInputCols
  70. def setOutputCols(value: Array[String]): CleanMissingData.this.type

    Permalink

    Definition Classes
    HasOutputCols
  71. final def synchronized[T0](arg0: ⇒ T0): T0

    Permalink
    Definition Classes
    AnyRef
  72. def toString(): String

    Permalink
    Definition Classes
    Identifiable → AnyRef → Any
  73. def transformSchema(schema: StructType): StructType

    Permalink
    Definition Classes
    CleanMissingData → PipelineStage
    Annotations
    @DeveloperApi()
  74. def transformSchema(schema: StructType, logging: Boolean): StructType

    Permalink
    Attributes
    protected
    Definition Classes
    PipelineStage
    Annotations
    @DeveloperApi()
  75. val uid: String

    Permalink
    Definition Classes
    CleanMissingData → Identifiable
  76. final def wait(): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  77. final def wait(arg0: Long, arg1: Int): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  78. final def wait(arg0: Long): Unit

    Permalink
    Definition Classes
    AnyRef
    Annotations
    @throws( ... )
  79. def write: MLWriter

    Permalink
    Definition Classes
    DefaultParamsWritable → MLWritable

Inherited from DefaultParamsWritable

Inherited from MLWritable

Inherited from HasOutputCols

Inherited from HasInputCols

Inherited from Wrappable

Inherited from Estimator[CleanMissingDataModel]

Inherited from PipelineStage

Inherited from Logging

Inherited from Params

Inherited from Serializable

Inherited from Serializable

Inherited from Identifiable

Inherited from AnyRef

Inherited from Any

getParam

param

setParam

Ungrouped