object DatasetUtils
Linear Supertypes
Ordering
- Alphabetic
- By Inheritance
Inherited
- DatasetUtils
- AnyRef
- Any
- Hide All
- Show All
Visibility
- Public
- All
Type Members
- case class CardinalityTriplet[T](groupCounts: List[Int], currentValue: T, currentCount: Int) extends Product with Serializable
Value Members
- def countCardinality[T](input: Seq[T]): Array[Int]
- def getInitScores(rows: Array[Row], initScoreColumn: Option[String], schema: StructType): Option[Array[Double]]
- def getRowAsDoubleArray(row: Row, columnParams: ColumnParams): Array[Double]
- def releaseArrays(labelsChunkedArray: floatChunkedArray, weightChunkedArrayOpt: Option[floatChunkedArray], initScoreChunkedArrayOpt: Option[doubleChunkedArray]): Unit
-
def
sampleRowsForArrayType(rowsIter: Iterator[Row], columnParams: ColumnParams): (Iterator[Row], Boolean)
Sample the first several rows to determine whether to construct sparse or dense matrix in lightgbm native code.
Sample the first several rows to determine whether to construct sparse or dense matrix in lightgbm native code.
- rowsIter
Iterator of rows.
- columnParams
The column parameters.
- returns
A reconstructed iterator with the same original rows and whether the matrix should be sparse or dense.
- def validateGroupColumn(col: String, schema: StructType): Unit