mmlspark.opencv package

Submodules

mmlspark.opencv.ImageSetAugmenter module

class mmlspark.opencv.ImageSetAugmenter.ImageSetAugmenter(java_obj=None, flipLeftRight=True, flipUpDown=False, inputCol='image', outputCol='ImageSetAugmenter_aea500e47fd0_output')[source]

Bases: mmlspark.core.schema.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer

Parameters
  • flipLeftRight (bool) – Symmetric Left-Right

  • flipUpDown (bool) – Symmetric Up-Down

  • inputCol (object) – The name of the input column

  • outputCol (object) – The name of the output column

flipLeftRight = Param(parent='undefined', name='flipLeftRight', doc='Symmetric Left-Right')
flipUpDown = Param(parent='undefined', name='flipUpDown', doc='Symmetric Up-Down')
getFlipLeftRight()[source]
Returns

Symmetric Left-Right

Return type

flipLeftRight

getFlipUpDown()[source]
Returns

Symmetric Up-Down

Return type

flipUpDown

getInputCol()[source]
Returns

The name of the input column

Return type

inputCol

static getJavaPackage()[source]

Returns package name String.

getOutputCol()[source]
Returns

The name of the output column

Return type

outputCol

inputCol = Param(parent='undefined', name='inputCol', doc='The name of the input column')
outputCol = Param(parent='undefined', name='outputCol', doc='The name of the output column')
classmethod read()[source]

Returns an MLReader instance for this class.

setFlipLeftRight(value)[source]
Parameters

flipLeftRight – Symmetric Left-Right

setFlipUpDown(value)[source]
Parameters

flipUpDown – Symmetric Up-Down

setInputCol(value)[source]
Parameters

inputCol – The name of the input column

setOutputCol(value)[source]
Parameters

outputCol – The name of the output column

setParams(flipLeftRight=True, flipUpDown=False, inputCol='image', outputCol='ImageSetAugmenter_aea500e47fd0_output')[source]

Set the (keyword only) parameters

mmlspark.opencv.ImageTransformer module

class mmlspark.opencv.ImageTransformer.ImageTransformer(java_obj=None, inputCol='image', outputCol='ImageTransformer_e4c7d0112202_output', stages=None)[source]

Bases: mmlspark.opencv._ImageTransformer._ImageTransformer

Resizes the image to the given width and height

Parameters
  • height (int) – The height to resize to (>=0)

  • width (int) – The width to resize to (>=0)

blur(height, width)[source]

Blurs the image using a normalized box filter

Parameters
  • height (double) – The height of the box filter (>= 0)

  • width (double) – The width of the box filter (>= 0)

colorFormat(format)[source]

Formats the image to the given image format

Parameters

format (int) – The format to convert to, please see OpenCV cvtColor function documentation for all formats

crop(x, y, height, width)[source]

Crops the image given the starting x,y coordinates and the width and height

Parameters
  • x (int) – The initial x coordinate (>=0)

  • y (int) – The initial y coordinate (>=0)

  • height (int) – The height to crop to (>=0)

  • width (int) – The width to crop to (>=0)

flip(flipCode=1)[source]
gaussianKernel(appertureSize, sigma)[source]

Blurs the image by applying a gaussian kernel

Parameters
  • appertureSize (double) – The aperture size, which should be odd and positive

  • sigma (double) – The standard deviation of the gaussian

resize(height, width)[source]

Resizes the image to the given width and height

Parameters
  • height (int) – The height to resize to (>=0)

  • width (int) – The width to resize to (>=0)

threshold(threshold, maxVal, thresholdType)[source]

Thresholds the image, please see OpenCV threshold function documentation for more information

Parameters
  • threshold – (double) The threshold value

  • maxVal (double) – The maximum value to use

  • thresholdType (double) – The type of threshold, can be binary, binary_inv, trunc, zero, zero_inv

mmlspark.opencv.ImageTransformer.toImage(array, path='', mode=16)[source]

Converts a one-dimensional array to a 2 dimensional image

Parameters
  • array (array) –

  • path (str) –

  • ocvType (int) –

Returns

2 dimensional image

Return type

object

mmlspark.opencv.ImageTransformer.toNDArray(image)[source]

Converts an image to a 1-dimensional array

Parameters

image (object) – The image to be converted

Returns

The image as a 1-dimensional array

Return type

array

Module contents

MMLSpark is an ecosystem of tools aimed towards expanding the distributed computing framework Apache Spark in several new directions. MMLSpark adds many deep learning and data science tools to the Spark ecosystem, including seamless integration of Spark Machine Learning pipelines with Microsoft Cognitive Toolkit (CNTK), LightGBM and OpenCV. These tools enable powerful and highly-scalable predictive and analytical models for a variety of datasources.

MMLSpark also brings new networking capabilities to the Spark Ecosystem. With the HTTP on Spark project, users can embed any web service into their SparkML models. In this vein, MMLSpark provides easy to use SparkML transformers for a wide variety of Microsoft Cognitive Services. For production grade deployment, the Spark Serving project enables high throughput, sub-millisecond latency web services, backed by your Spark cluster.

MMLSpark requires Scala 2.11, Spark 2.4+, and Python 3.5+.