mmlspark.opencv package¶

Submodules¶

mmlspark.opencv.ImageSetAugmenter module¶

class mmlspark.opencv.ImageSetAugmenter.ImageSetAugmenter(java_obj=None, flipLeftRight=True, flipUpDown=False, inputCol='image', outputCol='ImageSetAugmenter_aea500e47fd0_output')[source]¶

Bases: mmlspark.core.schema.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer

Parameters

flipLeftRight (bool) – Symmetric Left-Right
flipUpDown (bool) – Symmetric Up-Down
inputCol (object) – The name of the input column
outputCol (object) – The name of the output column

flipLeftRight = Param(parent='undefined', name='flipLeftRight', doc='Symmetric Left-Right')¶

flipUpDown = Param(parent='undefined', name='flipUpDown', doc='Symmetric Up-Down')¶

getFlipLeftRight()[source]¶

Returns: Symmetric Left-Right
Return type: flipLeftRight

getFlipUpDown()[source]¶

Returns: Symmetric Up-Down
Return type: flipUpDown

getInputCol()[source]¶

Returns: The name of the input column
Return type: inputCol

static getJavaPackage()[source]¶: Returns package name String.

getOutputCol()[source]¶

Returns: The name of the output column
Return type: outputCol

inputCol = Param(parent='undefined', name='inputCol', doc='The name of the input column')¶

outputCol = Param(parent='undefined', name='outputCol', doc='The name of the output column')¶

classmethod read()[source]¶: Returns an MLReader instance for this class.

setFlipLeftRight(value)[source]¶

Parameters: flipLeftRight – Symmetric Left-Right

setFlipUpDown(value)[source]¶

Parameters: flipUpDown – Symmetric Up-Down

setInputCol(value)[source]¶

Parameters: inputCol – The name of the input column

setOutputCol(value)[source]¶

Parameters: outputCol – The name of the output column

setParams(flipLeftRight=True, flipUpDown=False, inputCol='image', outputCol='ImageSetAugmenter_aea500e47fd0_output')[source]¶: Set the (keyword only) parameters

mmlspark.opencv.ImageTransformer module¶

class mmlspark.opencv.ImageTransformer.ImageTransformer(java_obj=None, inputCol='image', outputCol='ImageTransformer_e4c7d0112202_output', stages=None)[source]¶

Bases: mmlspark.opencv._ImageTransformer._ImageTransformer

Resizes the image to the given width and height

Parameters

height (int) – The height to resize to (>=0)
width (int) – The width to resize to (>=0)

blur(height, width)[source]¶

Blurs the image using a normalized box filter

Parameters

height (double) – The height of the box filter (>= 0)
width (double) – The width of the box filter (>= 0)

colorFormat(format)[source]¶

Formats the image to the given image format

Parameters: format (int) – The format to convert to, please see OpenCV cvtColor function documentation for all formats

crop(x, y, height, width)[source]¶

Crops the image given the starting x,y coordinates and the width and height

Parameters

x (int) – The initial x coordinate (>=0)
y (int) – The initial y coordinate (>=0)
height (int) – The height to crop to (>=0)
width (int) – The width to crop to (>=0)

flip(flipCode=1)[source]¶

gaussianKernel(appertureSize, sigma)[source]¶

Blurs the image by applying a gaussian kernel

Parameters

appertureSize (double) – The aperture size, which should be odd and positive
sigma (double) – The standard deviation of the gaussian

resize(height, width)[source]¶

Resizes the image to the given width and height

Parameters

height (int) – The height to resize to (>=0)
width (int) – The width to resize to (>=0)

threshold(threshold, maxVal, thresholdType)[source]¶

Thresholds the image, please see OpenCV threshold function documentation for more information

Parameters

threshold – (double) The threshold value
maxVal (double) – The maximum value to use
thresholdType (double) – The type of threshold, can be binary, binary_inv, trunc, zero, zero_inv

mmlspark.opencv.ImageTransformer.toImage(array, path='', mode=16)[source]¶

Converts a one-dimensional array to a 2 dimensional image

Parameters

array (array) –
path (str) –
ocvType (int) –

Returns

2 dimensional image

Return type

object

mmlspark.opencv.ImageTransformer.toNDArray(image)[source]¶

Converts an image to a 1-dimensional array

Parameters: image (object) – The image to be converted
Returns: The image as a 1-dimensional array
Return type: array

Module contents¶

MMLSpark is an ecosystem of tools aimed towards expanding the distributed computing framework Apache Spark in several new directions. MMLSpark adds many deep learning and data science tools to the Spark ecosystem, including seamless integration of Spark Machine Learning pipelines with Microsoft Cognitive Toolkit (CNTK), LightGBM and OpenCV. These tools enable powerful and highly-scalable predictive and analytical models for a variety of datasources.

MMLSpark also brings new networking capabilities to the Spark Ecosystem. With the HTTP on Spark project, users can embed any web service into their SparkML models. In this vein, MMLSpark provides easy to use SparkML transformers for a wide variety of Microsoft Cognitive Services. For production grade deployment, the Spark Serving project enables high throughput, sub-millisecond latency web services, backed by your Spark cluster.

MMLSpark requires Scala 2.11, Spark 2.4+, and Python 3.5+.