mmlspark.io.http package
Submodules
mmlspark.io.http.CustomInputParser module
-
class mmlspark.io.http.CustomInputParser.CustomInputParser(inputCol=None, outputCol=None, udfPython=None, udfScala=None)
Bases: mmlspark.core.schema.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer
- Parameters
-
getUdfPython()
- Returns
User-defined Python function to be applied to the DataFrame input column
- Return type
-
setParams(inputCol=None, outputCol=None, udfPython=None, udfScala=None)
Set the (keyword only) parameters
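As a rough illustration of what "a user-defined Python function applied to the input column" means, the sketch below uses plain Python dictionaries in place of a Spark DataFrame. The helper names (`to_request_body`, `apply_to_column`) and the column names are hypothetical and are not part of mmlspark; the exact signature that `udfPython` expects is not stated on this page.

```python
import json


def to_request_body(value):
    """Hypothetical user-defined function: wrap a raw value in a JSON body."""
    return json.dumps({"text": value})


def apply_to_column(rows, input_col, output_col, func):
    """Apply func to each row's input_col and store the result in output_col.

    Plain-Python stand-in for what a transformer does with inputCol/outputCol;
    it does not use Spark or mmlspark.
    """
    return [{**row, output_col: func(row[input_col])} for row in rows]


# Rows stand in for DataFrame records; "review" and "request" are made-up names.
rows = [{"review": "great"}, {"review": "poor"}]
parsed = apply_to_column(rows, "review", "request", to_request_body)
```

Each input row keeps its original fields and gains one new field, which mirrors the inputCol/outputCol convention used by the transformers on this page.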
mmlspark.io.http.CustomOutputParser module
-
class mmlspark.io.http.CustomOutputParser.CustomOutputParser(inputCol=None, outputCol=None, udfPython=None, udfScala=None)
Bases: mmlspark.core.schema.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer
- Parameters
-
getUdfPython()
- Returns
User-defined Python function to be applied to the DataFrame input column
- Return type
-
setParams(inputCol=None, outputCol=None, udfPython=None, udfScala=None)
Set the (keyword only) parameters
mmlspark.io.http.HTTPTransformer module
-
class mmlspark.io.http.HTTPTransformer.HTTPTransformer(concurrency=1, concurrentTimeout=100.0, handler=None, inputCol=None, outputCol=None, timeout=60.0)
Bases: mmlspark.core.schema.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer
- Parameters
concurrency (int) – max number of concurrent calls (default: 1)
concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
handler (object) – Which strategy to use when handling requests (default: UserDefinedFunction(<function2>,StringType,None))
inputCol (str) – The name of the input column
outputCol (str) – The name of the output column
timeout (double) – number of seconds to wait before closing the connection (default: 60.0)
-
getConcurrentTimeout()
- Returns
max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
- Return type
double
-
getHandler()
- Returns
Which strategy to use when handling requests (default: UserDefinedFunction(<function2>,StringType,None))
- Return type
object
-
getTimeout()
- Returns
number of seconds to wait before closing the connection (default: 60.0)
- Return type
double
-
setConcurrency(value)
- Parameters
concurrency (int) – max number of concurrent calls (default: 1)
-
setConcurrentTimeout(value)
- Parameters
concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
-
setHandler(value)
- Parameters
handler (object) – Which strategy to use when handling requests (default: UserDefinedFunction(<function2>,StringType,None))
-
setParams(concurrency=1, concurrentTimeout=100.0, handler=None, inputCol=None, outputCol=None, timeout=60.0)
Set the (keyword only) parameters
- Parameters
concurrency (int) – max number of concurrent calls (default: 1)
concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
handler (object) – Which strategy to use when handling requests (default: UserDefinedFunction(<function2>,StringType,None))
inputCol (str) – The name of the input column
outputCol (str) – The name of the output column
timeout (double) – number of seconds to wait before closing the connection (default: 60.0)
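The relationship between `concurrency` and `concurrentTimeout` can be pictured with a plain-Python analogy: a bounded pool of workers issues calls, and each pending result (a future) is awaited for at most a fixed number of seconds. The sketch below uses only the standard library and is not how mmlspark implements the transformer; `fake_call` and `run_calls` are hypothetical names, and no real HTTP traffic is sent.

```python
import time
from concurrent.futures import ThreadPoolExecutor


def fake_call(payload):
    """Stand-in for an HTTP call: sleep briefly, then echo the payload."""
    time.sleep(0.01)
    return {"echo": payload}


def run_calls(payloads, concurrency=1, concurrent_timeout=100.0):
    """Issue calls with at most `concurrency` in flight at once.

    Each future is awaited for at most `concurrent_timeout` seconds;
    a slower call raises concurrent.futures.TimeoutError.
    """
    with ThreadPoolExecutor(max_workers=concurrency) as pool:
        futures = [pool.submit(fake_call, p) for p in payloads]
        # Collect in submission order so outputs line up with inputs.
        return [f.result(timeout=concurrent_timeout) for f in futures]


results = run_calls(["a", "b", "c"], concurrency=2, concurrent_timeout=5.0)
```

Raising `concurrency` lets more calls overlap, while `concurrent_timeout` bounds how long the caller waits on any single pending result.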
mmlspark.io.http.JSONInputParser module
-
class mmlspark.io.http.JSONInputParser.JSONInputParser(headers={}, inputCol=None, method='POST', outputCol=None, url=None)
Bases: mmlspark.core.schema.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer
- Parameters
-
getMethod()
- Returns
HTTP method to use for the request: PUT, POST, or PATCH (default: POST)
- Return type
str
-
setMethod(value)
- Parameters
method (str) – HTTP method to use for the request: PUT, POST, or PATCH (default: POST)
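To make the roles of `url`, `method`, and `headers` concrete, here is a minimal standard-library sketch of turning one input value into a JSON request object. It is only an analogy for what a JSON input parser produces, not mmlspark's implementation; `build_json_request`, the localhost URL, and the `X-Demo` header are assumptions made for illustration, and the request is constructed but never sent.

```python
import json
from urllib.request import Request


def build_json_request(value, url, method="POST", headers=None):
    """Serialize `value` as JSON and wrap it in an (unsent) HTTP request."""
    body = json.dumps(value).encode("utf-8")
    all_headers = {"Content-Type": "application/json"}
    all_headers.update(headers or {})  # caller-supplied headers take precedence
    return Request(url, data=body, headers=all_headers, method=method)


# Hypothetical endpoint and header, used only to show the resulting request.
req = build_json_request(
    {"text": "hello"},
    "http://localhost:8080/score",
    headers={"X-Demo": "1"},
)
```

The method defaults to POST here to match the default documented above; PUT or PATCH can be passed in the same way.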
mmlspark.io.http.JSONOutputParser module
mmlspark.io.http.PartitionConsolidator module
-
class mmlspark.io.http.PartitionConsolidator.PartitionConsolidator(concurrency=1, concurrentTimeout=100.0, inputCol=None, outputCol=None, timeout=60.0)
Bases: mmlspark.core.schema.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer
- Parameters
concurrency (int) – max number of concurrent calls (default: 1)
concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
inputCol (str) – The name of the input column
outputCol (str) – The name of the output column
timeout (double) – number of seconds to wait before closing the connection (default: 60.0)
-
getConcurrentTimeout()
- Returns
max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
- Return type
double
-
getTimeout()
- Returns
number of seconds to wait before closing the connection (default: 60.0)
- Return type
double
-
setConcurrency(value)
- Parameters
concurrency (int) – max number of concurrent calls (default: 1)
-
setConcurrentTimeout(value)
- Parameters
concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
-
setParams(concurrency=1, concurrentTimeout=100.0, inputCol=None, outputCol=None, timeout=60.0)
Set the (keyword only) parameters
- Parameters
concurrency (int) – max number of concurrent calls (default: 1)
concurrentTimeout (double) – max number of seconds to wait on futures if concurrency >= 1 (default: 100.0)
inputCol (str) – The name of the input column
outputCol (str) – The name of the output column
timeout (double) – number of seconds to wait before closing the connection (default: 60.0)
mmlspark.io.http.ServingFunctions module
mmlspark.io.http.SimpleHTTPTransformer module
-
class mmlspark.io.http.SimpleHTTPTransformer.SimpleHTTPTransformer(concurrency=1, concurrentTimeout=100.0, errorCol=None, flattenOutputBatches=None, handler=None, inputCol=None, inputParser=None, miniBatcher=None, outputCol=None, outputParser=None, timeout=60.0)
Bases: mmlspark.io.http._SimpleHTTPTransformer._SimpleHTTPTransformer
mmlspark.io.http.StringOutputParser module
-
class mmlspark.io.http.StringOutputParser.StringOutputParser(inputCol=None, outputCol=None)
Bases: mmlspark.core.schema.Utils.ComplexParamsMixin, pyspark.ml.util.JavaMLReadable, pyspark.ml.util.JavaMLWritable, pyspark.ml.wrapper.JavaTransformer
- Parameters
inputCol (str) – The name of the input column
outputCol (str) – The name of the output column
Module contents
MicrosoftML is a library of Python classes that interface with the Microsoft Scala APIs, which use Apache Spark to create distributed machine learning models.
MicrosoftML simplifies training and scoring classifiers and regressors, and facilitates the creation of models using the CNTK library, images, and text.