com.databricks.labs.automl.tracking
This may seem a bit counter-intuitive, but it allows for cloud-agnostic storage of the config. Otherwise, a configuration would need to be created to detect which cloud the toolkit is operating on and to handle native SDK object writers. Instead of re-inventing the wheel here, a DataFrame can be serialized to any cloud-native storage medium with very little issue.
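A minimal sketch of that idea, assuming a local SparkSession; the column name and the commented-out path are illustrative, not the toolkit's actual values:

```scala
import org.apache.spark.sql.SparkSession

// Wrap the json config string in a one-row, one-column DataFrame and let
// Spark's writers handle the target filesystem (DBFS, S3, ADLS, GCS, ...).
val spark = SparkSession.builder().master("local[1]").appName("cfg").getOrCreate()
import spark.implicits._

val configJson = """{"modelFamily":"RandomForest"}"""
val configDf = Seq(configJson).toDF("config")

// Any cloud-native location works without SDK-specific writer code, e.g.:
// configDf.write.mode("overwrite").parquet("dbfs:/tmp/inference_config")
```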
The inference configuration generated for a particular modeling run
A DataFrame consisting of a single row and a single field. Cell 1:1 contains the json string.
Handler method for converting the InferenceMainConfig object to a serializable Json String with correct Scala-compatible data structures.
instance of InferenceMainConfig
[InferenceJsonReturn] consisting of a compact form (for logging) and a pretty-print form (human readable)
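The compact/pretty split can be sketched with json4s (an assumption about the serialization library; `InferenceConfigSketch` is a tiny stand-in for the much larger InferenceMainConfig):

```scala
import org.json4s.{Formats, NoTypeHints}
import org.json4s.jackson.Serialization

// Hypothetical, simplified stand-in for InferenceMainConfig.
case class InferenceConfigSketch(modelFamily: String, labelCol: String)

implicit val formats: Formats = Serialization.formats(NoTypeHints)
val cfg = InferenceConfigSketch("RandomForest", "label")

val compactJson = Serialization.write(cfg)       // single line, for logging
val prettyJson  = Serialization.writePretty(cfg) // indented, human readable
```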
Handler method for converting a read-in json config String to an instance of InferenceMainConfig
the config as a Json-formatted String
the config as an instance of InferenceMainConfig
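Reading the config back is the mirror image, again sketched with json4s and a stand-in case class:

```scala
import org.json4s.{Formats, NoTypeHints}
import org.json4s.jackson.Serialization

// Hypothetical, simplified stand-in for InferenceMainConfig.
case class InferenceConfigSketch(modelFamily: String, labelCol: String)

implicit val formats: Formats = Serialization.formats(NoTypeHints)

// Deserialize the Json-formatted String back into the config case class.
val json = """{"modelFamily":"RandomForest","labelCol":"label"}"""
val restored = Serialization.read[InferenceConfigSketch](json)
```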
Extract the InferenceMainConfig from a stored DataFrame containing the string-encoded json in row 1, column 1
A DataFrame that contains the configuration for the Inference run.
an instance of InferenceMainConfig
From a supplied DataFrame that contains the configuration in cell 1:1, get the json string
A DataFrame that contains the configuration for the Inference run.
The string-encoded json payload for InferenceMainConfig
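Since the payload lives in row 1, column 1, recovering it is a single `first()`/`getString(0)` call, sketched here against an in-memory DataFrame (column name illustrative):

```scala
import org.apache.spark.sql.SparkSession

val spark = SparkSession.builder().master("local[1]").appName("cfg").getOrCreate()
import spark.implicits._

// A stored config DataFrame: one row, one column, cell 1:1 holds the json.
val configDf = Seq("""{"modelFamily":"RandomForest"}""").toDF("config")

// Pull the string-encoded json payload out of cell 1:1.
val jsonPayload = configDf.first().getString(0)
```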
Get a single MlFlow client for this instance of the object, reducing garbage collection overhead by not creating a new client each time the object is called. As of 0.7.1.
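The single-client pattern can be sketched with Scala's `lazy val`, which initializes the field once on first access; `FakeClient` is a stand-in for the real MLflow client, which is not pulled in here:

```scala
// Counts constructions so the once-only behavior is observable.
object ClientCounter { var created = 0 }

// Stand-in for the real MLflow tracking client (assumption).
class FakeClient(val trackingUri: String) { ClientCounter.created += 1 }

class MLFlowTrackerSketch(trackingUri: String) {
  // lazy val: the client is built once per tracker instance, on first use,
  // instead of being re-created (and garbage collected) on every call.
  lazy val client: FakeClient = new FakeClient(trackingUri)
}
```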
Method for either getting an existing experiment by name, or creating a new one by name and returning the id
the experiment id, taken from either the existing experiment or the newly created one.
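The get-or-create logic can be sketched with a map-backed stub in place of the real MLflow client (the stub's method names mirror, but are not, the real client API):

```scala
import scala.collection.mutable

// Minimal stand-in for the MLflow tracking client (assumption).
class StubClient {
  private val experiments = mutable.Map.empty[String, String]
  private var nextId = 0

  def getExperimentByName(name: String): Option[String] = experiments.get(name)

  def createExperiment(name: String): String = {
    nextId += 1
    val id = nextId.toString
    experiments(name) = id
    id
  }
}

// Reuse the existing experiment's id if the name is registered,
// otherwise create a new experiment and return its id.
def getOrCreateExperimentId(client: StubClient, name: String): String =
  client.getExperimentByName(name).getOrElse(client.createExperiment(name))
```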
Public method for logging a model, parameters, and metrics to MlFlow
Full collection of parameters, results, and models for the AutoML experiment
Type of Model Family used (e.g. "RandomForest")
Type of Model used (e.g. "regression")
This method does not save any artifacts or inference configs. For the Best Model logging mode, it logs params and metrics to a given mlFlowRunId. For the tuning logging mode, it logs params and metrics to separate mlFlowRunIds.
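The two logging modes can be sketched abstractly; the names below (`logTuningResults`, the `log` callback standing in for a client's metric-logging call) are illustrative, not the toolkit's API:

```scala
sealed trait LoggingMode
case object BestModel extends LoggingMode
case object Tuning extends LoggingMode

// runs: (mlFlowRunId, metrics) per tuning candidate.
// log:  stand-in for a client call such as logMetric(runId, key, value).
def logTuningResults(
    mode: LoggingMode,
    bestRunId: String,
    runs: Seq[(String, Map[String, Double])],
    log: (String, String, Double) => Unit
): Unit = mode match {
  case BestModel =>
    // Best Model mode: everything goes to the single supplied run id.
    runs.foreach { case (_, metrics) =>
      metrics.foreach { case (k, v) => log(bestRunId, k, v) }
    }
  case Tuning =>
    // Tuning mode: each candidate logs to its own separate run id.
    runs.foreach { case (runId, metrics) =>
      metrics.foreach { case (k, v) => log(runId, k, v) }
    }
}
```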