package datarobot
- Alphabetic
- Public
- All
Type Members
-
case class
AdvancedOptions(blueprintThreshold: Option[Int] = None, responseCap: Option[Double] = None, seed: Option[Int] = None, weights: Option[String] = None, rateTopPctThreshold: Option[Float] = None, offset: Option[String] = None, exposure: Option[String] = None, eventsCount: Option[String] = None, smartDownsampled: Option[Boolean] = None, majorityDownsamplingRate: Option[Double] = None, downsampledMinorityRows: Option[Int] = None, downsampledMajorityRows: Option[Int] = None, accuracyOptimizedMb: Option[Boolean] = None, scaleoutModelingMode: Option[Boolean] = None, defaultMonotonicIncreasingFeaturelistId: Option[String] = None, defaultMonotonicDecreasingFeaturelistId: Option[String] = None, onlyIncludeMonotonicBlueprints: Option[Boolean] = None, blendBestModels: Option[Boolean] = None, minSecondaryValidationModelCount: Option[Int] = None, scoringCodeOnly: Option[Boolean] = None, prepareModelForDeployment: Option[Boolean] = None, allowedPairwiseInteractionGroups: Option[Seq[Seq[String]]] = None) extends Product with Serializable
- blueprintThreshold
an upper bound on running time (in hours), such that models exceeding the bound will be excluded in subsequent autopilot runs
- responseCap
defaults to False, if specified used to cap the maximum response of a model
- seed
defaults to null, the random seed to be used if specified
- weights
the name of the weight column, if specified, otherwise null.
- rateTopPctThreshold
- Optional, the percentage threshold between 0.1 and 50 for specifying the Rate@Top% metric.
- offset
the list of names of the offset columns, if specified, other- wise null.
- exposure
the name of the exposure column, if specified, other- wise null.
- eventsCount
the name of the event count column, if specified, otherwise null.
- smartDownsampled
whether the project uses smart downsampling to throw away excess rows of the majority class. Smart downsampled projects express all sample percents in terms of percent of minority rows (as opposed to percent of all rows).
- majorityDownsamplingRate
the percentage be- tween 0 and 100 of the majority rows that are kept, or null for projects without smart down- sampling
- downsampledMinorityRows
the total number of the minority rows available for modeling, or null for projects without smart downsampling
- downsampledMajorityRows
the total number of the majority rows available for modeling, or null for projects without smart downsampling
- accuracyOptimizedMb
Include additional, longer-running models that will be run by the autopilot and available to run manually
- scaleoutModelingMode
Specifies the behavior of Scaleout models for the project. This is one of disabled, repositoryOnly, autopilot
- defaultMonotonicIncreasingFeaturelistId
null or str, the ID of the featurelist specifying a set of features with a monotonically increasing relationship to the target. All blueprints generated in the project use this as their default monotonic constraint, but it can be overriden at model submission time.
- defaultMonotonicDecreasingFeaturelistId
null or str, the ID of the featurelist specifying a set of features with a monotonically decreasing relationship to the target. All blueprints generated in the project use this as their default monotonic constraint, but it can be overriden at model submission time.
- onlyIncludeMonotonicBlueprints
boolean (default to False), whether the project only includes blueprints support enforcing monotonic constraints
- blendBestModels
optional, defaults to True. Blend best models during Autopilot run.
- minSecondaryValidationModelCount
optional, defaults to 0. Compute “All backtest” scores (datetime models) or cross validation scores for the specified number of highest ranking models on the Leaderboard, if over the Autopilot default.
- scoringCodeOnly
optional, defaults to False. Keep only models that can be converted to scorable java code during Autopilot run.
- prepareModelForDeployment
optional, defaults to True. Prepare model for deployment during Autopilot run. The preparation includes creating reduced feature list models, retraining best model on higher sample size, computing insights and assigning “RECOMMENDED FOR DEPLOYMENT” label.
- allowedPairwiseInteractionGroups
(array) –) op- tional. For GAM models - specify groups of columns for which pairwise interactions will be allowed. E.g. if set to “B”, “C”], [“C”, “D” then GAM models will allow interactions between columns AxB, BxC, AxC, CxD. All others (AxD, BxD) will not be considered. If not specified - all possible interactions will be considered by model.
- returns
AdvancedOptions object
-
case class
AdvancedTuningSession(model: Model, description: Option[String] = None)(implicit client: DataRobotClient) extends Product with Serializable
- model
model to be tuned
- description
description of the tuning session
- case class Backtest(status: String, index: Int, score: Double, trainingDuration: String, trainingStartDate: String, trainingRowCount: String, trainingEndDate: String) extends Product with Serializable
- case class BinaryPrediction(rowId: Int, prediction: String, positiveProbability: Double, predictionValues: Seq[ClassificationPredictionValue]) extends Prediction with Product with Serializable
-
case class
Blueprint(projectId: Option[String], processes: Option[Array[String]], id: Option[String], modelType: Option[String], blueprintCategory: Option[String], monotonicIncreasingFeaturelistId: Option[String], monotonicDecreasingFeaturelistId: Option[String], supportsMonotonicConstraints: Option[Boolean]) extends Product with Serializable
Blueprint
Blueprint
- projectId
the project the blueprint belongs to
- processes
a list of strings representing processes the blueprint uses
- id
the blueprint ID of this blueprint - note that this is not an ObjectId
- modelType
the model this blueprint will produce
- blueprintCategory
(New in version v2.6) describes the category of the blueprint and indicates the kind of model this blueprint produces. Will be either “DataRobot” or “Scaleout DataRobot”.
- monotonicIncreasingFeaturelistId
the ID of the featurelist that defines the set of features with a monotonically increasing relationship to the target. If null, no such constraints are enforced.
- monotonicDecreasingFeaturelistId
the ID of the featurelist that defines the set of features with a monotonically decreasing relationship to the target. If null, no such constraints are enforced.
- supportsMonotonicConstraints
whether this model supports enforcing montonic constraints
- case class BlueprintEdge(from: Option[String], to: Option[String]) extends Product with Serializable
- case class BlueprintNode(id: Option[String], label: Option[String]) extends Product with Serializable
-
case class
Calendar(Id: String, created: String, name: String, source: String, numEvents: Int, numEventTypes: Int, calendarStartDate: String, calendarEndDate: String, projectId: Seq[String], role: String, multiseriesIdColumns: Seq[String]) extends Product with Serializable
- Id
– I know i know. Capitalized field, ugh. Did this the return from api request has this capitalized
- created
ISO-8601 string with the time that this calendar was created
- name
the name of this calendar. This will be source if no name was specified
- source
the filename of the uploaded calendar
- numEvents
- number of dates marked as having events in the calendard.
- numEventTypes
the number of distinct eventTypes in this calendar
- calendarStartDate
ISO-8601 date string of the earliest event seen in this cal- endar
- projectId
projectIds of projects currently using this calendar
- role
the role the requesting user has on this calendar
- multiseriesIdColumns
array of multiseries ID column names in calendar file. Currently only one multiseries ID column is supported.
- case class ClassificationPredictionValue(label: String, value: Double) extends Product with Serializable
- case class Coefficient(coefficient: Double, originalFeature: String, stageCoefficients: List[String], transformations: List[Map[String, String]], derivedFeature: String, type: String) extends Product with Serializable
- class DataRobotClient extends AnyRef
- class DateTimeBackTestsSetup extends AnyRef
- case class DateTimeBacktests(index: Option[Int], validationRowCount: Option[Int], primaryTrainingDuration: Option[String], primaryTrainingEndDate: Option[String], availableTrainingStartDate: Option[String], primaryTrainingStartDate: Option[String], validationEndDate: Option[String], availableTrainingDuration: Option[String], availableTrainingRowCount: Option[Int], gapEndDate: Option[String], validationDuration: Option[String], gapStartDate: Option[String], availableTrainingEndDate: Option[String], primaryTrainingRowCount: Option[Int], validationStartDate: Option[String], totalRowCount: Option[Int], gapRowCount: Option[Int], gapDuration: Option[String]) extends Product with Serializable
-
class
DateTimeModel extends Model
- returns
DateTimeModel object
-
case class
DateTimePartition(projectId: Option[String], datetimePartitionColumn: Option[String], dateFormat: Option[String], useTimeSeries: Option[Boolean], unsupervisedMode: Option[Boolean], defaultToAPriori: Option[Boolean], defaultToKnownInAdvance: Option[Boolean], defaultToDoNotDerive: Option[Boolean], featureDerivationWindowStart: Option[Int], featureDerivationWindowEnd: Option[Int], forecastWindowStart: Option[Int], forecastWindowEnd: Option[Int], windowsBasisUnit: Option[String], validationDuration: Option[String], availableTrainingStartDate: Option[String], availableTrainingDuration: Option[String], availableTrainingEndDate: Option[String], primaryTrainingStartDate: Option[String], primaryTrainingDuration: Option[String], primaryTrainingEndDate: Option[String], gapStartDate: Option[String], gapDuration: Option[String], gapEndDate: Option[String], holdoutStartDate: Option[String], holdoutDuration: Option[String], holdoutEndDate: Option[String], numberOfBacktests: Option[Int], autopilotDataSelectionMethod: Option[String], backtests: Option[Seq[DateTimeBacktests]], partitioningWarnings: Option[Seq[PartitionWarning]], featureSettings: Option[Seq[FeatureSetting]], numberOfKnownInAdvanceFeatures: Option[Int], numberOfDoNotDeriveFeatures: Option[Int], useCrossSeriesFeatures: Option[Boolean], aggregationType: Option[String], crossSeriesGroupByColumns: Option[String], calendarId: Option[String]) extends Product with Serializable
Date Time Partitioning use for a Time Series Project.
Date Time Partitioning use for a Time Series Project. This should not be constructed directly. See io.github.timsetsfire.datarobot.DateTimePartitioningMethod when setting up DateTime Parititioning for a project via UI.
- projectId
– The ID of the project
- datetimePartitionColumn
(string)–Thedatecolumnthatwillbeusedasadate-time partition column
- dateFormat
(string) – The date format of the partition column
- useTimeSeries
(boolean)–(Newinversionv2.8)Abooleanvalueindicatingwhether a time series project should be created instead of a regular project which uses datetime partitioning.
- unsupervisedMode
(boolean) – (New in version v2.20) A boolean value indicating whether an unsupervised project should be created
- defaultToAPriori
(boolean)–(Deprecatedinversionv2.11)RenamedtodefaultTo- KnownInAdvance. This parameter always has the same value as defaultToKnownInAdvance and will be removed in a future release.
- defaultToKnownInAdvance
(boolean) – (New in version v2.11) Indicates whether all features in a time series project default to being treated as known in advance, unless overridden by featureSettings. Features marked as known in advance must be specified into the future when making predictions. See the Time Series Overview for more context.
- defaultToDoNotDerive
(boolean) – (New in version v2.17) Indicates whether all features in a time series project default to being treated as do-not-derive features, which excludes them from feature derivation.
- featureDerivationWindowStart
– (New in version v2.8) Will only be specified for projects using time series. How many timeUnits of the datetimePartitionColumn into the past relative to the forecast point the feature derivation window should begin. Will be a negative integer, if present.
- featureDerivationWindowEnd
– (New in version 2.8) Will only be specified for projects using time series. How many timeUnits of the datetimePartitionColumn into the past relative to the forecast point the feature derivation window should end. Will be a non- positive integer, if present.
- forecastWindowStart
– (New in version v2.8) Will only be specified for projects using time series. How many timeUnits of the datetimePartitionColumn into the future relative to the forecast point the forecast window should start. Will be a non-negative integer, if present.
- forecastWindowEnd
– (New in version v2.8) Will only be specified for projects using time series. How many timeUnits of the datetimePartitionColumn into the future relative to the forecast point the forecast window should end. Will be a non-negative integer, if present.
- windowsBasisUnit
– (New in version v2.14) Will only be specified for projects using time series. Indicates which unit is basis for feature derivation window and forecast window. Will be either detected time unit or “ROW”.
- validationDuration
– The default validation duration for all backtests. Will not be specified if the primary date/time feature in a time series project is irregular.
- availableTrainingStartDate
–Thestartdateofavailabletrainingdataforscoring the holdout
- availableTrainingDuration
– The duration of available training duration for scor- ing the holdout
- availableTrainingEndDate
–Theenddateofavailabletrainingdataforscoringthe holdout
- primaryTrainingStartDate
– The start date of the primary training data for scoring the holdout
- primaryTrainingDuration
– The duration of the primary training data for scoring the holdout
- primaryTrainingEndDate
– The end date of the primary training data for scoring the holdout
- gapStartDate
– The start date of the gap between the training and holdout scoring data
- gapDuration
– The duration of the gap between the training and holdout scoring data
- gapEndDate
– The end date of gap between the training and holdout scoring data
- holdoutStartDate
– The start date of the holdout scoring data
- holdoutDuration
– The duration of the holdout scoring data
- holdoutEndDate
– The end date of the holdout scoring data
- numberOfBacktests
– The number of backtests used
- autopilotDataSelectionMethod
– Whether models created via the autopilot will use “rowCount” or “duration” as their dataSelectionMethod.
- backtests
– An array of the configured backtests
- partitioningWarnings
– An array of available warnings about potential problems with the chosen partitioning that could cause issues during modeling, although the partition- ing may be successfully submitted
- featureSettings
– An array of per feature settings
- numberOfKnownInAdvanceFeatures
– (New in version v2.14) Number of features that are marked as known in advance.
- numberOfDoNotDeriveFeatures
– (New in version v2.17) Number of features that are marked as “do not derive”.
- useCrossSeriesFeatures
(boolean) – (New in version v2.14) Indicating whether to use cross-series features.
- aggregationType
(string) – (New in version v2.14) The aggregation type to apply when creating cross-series features. Optional, must be one of “total” or “average”.
- crossSeriesGroupByColumns
(array) – (New in version v2.15) List of columns (currently of length 1). Optional setting that indicates how to further split series into related groups. For example, if every series is sales of an individual product, the series group-by could be the product category with values like “men’s clothing”, “sports equipment”, etc..
- calendarId
(string) – (new in version v2.15) Optional, the id of a calendar to use with this project.
-
case class
DateTimePartitioningMethod(validationType: Option[enums.ValidationType.Value] = None, datetimePartitionColumn: Option[String] = None, useTimeSeries: Option[Boolean] = None, unsupervisedMode: Option[Boolean] = None, multiseriesIdColumns: Option[Array[String]] = None, defaultToAPriori: Option[Boolean] = None, defaultToKnownInAdvance: Option[Boolean] = None, defaultToDoNotDerive: Option[Boolean] = None, featureDerivationWindowStart: Option[Int] = None, featureDerivationWindowEnd: Option[Int] = None, forecastWindowStart: Option[Int] = None, forecastWindowEnd: Option[Int] = None, windowsBasisUnit: Option[String] = None, validationDuration: Option[String] = None, disableHoldout: Option[Boolean] = None, holdoutStartDate: Option[String] = None, holdoutEndDate: Option[String] = None, holdoutDuration: Option[String] = None, gapDuration: Option[String] = None, numberOfBacktests: Option[Int] = None, autopilotDataSelectionMethod: Option[String] = None, treatAsExponential: Option[String] = None, differencingMethod: Option[String] = None, backtests: Option[Seq[DateTimeBackTestsSetup]] = None, featureSettings: Option[Seq[FeatureSetting]] = None, periodicities: Option[Seq[Periodicity]] = None, useCrossSeriesFeatures: Option[Boolean] = None, aggregationType: Option[String] = None, crossSeriesGroupByColumns: Option[Array[String]] = None, calendarId: Option[String] = None) extends PartitioningMethod with Product with Serializable
- datetimePartitionColumn
–Thedatecolumnthatwillbeusedasadate- time partition column
- useTimeSeries
(New in version v2.8) Optional, defaults to false. A boolean value indicating whether a time series project should be created instead of a regular project which uses datetime partitioning.
- unsupervisedMode
(New in version v2.20) Optional, defaults to false. A boolean value indicating whether an unsupervised project should be created.
- multiseriesIdColumns
(New in version v2.11) Optional, may only be specified for projects using time series. An array of column names identifying the multi- series id column(s) to use to identify series within the data. Currently only one multiseries id column may be specified. See the multiseries section of the docs for more context.
- defaultToAPriori
(Deprecated in version v2.11) Optional, renamed to defaultToKnownInAdvance, see below for more detail.
- defaultToKnownInAdvance
(New in version v2.11) Optional, for time series projects only. Sets whether all features default to being treated as known in advance features, which are features that are known into the future. Features marked as known in advance must be specified into the future when making predictions. The default is false, all features are not known in advance. Individual features can be set to a value different than the default using the featureSettings parameter. See the Time Series Overview for more context.
- defaultToDoNotDerive
(New in version v2.17) Optional, for time se- ries projects only. Sets whether all features default to being treated as do-not-derive features, excluding them from feature derivation. Individual features can be set to a value different than the default by using the featureSettings parameter.
- featureDerivationWindowStart
(New in version v2.8) Optional, may only be specified for projects using time series. How many timeUnits of the datetimeParti- tionColumn into the past relative to the forecast point the feature derivation window should begin. Must be a negative integer, if specified.
- featureDerivationWindowEnd
(int)–(Newinversion2.8)Optional,mayonlybe specified for projects using time series. How many timeUnits of the datetimePartitionCol- umn into the past relative to the forecast point the feature derivation window should end. Must be a non-positive integer, if specified.
- forecastWindowStart
(Newinversionv2.8)Optional,mayonlybespecified for projects using time series. How many timeUnits of the datetimePartitionColumn into the future relative to the forecast point the forecast window should start. Must be a non-negative integer, if specified.
- forecastWindowEnd
(New in version v2.8) Optional, may only be specified for projects using time series. How many timeUnits of the datetimePartitionColumn into the future relative to the forecast point the forecast window should end. Must be a non- negative integer, if specified.
- windowsBasisUnit
(New in version v2.14) Optional, may only be speci- fied for projects using time series. Indicates which unit is basis for feature derivation window and forecast window. Valid options are detected time unit or “ROW”. If omitted, the default value is detected time unit.
- validationDuration
Optional. A duration string representing the de- fault validation duration for all backtests. If the primary date/time feature in a time series project is irregular, you cannot set a default validation length. Instead, set each duration individually.
- disableHoldout
(New in version v2.8) Optional. A boolean value indi- cating whether date partitioning should skip allocating a holdout fold. If omitted, the default value is false. When specifying disableHoldout: true, holdoutStartDate and holdoutDura- tion must not be set.
- holdoutStartDate
Optional. A datetime string representing the start date of the holdout fold. When specifying holdoutStartDate, one of holdoutEndDate or holdout- Duration must also be specified. This attribute cannot be specified when disableHoldout is true.
- holdoutEndDate
Optional. A datetime string representing the end date of the holdout fold. When specifying holdoutEndDate, holdoutStartDate must also be speci- fied. This attribute cannot be specified when disableHoldout is true.
- holdoutDuration
Optional. A duration string representing the duration of the holdout fold. When specifying holdoutDuration, holdoutStartDate must also be spec- ified. This attribute cannot be specified when disableHoldout is true.
- gapDuration
Optional, a duration string representing the duration of the gap between the training and the holdout data for the holdout model. For time series projects, defaults to the duration of the gap between the end of the feature derivation win- dow and the beginning of the forecast window. For OTV projects, defaults to a zero duration (P0Y0M0D).
- numberOfBacktests
Optional, the number of backtests to use. If omitted, defaults to a positive value selected by the server based on the validation and gap durations.
- autopilotDataSelectionMethod
–Optional,either“duration”or“row- Count”. Defaults to “duration”. Whether models created via the autopilot will use “row- Count” or “duration” as their dataSelectionMethod.
- treatAsExponential
(New in version v2.9) Optional, defaults to “auto”. Used to specify whether to treat data as exponential trend and apply transformations like log-transform. Valid options are “always”, “never”, “auto”.
- differencingMethod
(New in version v2.9) Optional, defaults to “auto” for timeseries projects. Used to specify which differencing method to apply if the data is stationary. Valid options are “auto”, “simple”, “none”, “seasonal”. Parameter “periodicities” must be specified if “seasonal” is chosen.
- backtests
Optional. An array specifying individual backtests. The index of the backtests specified should range from 0 to numberOfBacktests - 1.
- featureSettings
(New in version v2.9) Optional, an array specifying per feature settings. Features can be left unspecified.
- periodicities
(Newinversionv2.9)Optional,alistofperiodicities.Ifthis is provided, parameter “differencing_method” will default to “seasonal” if not provided or “auto”.
- useCrossSeriesFeatures
(New in version v2.14) Indicating whether to use cross-series features.
- aggregationType
(New in version v2.14) The aggregation type to apply when creating cross-series features. Optional, must be one of “total” or “average”.
- crossSeriesGroupByColumns
(New in version v2.15) List of columns (currently of length 1). Optional setting that indicates how to further split series into related groups. For example, if every series is sales of an individual product, the series group-by could be the product category with values like “men’s clothing”, “sports equipment”, etc.. Must be used with multiseries and useCrossSeriesFeatures enabled.
- calendarId
– (New in version v2.15) Optional, the ID of the calendar to use with this project.
- class DateTimeProject extends Project
-
case class
Feature(id: Option[String], name: Option[String], projectId: Option[String], featureType: Option[String], importance: Option[Double], lowInformation: Option[Boolean], uniqueCount: Option[Int], naCount: Option[Int], dateFormat: Option[String], timeSeriesEligible: Option[Boolean], timeSeriesEligibilityReason: Option[String], timeUnit: Option[String], timeStep: Option[Int], min: Option[Double], max: Option[Double], mean: Option[Double], median: Option[Double], stdDev: Option[Double], targetLeakage: Option[String]) extends Product with Serializable
Feature
Feature
- id
(int) – the feature ID. (Note: Throughout the API, features are specified using their names, not this ID.)
- name
(string) – feature name
- projectId
(string) – the ID of the project the feature belongs to
- featureType
(string) – feature type: ‘Numeric’, ‘Categorical’, etc.
- importance
(float) – numeric measure of the strength of relationship between the feature and target (independent of any model or other features)
- lowInformation
(bool) – whether feature has too few values to be informative
- uniqueCount
(int) – number of unique values
- naCount
(int) – number of missing values
- dateFormat
(string)–(Newinversionv2.5)thedateformatstringforhowthisfeature was interpreted (or null if not a date feature). If not null, it will be compatible with https: //docs.python.org/2/library/time.html#time.strftime .
- timeSeriesEligible
(bool) – (New in version v2.8) whether this feature can be used as a datetime partitioning feature for time series projects. Only sufficiently regular date features can be selected as the datetime feature for time series projects. Always false for non-date features. Date features that cannot be used in datetime partitioning for a time series project may be eligible for an OTV project, which has less stringent requirements.
- timeSeriesEligibilityReason
(string)–(Newinversionv2.8)whythefeature is ineligible for time series projects, or “suitable” if it is eligible.
- timeUnit
(string) – (New in version v2.8) the unit for the interval between values of this feature, e.g. DAY, MONTH, HOUR. When specifying windows for time series projects, the windows are expressed in terms of this unit. Only present for date features eligible for time series projects, and null otherwise.
- timeStep
(int) – (New in version v2.8) The minimum time step that can be used to specify time series windows. The units for this value are the timeUnit. When specifying windows for time series projects, all windows must have durations that are integer multiples of this number. Only present for date features that are eligible for time series projects and null otherwise.
- min
– minimum value of the EDA sample of the feature.
- max
– maximum value of the EDA sample of the feature.
- mean
– arithmetic mean of the EDA sample of the feature.
- median
– median of the EDA sample of the feature.
- stdDev
– standard deviation of EDA sample of the feature.
- targetLeakage
(int) – whether or not the feature has target leakage. ‘SKIPPED_DETECTION’ indicates leakage detection was not run on the feature, ‘FALSE’ indicates no leakage, ‘MODERATE_RISK’ indicates a moderate risk of target leakage, and ‘HIGH_RISK’ indicates a high risk of target leakage
- case class FeatureAssocationDetails(feature1: String, feature2: String) extends Product with Serializable
- case class FeatureAssociation() extends Product with Serializable
- case class FeatureAssociationDetail(feature: Option[String], importanceSortIndex: Option[Int], clusterSortIndex: Option[Int], clusterName: Option[String], alphabeticSortIndex: Option[Int], clusterId: Option[Int], strengthSortIndex: Option[Int]) extends Product with Serializable
- case class FeatureAssociationMatrix(feature1: String, feature2: String) extends Product with Serializable
- case class FeatureEffect(featureType: Option[String], weightLabel: Option[String], featureName: Option[String], predictedVsActual: PredictedVsActualData, featureImpactScore: Double, partialDependence: PartialDependenceData) extends Product with Serializable
- case class FeatureEffects(projectId: String, modelId: String, featureEffects: List[FeatureEffect], source: String, backtestIndex: Option[String]) extends Product with Serializable
- case class FeatureFit(featureType: Option[String], weightLabel: Option[String], featureName: Option[String], predictedVsActual: PredictedVsActualData, featureImportanceScore: Double, partialDependence: PartialDependenceData) extends Product with Serializable
- case class FeatureFits(projectId: String, modelId: String, featureFit: List[FeatureFit], source: String, backtestIndex: Option[String]) extends Product with Serializable
- case class FeatureHistogram() extends Product with Serializable
- case class FeatureImpact(featureName: String, impactNormalized: Double, impactUnnormalized: Double, redundantWith: Option[String] = None) extends Product with Serializable
- case class FeatureImpacts(featureImpacts: List[FeatureImpact], shapBased: Boolean, count: Int, ranRedundancyDetection: Boolean, next: Option[String], previous: Option[String]) extends Product with Serializable
-
case class
FeatureSetting(featureName: String, aprior: Option[Boolean] = None, knownInAdvance: Option[Boolean] = None, doNotDerive: Option[Boolean] = None) extends Product with Serializable
FeatureSetting
FeatureSetting
- featureName
(string) – The name of the feature being specified.
- knownInAdvance
(boolean)–(Newinversionv2.11)Optional,fortimeseriesprojects only. Sets whether the feature is known in advance, i.e., values for future dates are known at prediction time. If not specified, the feature uses the value from the defaultToKnownInAd- vance flag.
- doNotDerive
(boolean) – (New in version v2.17) Optional, for time series projects only. Sets whether the feature is do-not-derive, i.e., is excluded from feature derivation. If not specified, the feature uses the value from the defaultToDoNotDerive flag.
-
case class
Featurelist(id: Option[String], projectId: Option[String], name: Option[String], features: Option[Seq[String]], numModels: Option[Int], created: Option[String], isUserCreated: Option[String], description: Option[String]) extends Product with Serializable
featurelist
featurelist
- id
(string) – the ID of the featurelist
- projectId
(string) – the ID of the project the featurelist belongs to
- name
(string) – the name of the featurelist
- features
(array) – a json array of names of features included in the featurelist
- numModels
(int) – (New in version v2.13) the number of models that currently use this featurelist. A model is considered to use a featurelist if it is used to train the model or as a monotonic constraint featurelist, or if the model is a blender with at least one component model using the featurelist.
- created
(string) – (New in version v2.13) a timestamp string specifying when the featurelist was created
- isUserCreated
(boolean) – (New in version v2.13) whether the featurelist was cre- ated manually by a user or by DataRobot automation
- description
(string) – (New in version v2.13) a user-friendly description of the fea- turelist, which can be updated by users
- class FrozenModel extends Model
- case class GroupCV(partitionKeyCols: String, reps: Int, holdoutPct: Double, seed: Int = 0) extends PartitioningMethod with Product with Serializable
- case class GroupTVH(partitionKeyCols: String, validationPct: Double, holdoutPct: Double, seed: Int = 0) extends PartitioningMethod with Product with Serializable
-
class
Job extends AnyRef
Jobs
- case class LiftChart(source: String, bins: List[Map[String, Double]]) extends Product with Serializable
- case class Metric(backtesting: Option[Double], holdout: Option[Double], backtestingScores: Seq[Option[Double]], crossValidation: Option[Double], validation: Option[Double]) extends Product with Serializable
-
class
Model extends AnyRef
Model
- case class ModelCoefficients(modelId: String, coefficients: List[Coefficient], intercept: Option[Double] = None, link: Option[String] = None) extends Product with Serializable
-
class
ModelJob extends Job
ModelJob
- case class ModelRecommendation(projectId: String, recommendationType: enums.ModelRecommendationType.Value, modelId: String) extends Product with Serializable
- case class ModelingFeature(uniqueCount: Int, lowInformation: Boolean, name: String, featureType: String, importance: Double, max: Option[String], dateFormat: Option[String], median: Option[String], targetLeakage: String, min: Option[String], stdDev: Option[Double], projectId: String, naCount: Option[Int], parentFeatureNames: Option[Seq[String]], featureLineageId: Option[String], mean: Option[String]) extends Product with Serializable
- case class ModelingFeaturelist(id: Option[String], projectId: Option[String], name: Option[String], features: Option[Seq[String]], numModels: Option[Int], created: Option[String], isUserCreated: Option[String], description: Option[String]) extends Product with Serializable
- case class MulticlassPrediction(rowId: Int, prediction: String, positiveProbability: Option[Double] = None, predictionValues: Seq[ClassificationPredictionValue]) extends Prediction with Product with Serializable
- case class MultiseriesMetrics(multiseriesId: String, validationScore: Double, backtestingScore: Option[Double], rowCount: Int, multiseriesValues: Seq[String], holdoutScore: Option[Double], duration: String) extends Product with Serializable
- case class NGram(count: Int, coefficient: Double, ngram: String, variable: String, class: String, frequency: Double, isStopword: Boolean) extends Product with Serializable
- case class PartialDependence(dependence: Double, label: String) extends Product with Serializable
- case class PartialDependenceData(isCapped: Boolean, data: List[PartialDependence]) extends Product with Serializable
-
class
Partition extends AnyRef
Partitioning for a given project that will be returned as part of a response from API
- case class PartitionWarning(partition: String, warnings: Seq[String], backtestIndex: Int) extends Product with Serializable
- trait PartitioningMethod extends AnyRef
- case class Periodicity(timeSteps: Option[Int] = None, timeUnit: Option[String] = None) extends Product with Serializable
-
class
PredictJob extends Job
PredictJob
- case class PredictedVsActual(bin: List[String], rowCount: Option[Double], actual: Option[Double], predicted: Option[Double], label: Option[String]) extends Product with Serializable
- case class PredictedVsActualData(isCapped: Boolean, data: List[PredictedVsActual]) extends Product with Serializable
- trait Prediction extends AnyRef
- case class PredictionDataset(name: String, id: String, projectId: String, numColumns: Int, numRows: Int, created: String, forecastPoint: Option[String], forecastPointCutoff: Option[String], predictionsStartDate: Option[String], predictionsEndDate: Option[String], actualValueColumn: Option[String], hasTimeSeriesExpandedDataset: Boolean, containsTargetValues: Option[Boolean], detectedActualValueColumns: Option[Array[String]], dataQualityWarnings: Map[String, Boolean], dataStartDate: Option[String], dataEndDate: Option[String], forecastPointRange: Seq[String], maxForecastDate: Option[String]) extends Product with Serializable
- case class Predictions[T <: Prediction](positiveClass: Option[String], predictions: Seq[T], task: String, includePredictionIntervals: Option[Boolean], predictionIntervalsSize: Option[Int]) extends Product with Serializable
- case class PredictionsMetaData(actualValueColumn: Option[String], forecastPoint: Option[String], predictionIntervalsSize: Option[String], url: String, projectId: String, predictionsEndDate: Option[String], includesPredictionIntervals: Option[Boolean], predictionsStartDate: Option[String], maxExplanations: Option[Int], explanationAlgorithm: Option[String], id: String, datasetId: String, modelId: String) extends Product with Serializable
- case class PrimeFileMetaData(language: String, isValid: Boolean, rulesetId: String, parentModelId: String, projectId: String, id: String, modelId: String) extends Product with Serializable
- class PrimeModel extends Model
- case class PrimeRuleset(projectId: String, rulesetId: String, score: Double, parentModelId: String, ruleCount: Int, modelId: Option[String]) extends Product with Serializable
- case class PrimeRulesets(rules: List[PrimeRuleset]) extends Product with Serializable
- class Project extends AnyRef
- case class RandomCV(reps: Int, holdoutPct: Double, seed: Int = 0) extends PartitioningMethod with Product with Serializable
- case class RandomTVH(validationPct: Double, holdoutPct: Double, seed: Int = 0) extends PartitioningMethod with Product with Serializable
-
case class
Recommender(recommenderItemId: Option[String], isRecommender: Option[Boolean], recommenderUserId: Option[String]) extends Product with Serializable
The recommender object will describe additional options specified if the project is used for a recommender problem.
The recommender object will describe additional options specified if the project is used for a recommender problem. It will be of the following form:
- recommenderItemId
if a recommender problem, the name of the column containing item ids, otherwise null
- isRecommender
indicates whether the project is a recommender problem
- recommenderUserId
if a recommender problem, the name of the column containing user ids, otherwise null
- case class RegressionPrediction(rowId: Int, prediction: Double, positiveProbability: Option[Double] = None) extends Prediction with Product with Serializable
- case class Residual(actual: Double, predicted: Double, residual: Double, rownumber: Double) extends Product with Serializable
- case class ResidualData(residualMean: Double, coefficientOfDetermination: Double, data: List[List[Double]], histogram: List[ResidualHistogramInterval]) extends Product with Serializable
- case class ResidualHistogramInterval(intervalEnd: Double, intervalStart: Double, occurrences: Int) extends Product with Serializable
- case class Residuals(source: String, data: ResidualData) extends Product with Serializable
- case class RocCurve(source: String, rocPoints: List[RocPoint], negativeClassPredictions: List[Double], positiveClassPrediction: List[Double]) extends Product with Serializable
- case class RocPoint(liftNegative: Double, f1Score: Double, truePositiveRate: Double, accuracy: Double, fractionPredictedAsNegative: Double, falseNegativeScore: Double, liftPositive: Double, falsePositiveRate: Double, matthewsCorrelationCoefficient: Double, trueNegativeRate: Double, negativePredictiveValue: Double, falsePositiveScore: Double, positivePredictiveValue: Double, threshold: Double, fractionPredictedAsPositive: Double, truePositiveScore: Double, trueNegativeScore: Double) extends Product with Serializable
- class Status extends AnyRef
- case class StratifedCV(reps: Int, holdoutPct: Double, seed: Int = 0) extends PartitioningMethod with Product with Serializable
- case class StratifiedTVH(validationPct: Double, holdoutPct: Double, seed: Int = 0) extends PartitioningMethod with Product with Serializable
- case class TimeSeriesClassification(rowId: Int, prediction: Double, positiveProbability: Option[Double] = None, forecastPoint: String, forecastDistance: Int, timestampe: String, seriesId: String, predictionThreshhold: Double, predictionValues: Seq[ClassificationPredictionValue], predictionIntervalLowerBound: Double, predictionIntervalUpperBound: Double, target: Option[String] = None) extends Prediction with Product with Serializable
- case class TimeSeriesPrediction(rowId: Int, prediction: Double, positiveProbability: Option[Double] = None, forecastPoint: String, forecastDistance: Int, timestampe: String, seriesId: String, predictionIntervalLowerBound: Double, predictionIntervalUpperBound: Double, target: Option[String] = None) extends Prediction with Product with Serializable
- case class TrainingInfo(predictionTrainingDuration: String, holdoutTrainingStartDate: String, predictionTrainingStartDate: String, holdoutTrainingDuration: String, predictionTrainingRowCount: Int, holdoutTrainingEndDate: String, predictionTrainingEndDate: String, holdoutTrainingRowCount: Int) extends Product with Serializable
- case class UserCV(userPartitionCol: String, cvHoldoutPct: Double, seed: Int = 0) extends PartitioningMethod with Product with Serializable
- case class UserTVH(userPartitionCol: String, trainingLevel: String, valdiationLevel: String, holdoutLevel: String, seed: Int = 0) extends PartitioningMethod with Product with Serializable
- case class WordCloud(ngrams: List[NGram]) extends Product with Serializable
Value Members
- object Blueprint extends Serializable
- object Calendar extends Serializable
- object DataRobotClient
- object DateTimeModel
- object DateTimeProject
- object Feature extends Serializable
- object FeatureAssociations
- object FeatureInsights
- object Featurelist extends Serializable
- object Implicits
- object Job
- object Model
- object ModelJob
- object ModelingFeature extends Serializable
- object ModelingFeaturelist extends Serializable
- object PredictionDataset extends Serializable
- object PredictionExplanations
- object Predictions extends Serializable
- object PrimeModel
-
object
Project
Factory for io.github.timsetsfire.datarobot.Project instances.
- object Status
- object Utilities