public class GBTRegressionModel extends PredictionModel<Vector,GBTRegressionModel> implements GBTRegressorParams, TreeEnsembleModel<DecisionTreeRegressionModel>, MLWritable, scala.Serializable
Constructor and Description |
---|
GBTRegressionModel(String uid,
DecisionTreeRegressionModel[] _trees,
double[] _treeWeights)
Construct a GBTRegressionModel
|
Modifier and Type | Method and Description |
---|---|
GBTRegressionModel |
copy(ParamMap extra)
Creates a copy of this instance with the same UID and some extra params.
|
double[] |
evaluateEachIteration(Dataset<?> dataset,
String loss)
Method to compute error or loss for every iteration of gradient boosting.
|
Vector |
featureImportances()
Estimate of the importance of each feature.
|
int |
getNumTrees()
Number of trees in ensemble
|
static GBTRegressionModel |
load(String path) |
int |
numFeatures()
Returns the number of features the model was trained on.
|
int |
numTrees()
Deprecated.
Use
getNumTrees instead. This method will be removed in 3.0.0. |
double |
predict(Vector features)
Predict label for the given features.
|
static MLReader<GBTRegressionModel> |
read() |
String |
toString()
Summary of the model
|
DecisionTreeRegressionModel[] |
trees()
Trees in this ensemble.
|
double[] |
treeWeights()
Weights for each tree, zippable with
trees |
String |
uid()
An immutable unique ID for the object and its derivatives.
|
MLWriter |
write()
Returns an
MLWriter instance for this ML instance. |
setFeaturesCol, setPredictionCol, transform, transformSchema
transform, transform, transform
convertToOldLossType, getLossType, getOldLossType, lossType
getOldBoostingStrategy, getValidationTol, setMaxIter, setStepSize, stepSize, validationTol
featureSubsetStrategy, getFeatureSubsetStrategy, getOldStrategy, getSubsamplingRate, setFeatureSubsetStrategy, setSubsamplingRate, subsamplingRate
cacheNodeIds, getCacheNodeIds, getMaxBins, getMaxDepth, getMaxMemoryInMB, getMinInfoGain, getMinInstancesPerNode, getOldStrategy, maxBins, maxDepth, maxMemoryInMB, minInfoGain, minInstancesPerNode, setCacheNodeIds, setCheckpointInterval, setMaxBins, setMaxDepth, setMaxMemoryInMB, setMinInfoGain, setMinInstancesPerNode, setSeed
validateAndTransformSchema
getLabelCol, labelCol
featuresCol, getFeaturesCol
getPredictionCol, predictionCol
clear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn
checkpointInterval, getCheckpointInterval
getMaxIter, maxIter
getStepSize
getValidationIndicatorCol, validationIndicatorCol
getImpurity, getOldImpurity, impurity, setImpurity
javaTreeWeights, toDebugString, totalNumNodes
save
initializeLogging, initializeLogIfNecessary, initializeLogIfNecessary, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning
public GBTRegressionModel(String uid, DecisionTreeRegressionModel[] _trees, double[] _treeWeights)
_trees
- Decision trees in the ensemble._treeWeights
- Weights for the decision trees in the ensemble.uid
- (undocumented)public static MLReader<GBTRegressionModel> read()
public static GBTRegressionModel load(String path)
public String uid()
Identifiable
uid
in interface Identifiable
public int numFeatures()
PredictionModel
numFeatures
in class PredictionModel<Vector,GBTRegressionModel>
public DecisionTreeRegressionModel[] trees()
TreeEnsembleModel
trees
in interface TreeEnsembleModel<DecisionTreeRegressionModel>
public int getNumTrees()
public double[] treeWeights()
TreeEnsembleModel
trees
treeWeights
in interface TreeEnsembleModel<DecisionTreeRegressionModel>
public double predict(Vector features)
PredictionModel
transform()
and output predictionCol
.predict
in class PredictionModel<Vector,GBTRegressionModel>
features
- (undocumented)public int numTrees()
getNumTrees
instead. This method will be removed in 3.0.0.public GBTRegressionModel copy(ParamMap extra)
Params
defaultCopy()
.copy
in interface Params
copy
in class Model<GBTRegressionModel>
extra
- (undocumented)public String toString()
TreeEnsembleModel
toString
in interface TreeEnsembleModel<DecisionTreeRegressionModel>
toString
in interface Identifiable
toString
in class Object
public Vector featureImportances()
Each feature's importance is the average of its importance across all trees in the ensemble The importance vector is normalized to sum to 1. This method is suggested by Hastie et al. (Hastie, Tibshirani, Friedman. "The Elements of Statistical Learning, 2nd Edition." 2001.) and follows the implementation from scikit-learn.
DecisionTreeRegressionModel.featureImportances
public double[] evaluateEachIteration(Dataset<?> dataset, String loss)
dataset
- Dataset for validation.loss
- The loss function used to compute error. Supported options: squared, absolutepublic MLWriter write()
MLWritable
MLWriter
instance for this ML instance.write
in interface MLWritable