DecisionTreeClassificationModel (Spark 2.4.6 JavaDoc)

Object
- org.apache.spark.ml.PipelineStage
- - org.apache.spark.ml.Transformer
  - - org.apache.spark.ml.Model<M>
    - - org.apache.spark.ml.PredictionModel<FeaturesType,M>
      - org.apache.spark.ml.classification.ClassificationModel<FeaturesType,M>
        
        org.apache.spark.ml.classification.ProbabilisticClassificationModel<Vector,DecisionTreeClassificationModel>
        
        org.apache.spark.ml.classification.DecisionTreeClassificationModel

All Implemented Interfaces:

java.io.Serializable, Logging, ClassifierParams, ProbabilisticClassifierParams, Params, HasCheckpointInterval, HasFeaturesCol, HasLabelCol, HasPredictionCol, HasProbabilityCol, HasRawPredictionCol, HasSeed, HasThresholds, PredictorParams, DecisionTreeClassifierParams, DecisionTreeModel, DecisionTreeParams, TreeClassifierParams, Identifiable, MLWritable
```
public class DecisionTreeClassificationModel
extends ProbabilisticClassificationModel<Vector,DecisionTreeClassificationModel>
implements DecisionTreeModel, DecisionTreeClassifierParams, MLWritable, scala.Serializable
```
Decision tree model (http://en.wikipedia.org/wiki/Decision_tree_learning) for classification. It supports both binary and multiclass labels, as well as both continuous and categorical features.

See Also:

Serialized Form

Method Summary

All Methods Static Methods Instance Methods Concrete Methods
Modifier and Type	Method and Description
`DecisionTreeClassificationModel`	`copy(ParamMap extra)` Creates a copy of this instance with the same UID and some extra params.
`Vector`	`featureImportances()` Estimate of the importance of each feature.
`static DecisionTreeClassificationModel`	`load(String path)`
`int`	`numClasses()` Number of classes (values which the label can take).
`int`	`numFeatures()` Returns the number of features the model was trained on.
`double`	`predict(Vector features)` Predict label for the given features.
`static MLReader<DecisionTreeClassificationModel>`	`read()`
`Node`	`rootNode()` Root of the decision tree
`String`	`toString()` Summary of the model
`String`	`uid()` An immutable unique ID for the object and its derivatives.
`MLWriter`	`write()` Returns an `MLWriter` instance for this ML instance.

Methods inherited from class org.apache.spark.ml.classification.ProbabilisticClassificationModel
normalizeToProbabilitiesInPlace, setProbabilityCol, setThresholds, transform

Methods inherited from class org.apache.spark.ml.classification.ClassificationModel
setRawPredictionCol

Methods inherited from class org.apache.spark.ml.PredictionModel
setFeaturesCol, setPredictionCol, transformSchema

Methods inherited from class org.apache.spark.ml.Model
hasParent, parent, setParent

Methods inherited from class org.apache.spark.ml.Transformer
transform, transform, transform

Methods inherited from class Object
equals, getClass, hashCode, notify, notifyAll, wait, wait, wait

Methods inherited from interface org.apache.spark.ml.tree.DecisionTreeModel
depth, maxSplitFeatureIndex, numNodes, toDebugString

Methods inherited from interface org.apache.spark.ml.tree.DecisionTreeParams
cacheNodeIds, getCacheNodeIds, getMaxBins, getMaxDepth, getMaxMemoryInMB, getMinInfoGain, getMinInstancesPerNode, getOldStrategy, maxBins, maxDepth, maxMemoryInMB, minInfoGain, minInstancesPerNode, setCacheNodeIds, setCheckpointInterval, setMaxBins, setMaxDepth, setMaxMemoryInMB, setMinInfoGain, setMinInstancesPerNode, setSeed

Methods inherited from interface org.apache.spark.ml.PredictorParams
validateAndTransformSchema

Methods inherited from interface org.apache.spark.ml.param.shared.HasLabelCol
getLabelCol, labelCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasFeaturesCol
featuresCol, getFeaturesCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasPredictionCol
getPredictionCol, predictionCol

Methods inherited from interface org.apache.spark.ml.param.Params
clear, copyValues, defaultCopy, defaultParamMap, explainParam, explainParams, extractParamMap, extractParamMap, get, getDefault, getOrDefault, getParam, hasDefault, hasParam, isDefined, isSet, paramMap, params, set, set, set, setDefault, setDefault, shouldOwn

Methods inherited from interface org.apache.spark.ml.param.shared.HasCheckpointInterval
checkpointInterval, getCheckpointInterval

Methods inherited from interface org.apache.spark.ml.param.shared.HasSeed
getSeed, seed

Methods inherited from interface org.apache.spark.ml.tree.TreeClassifierParams
getImpurity, getOldImpurity, impurity, setImpurity

Methods inherited from interface org.apache.spark.ml.util.MLWritable
save

Methods inherited from interface org.apache.spark.ml.classification.ProbabilisticClassifierParams
validateAndTransformSchema

Methods inherited from interface org.apache.spark.ml.param.shared.HasRawPredictionCol
getRawPredictionCol, rawPredictionCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasProbabilityCol
getProbabilityCol, probabilityCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasThresholds
getThresholds, thresholds

Methods inherited from interface org.apache.spark.internal.Logging
initializeLogging, initializeLogIfNecessary, initializeLogIfNecessary, isTraceEnabled, log_, log, logDebug, logDebug, logError, logError, logInfo, logInfo, logName, logTrace, logTrace, logWarning, logWarning

- Method Detail
  - read
```
public static MLReader<DecisionTreeClassificationModel> read()
```
  - load
```
public static DecisionTreeClassificationModel load(String path)
```
  - uid
```
public String uid()
```
    Description copied from interface: Identifiable
    
    An immutable unique ID for the object and its derivatives.
    
    Specified by:
    
    uid in interface Identifiable
    
    Returns:
    
    (undocumented)
  - rootNode
```
public Node rootNode()
```
    Description copied from interface: DecisionTreeModel
    
    Root of the decision tree
    
    Specified by:
    
    rootNode in interface DecisionTreeModel
  - numFeatures
```
public int numFeatures()
```
    Description copied from class: PredictionModel
    
    Returns the number of features the model was trained on. If unknown, returns -1
    
    Overrides:
    
    numFeatures in class PredictionModel<Vector,DecisionTreeClassificationModel>
  - numClasses
```
public int numClasses()
```
    Description copied from class: ClassificationModel
    
    Number of classes (values which the label can take).
    
    Specified by:
    
    numClasses in class ClassificationModel<Vector,DecisionTreeClassificationModel>
  - predict
```
public double predict(Vector features)
```
    Description copied from class: ClassificationModel
    
    Predict label for the given features. This method is used to implement transform() and output predictionCol.
    This default implementation for classification predicts the index of the maximum value from predictRaw().
    
    Overrides:
    
    predict in class ClassificationModel<Vector,DecisionTreeClassificationModel>
    
    Parameters:
    
    features - (undocumented)
    
    Returns:
    
    (undocumented)
  - copy
```
public DecisionTreeClassificationModel copy(ParamMap extra)
```
    Description copied from interface: Params
    
    Creates a copy of this instance with the same UID and some extra params. Subclasses should implement this method and set the return type properly. See defaultCopy().
    
    Specified by:
    
    copy in interface Params
    
    Specified by:
    
    copy in class Model<DecisionTreeClassificationModel>
    
    Parameters:
    
    extra - (undocumented)
    
    Returns:
    
    (undocumented)
  - toString
```
public String toString()
```
    Description copied from interface: DecisionTreeModel
    
    Summary of the model
    
    Specified by:
    
    toString in interface DecisionTreeModel
    
    Specified by:
    
    toString in interface Identifiable
    
    Overrides:
    
    toString in class Object
  - featureImportances
```
public Vector featureImportances()
```
    Estimate of the importance of each feature.
    This generalizes the idea of "Gini" importance to other losses, following the explanation of Gini importance from "Random Forests" documentation by Leo Breiman and Adele Cutler, and following the implementation from scikit-learn.
    This feature importance is calculated as follows: - importance(feature j) = sum (over nodes which split on feature j) of the gain, where gain is scaled by the number of instances passing through node - Normalize importances for tree to sum to 1.
    
    Returns:
    
    (undocumented)
    
    Note:
    
    Feature importance for single decision trees can have high variance due to correlated predictor variables. Consider using a RandomForestClassifier to determine feature importance instead.
  - write
```
public MLWriter write()
```
    Description copied from interface: MLWritable
    
    Returns an MLWriter instance for this ML instance.
    
    Specified by:
    
    write in interface MLWritable
    
    Returns:
    
    (undocumented)

Class DecisionTreeClassificationModel

Method Summary

Methods inherited from class org.apache.spark.ml.classification.ProbabilisticClassificationModel

Methods inherited from class org.apache.spark.ml.classification.ClassificationModel

Methods inherited from class org.apache.spark.ml.PredictionModel

Methods inherited from class org.apache.spark.ml.Model

Methods inherited from class org.apache.spark.ml.Transformer

Methods inherited from class Object

Methods inherited from interface org.apache.spark.ml.tree.DecisionTreeModel

Methods inherited from interface org.apache.spark.ml.tree.DecisionTreeParams

Methods inherited from interface org.apache.spark.ml.PredictorParams

Methods inherited from interface org.apache.spark.ml.param.shared.HasLabelCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasFeaturesCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasPredictionCol

Methods inherited from interface org.apache.spark.ml.param.Params

Methods inherited from interface org.apache.spark.ml.param.shared.HasCheckpointInterval

Methods inherited from interface org.apache.spark.ml.param.shared.HasSeed

Methods inherited from interface org.apache.spark.ml.tree.TreeClassifierParams

Methods inherited from interface org.apache.spark.ml.util.MLWritable

Methods inherited from interface org.apache.spark.ml.classification.ProbabilisticClassifierParams

Methods inherited from interface org.apache.spark.ml.param.shared.HasRawPredictionCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasProbabilityCol

Methods inherited from interface org.apache.spark.ml.param.shared.HasThresholds

Methods inherited from interface org.apache.spark.internal.Logging

Method Detail

read

load

uid

rootNode

numFeatures

numClasses

predict

copy

toString

featureImportances

write