com.salesforce.op.stages.impl.feature
unique name of the operation this stage performs
uid for instance
type tag for numeric feature type
type tag for numeric feature value type
numeric evidence for feature type value
Computed splits
Computed splits
should or not split
computed split values
bucket labels
Compute splits using DecisionTreeClassifier
Compute splits using DecisionTreeClassifier
input dataset of (label, feature) tuples
feature name
computed Splits
Criterion used for information gain calculation (case-insensitive).
Criterion used for information gain calculation (case-insensitive). Supported: "entropy" and "gini". (default = gini)
Maximum number of bins Must be >= 2 and <= number of categories in any categorical feature.
Maximum number of bins Must be >= 2 and <= number of categories in any categorical feature. (default = 32)
Maximum depth of the tree (>= 0).
Maximum depth of the tree (>= 0). E.g., depth 0 means 1 leaf node; depth 1 means 1 internal node + 2 leaf nodes. (default = 5)
Minimum information gain for a split to be considered at a tree node.
Minimum information gain for a split to be considered at a tree node. Should be >= 0.0. (default = 0.0)
Minimum number of instances each child must have after split.
Minimum number of instances each child must have after split. If a split causes the left or right child to have fewer than minInstancesPerNode, the split will be discarded as invalid. Should be >= 1. (default = 1)
numeric evidence for feature type value
unique name of the operation this stage performs
unique name of the operation this stage performs
Get the metadata describing the output vector
Get the metadata describing the output vector
This does not trigger onGetMetadata()
Metadata of output vector
Option to keep track of invalid values
Option to keep track of invalid values
Option to keep track of values that were missing
Option to keep track of values that were missing
type tag for numeric feature type
type tag for numeric feature type
type tag for numeric feature value type
type tag for numeric feature value type
uid for instance
uid for instance
Compute the output vector metadata only from the input features.
Compute the output vector metadata only from the input features. Vectorizers use this to derive the full vector, including pivot columns or indicator features.
Vector metadata from input features
Get the name of the output vector
Get the name of the output vector
Output vector name as a string
Smart bucketizer for numeric values based on a Decision Tree classifier.
numeric feature type value
numeric feature type