Class FeatureHasher

  • All Implemented Interfaces:
    Serializable, org.apache.flink.ml.api.AlgoOperator<FeatureHasher>, org.apache.flink.ml.api.Stage<FeatureHasher>, org.apache.flink.ml.api.Transformer<FeatureHasher>, org.apache.flink.ml.common.param.HasCategoricalCols<FeatureHasher>, org.apache.flink.ml.common.param.HasInputCols<FeatureHasher>, org.apache.flink.ml.common.param.HasNumFeatures<FeatureHasher>, org.apache.flink.ml.common.param.HasOutputCol<FeatureHasher>, FeatureHasherParams<FeatureHasher>, org.apache.flink.ml.param.WithParams<FeatureHasher>

    public class FeatureHasher
    extends Object
    implements org.apache.flink.ml.api.Transformer<FeatureHasher>, FeatureHasherParams<FeatureHasher>
    A Transformer that transforms a set of categorical or numerical features into a sparse vector of a specified dimension. Categorical and numerical columns are hashed according to the following rules:
    • For numerical columns, the index of the feature in the output vector is the hash value of the column name, and its corresponding value is the same as the input value.
    • For categorical columns, the index of this feature in the output vector is the hash value of the string "column_name=value" and the corresponding value is 1.0.

    If multiple features hash to the same index of the output vector, their values are accumulated. For details on the hashing trick, see https://en.wikipedia.org/wiki/Feature_hashing.
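The hashing rules above can be illustrated with a minimal, self-contained sketch. It is not the FeatureHasher implementation: the class name `FeatureHashingSketch`, the helpers `indexOf` and `hashRow`, and the use of `String.hashCode()` as the hash function are all assumptions made for illustration, so the indices it produces will generally differ from those of the real operator.

```java
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.TreeMap;

public class FeatureHashingSketch {

    // Maps a key to a slot in [0, numFeatures). String.hashCode() is only a
    // stand-in (assumption) for whatever hash function the real operator uses.
    public static int indexOf(String key, int numFeatures) {
        return Math.floorMod(key.hashCode(), numFeatures);
    }

    // Hashes one row into a sparse vector, represented here as index -> value.
    public static Map<Integer, Double> hashRow(
            Map<String, Double> numerical,
            Map<String, String> categorical,
            int numFeatures) {
        Map<Integer, Double> sparse = new TreeMap<>();
        // Numerical column: index from the column name, value taken from the input.
        for (Map.Entry<String, Double> e : numerical.entrySet()) {
            sparse.merge(indexOf(e.getKey(), numFeatures), e.getValue(), Double::sum);
        }
        // Categorical column: index from "column_name=value", value 1.0.
        for (Map.Entry<String, String> e : categorical.entrySet()) {
            String key = e.getKey() + "=" + e.getValue();
            sparse.merge(indexOf(key, numFeatures), 1.0, Double::sum);
        }
        // merge(..., Double::sum) accumulates values that land on the same index.
        return sparse;
    }

    public static void main(String[] args) {
        Map<String, Double> numerical = new LinkedHashMap<>();
        numerical.put("f1", 2.5);
        Map<String, String> categorical = new LinkedHashMap<>();
        categorical.put("f0", "a");
        System.out.println(hashRow(numerical, categorical, 1000));
    }
}
```

With `numFeatures` set to 1, every feature collides on index 0, which makes the accumulation rule easy to observe: the row above yields the single entry 0 -> 3.5.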

    See Also:
    Serialized Form
    • Field Summary

      • Fields inherited from interface org.apache.flink.ml.common.param.HasCategoricalCols

        CATEGORICAL_COLS
      • Fields inherited from interface org.apache.flink.ml.common.param.HasInputCols

        INPUT_COLS
      • Fields inherited from interface org.apache.flink.ml.common.param.HasNumFeatures

        NUM_FEATURES
      • Fields inherited from interface org.apache.flink.ml.common.param.HasOutputCol

        OUTPUT_COL
    • Method Summary

      All Methods Static Methods Instance Methods Concrete Methods
      Modifier and Type Method Description
      Map<org.apache.flink.ml.param.Param<?>,Object> getParamMap()
      static FeatureHasher load(org.apache.flink.table.api.bridge.java.StreamTableEnvironment env, String path)
      void save(String path)
      org.apache.flink.table.api.Table[] transform(org.apache.flink.table.api.Table... inputs)
      • Methods inherited from interface org.apache.flink.ml.common.param.HasCategoricalCols

        getCategoricalCols, setCategoricalCols
      • Methods inherited from interface org.apache.flink.ml.common.param.HasInputCols

        getInputCols, setInputCols
      • Methods inherited from interface org.apache.flink.ml.common.param.HasNumFeatures

        getNumFeatures, setNumFeatures
      • Methods inherited from interface org.apache.flink.ml.common.param.HasOutputCol

        getOutputCol, setOutputCol
      • Methods inherited from interface org.apache.flink.ml.param.WithParams

        get, getParam, set
    • Constructor Detail

      • FeatureHasher

        public FeatureHasher()
    • Method Detail

      • transform

        public org.apache.flink.table.api.Table[] transform(org.apache.flink.table.api.Table... inputs)
        Specified by:
        transform in interface org.apache.flink.ml.api.AlgoOperator<FeatureHasher>
      • getParamMap

        public Map<org.apache.flink.ml.param.Param<?>,Object> getParamMap()
        Specified by:
        getParamMap in interface org.apache.flink.ml.param.WithParams<FeatureHasher>