Class StringIndexer
- java.lang.Object
- org.apache.flink.ml.feature.stringindexer.StringIndexer
- All Implemented Interfaces:
Serializable, org.apache.flink.ml.api.Estimator<StringIndexer,StringIndexerModel>, org.apache.flink.ml.api.Stage<StringIndexer>, org.apache.flink.ml.common.param.HasHandleInvalid<StringIndexer>, org.apache.flink.ml.common.param.HasInputCols<StringIndexer>, org.apache.flink.ml.common.param.HasOutputCols<StringIndexer>, StringIndexerModelParams<StringIndexer>, StringIndexerParams<StringIndexer>, org.apache.flink.ml.param.WithParams<StringIndexer>
public class StringIndexer extends Object implements org.apache.flink.ml.api.Estimator<StringIndexer,StringIndexerModel>, StringIndexerParams<StringIndexer>
An Estimator which implements the string indexing algorithm.
A string indexer maps one or more columns (string or numerical values) of the input to one or more indexed output columns (integer values). The output indices of two data points are the same if and only if their corresponding input columns are the same. The indices are in [0, numDistinctValuesInThisColumn].
The input columns are cast to string if they are numeric values. By default, the output model is arbitrarily ordered. Users can control this by setting StringIndexerParams.STRING_ORDER_TYPE. Users can also control the maximum number of output indices by setting StringIndexerParams.MAX_INDEX_NUM. This parameter only takes effect if StringIndexerParams.STRING_ORDER_TYPE is set to 'frequencyDesc'.
The `keep` option of HasHandleInvalid means that invalid input is transformed into a special index, whose value is the number of distinct values in this column.
- See Also:
- Serialized Form
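The mapping described above can be illustrated with a small plain-Java sketch. It is not the Flink ML implementation and uses no Flink classes: the class name `StringIndexSketch` and its methods are hypothetical, the ordering is assumed to be ascending alphabetical (natural `String` order), and an unseen value is treated as the "invalid input" that `keep` maps to the number of distinct values.

```java
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeSet;

/** Plain-Java illustration of string indexing; hypothetical, not the Flink ML code. */
class StringIndexSketch {

    /** Assigns indices 0..n-1 to the distinct values, in ascending alphabetical order. */
    static Map<String, Integer> fitAlphabetAsc(List<String> column) {
        Map<String, Integer> indexByValue = new LinkedHashMap<>();
        // TreeSet removes duplicates and iterates in natural String order.
        for (String value : new TreeSet<>(column)) {
            indexByValue.put(value, indexByValue.size());
        }
        return indexByValue;
    }

    /** "keep" semantics: a value not seen during fitting maps to the number of distinct values. */
    static int transformKeep(Map<String, Integer> indexByValue, String value) {
        Integer index = indexByValue.get(value);
        return index != null ? index : indexByValue.size();
    }

    public static void main(String[] args) {
        Map<String, Integer> model = fitAlphabetAsc(Arrays.asList("b", "a", "c", "a"));
        System.out.println(model);                       // {a=0, b=1, c=2}
        System.out.println(transformKeep(model, "b"));   // 1
        System.out.println(transformKeep(model, "zzz")); // 3
    }
}
```

Equal inputs always receive equal indices, and with three distinct values the special `keep` index is 3, which is why the documented range is closed at numDistinctValuesInThisColumn.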
Field Summary
Fields inherited from interface org.apache.flink.ml.common.param.HasHandleInvalid
ERROR_INVALID, HANDLE_INVALID, KEEP_INVALID, SKIP_INVALID
Fields inherited from interface org.apache.flink.ml.feature.stringindexer.StringIndexerParams
ALPHABET_ASC_ORDER, ALPHABET_DESC_ORDER, ARBITRARY_ORDER, FREQUENCY_ASC_ORDER, FREQUENCY_DESC_ORDER, MAX_INDEX_NUM, STRING_ORDER_TYPE
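As a rough illustration of what a frequency-based order type such as FREQUENCY_DESC_ORDER implies, the following plain-Java sketch assigns the smallest index to the most frequent value. It is not taken from Flink ML: the class `FrequencyOrderSketch` is hypothetical, and breaking ties alphabetically is an assumption made here only to keep the result deterministic. The effect of MAX_INDEX_NUM is deliberately not modelled.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.LinkedHashMap;
import java.util.List;
import java.util.Map;
import java.util.TreeMap;

/** Plain-Java illustration of descending-frequency indexing; hypothetical, not the Flink ML code. */
class FrequencyOrderSketch {

    /** Assigns index 0 to the most frequent value, 1 to the next, and so on. */
    static Map<String, Integer> fitFrequencyDesc(List<String> column) {
        // Count occurrences; TreeMap keeps keys alphabetical (assumed tie-break order).
        Map<String, Long> counts = new TreeMap<>();
        for (String value : column) {
            counts.merge(value, 1L, Long::sum);
        }
        // List.sort is stable, so equal counts stay in alphabetical order.
        List<Map.Entry<String, Long>> entries = new ArrayList<>(counts.entrySet());
        entries.sort((x, y) -> Long.compare(y.getValue(), x.getValue()));

        Map<String, Integer> indexByValue = new LinkedHashMap<>();
        for (Map.Entry<String, Long> entry : entries) {
            indexByValue.put(entry.getKey(), indexByValue.size());
        }
        return indexByValue;
    }

    public static void main(String[] args) {
        // "b" occurs 3 times, "a" twice, "c" once.
        System.out.println(fitFrequencyDesc(Arrays.asList("b", "a", "b", "c", "b", "a")));
        // {b=0, a=1, c=2}
    }
}
```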
Constructor Summary
Constructor    Description
StringIndexer()
Method Summary
Modifier and Type    Method    Description
StringIndexerModel    fit(org.apache.flink.table.api.Table... inputs)
Map<org.apache.flink.ml.param.Param<?>,Object>    getParamMap()
static StringIndexer    load(org.apache.flink.table.api.bridge.java.StreamTableEnvironment tEnv, String path)
void    save(String path)
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
Methods inherited from interface org.apache.flink.ml.feature.stringindexer.StringIndexerParams
getMaxIndexNum, getStringOrderType, setMaxIndexNum, setStringOrderType
Method Detail
save
public void save(String path) throws IOException
- Specified by:
save in interface org.apache.flink.ml.api.Stage<StringIndexer>
- Throws:
IOException
load
public static StringIndexer load(org.apache.flink.table.api.bridge.java.StreamTableEnvironment tEnv, String path) throws IOException
- Throws:
IOException
getParamMap
public Map<org.apache.flink.ml.param.Param<?>,Object> getParamMap()
- Specified by:
getParamMap in interface org.apache.flink.ml.param.WithParams<StringIndexer>
fit
public StringIndexerModel fit(org.apache.flink.table.api.Table... inputs)
- Specified by:
fit in interface org.apache.flink.ml.api.Estimator<StringIndexer,StringIndexerModel>