Class IDF
- java.lang.Object
  - org.apache.flink.ml.feature.idf.IDF
- All Implemented Interfaces:
Serializable, org.apache.flink.ml.api.Estimator<IDF,IDFModel>, org.apache.flink.ml.api.Stage<IDF>, org.apache.flink.ml.common.param.HasInputCol<IDF>, org.apache.flink.ml.common.param.HasOutputCol<IDF>, IDFModelParams<IDF>, IDFParams<IDF>, org.apache.flink.ml.param.WithParams<IDF>
public class IDF extends Object implements org.apache.flink.ml.api.Estimator<IDF,IDFModel>, IDFParams<IDF>
An Estimator that computes the inverse document frequency (IDF) for the input documents. IDF is computed as `idf = log((m + 1) / (d(t) + 1))`, where `m` is the total number of documents and `d(t)` is the number of documents that contain the term `t`. Users can filter out terms that appear in too few documents by setting the minimum document frequency with setMinDocFreq (read back with IDFParams.getMinDocFreq()).
See https://en.wikipedia.org/wiki/Tf%E2%80%93idf.
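The formula above can be illustrated with a small, dependency-free sketch. This is not the Flink ML implementation: `IdfSketch` and its methods are hypothetical names introduced here, the natural logarithm is used, documents are assumed to be term-count vectors of equal length, and terms whose document frequency falls below the minimum are assumed to receive an IDF of 0.

```java
import java.util.Arrays;
import java.util.List;

/** Hypothetical helper (not part of Flink ML) that illustrates the IDF formula. */
final class IdfSketch {

    /** idf = log((m + 1) / (d(t) + 1)), using the natural logarithm. */
    static double idf(long numDocs, long docFreq) {
        return Math.log((numDocs + 1.0) / (docFreq + 1.0));
    }

    /**
     * Computes one IDF value per term index from term-count vectors.
     * Assumption: a term whose document frequency is below minDocFreq gets 0.0.
     */
    static double[] fit(List<double[]> docs, int minDocFreq) {
        int numTerms = docs.get(0).length;
        long[] docFreq = new long[numTerms];
        // d(t): count the documents in which term t occurs at least once.
        for (double[] doc : docs) {
            for (int t = 0; t < numTerms; t++) {
                if (doc[t] > 0) {
                    docFreq[t]++;
                }
            }
        }
        double[] result = new double[numTerms];
        for (int t = 0; t < numTerms; t++) {
            result[t] = docFreq[t] >= minDocFreq ? idf(docs.size(), docFreq[t]) : 0.0;
        }
        return result;
    }

    public static void main(String[] args) {
        List<double[]> docs = Arrays.asList(
                new double[] {0, 1, 0, 2},
                new double[] {0, 1, 2, 3},
                new double[] {0, 1, 0, 0});
        // m = 3 documents; document frequencies per term index are 0, 3, 1, 2.
        System.out.println(Arrays.toString(fit(docs, 2)));
    }
}
```

With `minDocFreq = 2`, term indices 0 and 2 are filtered out, index 1 yields log(4/4) = 0, and index 3 yields log(4/3), roughly 0.288. A term that occurs in every document therefore carries no weight, which is the intended effect of the inverse document frequency.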
- See Also:
- Serialized Form
-
Field Summary
-
Fields inherited from interface org.apache.flink.ml.feature.idf.IDFParams
MIN_DOC_FREQ
-
Constructor Summary
Constructors
Constructor    Description
IDF()
-
Method Summary
Modifier and Type    Method
IDFModel    fit(org.apache.flink.table.api.Table... inputs)
Map<org.apache.flink.ml.param.Param<?>,Object>    getParamMap()
static IDF    load(org.apache.flink.table.api.bridge.java.StreamTableEnvironment tEnv, String path)
void    save(String path)
-
Methods inherited from class java.lang.Object
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
-
Methods inherited from interface org.apache.flink.ml.feature.idf.IDFParams
getMinDocFreq, setMinDocFreq
-
Method Detail
-
fit
public IDFModel fit(org.apache.flink.table.api.Table... inputs)
-
getParamMap
public Map<org.apache.flink.ml.param.Param<?>,Object> getParamMap()
- Specified by:
getParamMap in interface org.apache.flink.ml.param.WithParams<IDF>
-
save
public void save(String path) throws IOException
- Specified by:
save in interface org.apache.flink.ml.api.Stage<IDF>
- Throws:
IOException
-
load
public static IDF load(org.apache.flink.table.api.bridge.java.StreamTableEnvironment tEnv, String path) throws IOException
- Throws:
IOException
-