类 IDF

  • 所有已实现的接口:
    Serializable, org.apache.flink.ml.api.Estimator<IDF,​IDFModel>, org.apache.flink.ml.api.Stage<IDF>, org.apache.flink.ml.common.param.HasInputCol<IDF>, org.apache.flink.ml.common.param.HasOutputCol<IDF>, IDFModelParams<IDF>, IDFParams<IDF>, org.apache.flink.ml.param.WithParams<IDF>

    public class IDF
    extends Object
    implements org.apache.flink.ml.api.Estimator<IDF,​IDFModel>, IDFParams<IDF>
    An Estimator that computes the inverse document frequency (IDF) for the input documents. IDF is computed following `idf = log((m + 1) / (d(t) + 1))`, where `m` is the total number of documents and `d(t)` is the number of documents that contains `t`.

    Users could filter out terms that appeared in little documents by setting IDFParams.getMinDocFreq().

    See https://en.wikipedia.org/wiki/Tf%E2%80%93idf.

    另请参阅:
    序列化表格
    • 字段概要

      • 从接口继承的字段 org.apache.flink.ml.common.param.HasInputCol

        INPUT_COL
      • 从接口继承的字段 org.apache.flink.ml.common.param.HasOutputCol

        OUTPUT_COL
    • 构造器概要

      构造器 
      构造器 说明
      IDF()  
    • 构造器详细资料

      • IDF

        public IDF()
    • 方法详细资料

      • fit

        public IDFModel fit​(org.apache.flink.table.api.Table... inputs)
        指定者:
        fit 在接口中 org.apache.flink.ml.api.Estimator<IDF,​IDFModel>
      • getParamMap

        public Map<org.apache.flink.ml.param.Param<?>,​Object> getParamMap()
        指定者:
        getParamMap 在接口中 org.apache.flink.ml.param.WithParams<IDF>
      • load

        public static IDF load​(org.apache.flink.table.api.bridge.java.StreamTableEnvironment tEnv,
                               String path)
                        throws IOException
        抛出:
        IOException