Class LanguageProfile

java.lang.Object
org.apache.tika.language.LanguageProfile

@Deprecated public class LanguageProfile extends Object
Deprecated.
Language profile based on ngram counts.
Since:
Apache Tika 0.5
  • Field Details

    • DEFAULT_NGRAM_LENGTH

      public static final int DEFAULT_NGRAM_LENGTH
      Deprecated.
      See Also:
    • useInterleaved

      public static boolean useInterleaved
      Deprecated.
  • Constructor Details

    • LanguageProfile

      public LanguageProfile(int length)
      Deprecated.
    • LanguageProfile

      public LanguageProfile()
      Deprecated.
    • LanguageProfile

      public LanguageProfile(String content, int length)
      Deprecated.
    • LanguageProfile

      public LanguageProfile(String content)
      Deprecated.
  • Method Details

    • getCount

      public long getCount()
      Deprecated.
    • getCount

      public long getCount(String ngram)
      Deprecated.
    • add

      public void add(String ngram)
      Deprecated.
      Adds a single occurrence of the given ngram to this profile.
      Parameters:
      ngram - the ngram
    • add

      public void add(String ngram, long count)
      Deprecated.
      Adds multiple occurrences of the given ngram to this profile.
      Parameters:
      ngram - the ngram
      count - number of occurrences to add
    • distance

      public double distance(LanguageProfile that)
      Deprecated.
      Calculates the geometric distance between this and the given other language profile.
      Parameters:
      that - the other language profile
      Returns:
      distance between the profiles
    • toString

      public String toString()
      Deprecated.
      Overrides:
      toString in class Object