Class UniversalEncodingDetector

java.lang.Object
org.apache.tika.parser.txt.UniversalEncodingDetector
All Implemented Interfaces:
Serializable, org.apache.tika.detect.EncodingDetector

public class UniversalEncodingDetector extends Object implements org.apache.tika.detect.EncodingDetector
See Also:
  • Constructor Details

    • UniversalEncodingDetector

      public UniversalEncodingDetector()
  • Method Details

    • detect

      public Charset detect(InputStream input, org.apache.tika.metadata.Metadata metadata) throws IOException
      Specified by:
      detect in interface org.apache.tika.detect.EncodingDetector
      Throws:
      IOException
    • getMarkLimit

      public int getMarkLimit()
    • setMarkLimit

      @Field public void setMarkLimit(int markLimit)
      How far into the stream to read for charset detection. Default is 8192.
      Parameters:
      markLimit -