Class UniversalEncodingDetector

  • All Implemented Interfaces:
    Serializable, org.apache.tika.detect.EncodingDetector

    public class UniversalEncodingDetector
    extends Object
    implements org.apache.tika.detect.EncodingDetector
    See Also:
    Serialized Form
    • Constructor Detail

      • UniversalEncodingDetector

        public UniversalEncodingDetector()
    • Method Detail

      • detect

        public Charset detect(InputStream input,
                              org.apache.tika.metadata.Metadata metadata)
                       throws IOException
        Specified by:
        detect in interface org.apache.tika.detect.EncodingDetector
        Throws:
        IOException
      • getMarkLimit

        public int getMarkLimit()
      • setMarkLimit

        @Field
        public void setMarkLimit(int markLimit)
        Sets how far into the stream, in bytes, to read for charset detection. The default is 8192.
        Parameters:
        markLimit - the maximum number of bytes to read from the start of the stream for charset detection
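
The mark limit can be understood through the standard java.io mark/reset contract: a reader marks the stream, inspects at most markLimit bytes, then resets so the caller still sees the stream from its start. The sketch below illustrates that pattern using only the JDK. It is not the Tika implementation; the class name MarkLimitSketch and the method peek are hypothetical names chosen for this example, and it assumes the stream supports mark/reset.

```java
import java.io.BufferedInputStream;
import java.io.ByteArrayInputStream;
import java.io.IOException;
import java.io.InputStream;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Hypothetical illustration class; not part of Tika.
public class MarkLimitSketch {

    /**
     * Reads at most markLimit bytes from the stream for inspection, then
     * rewinds the stream so the caller can consume it from the start.
     */
    public static byte[] peek(InputStream input, int markLimit) throws IOException {
        if (!input.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        // Remember the current position; valid while we read <= markLimit bytes.
        input.mark(markLimit);
        try {
            byte[] buffer = new byte[markLimit];
            int total = 0;
            // A single read() may return fewer bytes than requested, so loop.
            while (total < markLimit) {
                int n = input.read(buffer, total, markLimit - total);
                if (n == -1) {
                    break; // end of stream before the limit was reached
                }
                total += n;
            }
            return Arrays.copyOf(buffer, total);
        } finally {
            // Rewind to the marked position regardless of how reading ended.
            input.reset();
        }
    }

    public static void main(String[] args) throws IOException {
        byte[] data = "hello, world".getBytes(StandardCharsets.UTF_8);
        // BufferedInputStream adds mark/reset support to streams that lack it.
        InputStream in = new BufferedInputStream(new ByteArrayInputStream(data));

        byte[] head = peek(in, 5);
        System.out.println(new String(head, StandardCharsets.UTF_8)); // prints "hello"
        System.out.println((char) in.read());                         // prints "h"
    }
}
```

A larger limit gives a detector more bytes to examine at the cost of more buffering; a smaller limit reads less but may miss evidence that appears later in the stream.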