class TextAndCSVParser extends org.apache.tika.parser.AbstractEncodingDetectorParser implements
Serializable
-
Serialized Fields
-
defaultTextAndCSVConfig
TextAndCSVConfig defaultTextAndCSVConfig
-
markLimit
int markLimit
The mark limit, in characters (not bytes), to read from
the stream when classifying it as csv, tsv or txt.
-
minConfidence
double minConfidence
The minimum confidence score required to conclude that
there is enough evidence to distinguish csv/tsv from txt.
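The two fields above describe a peek-then-classify pattern: read a bounded number of characters, score the sample, and fall back to plain text when the score is too low. The following is a minimal sketch of that idea using only the JDK, not Tika's actual implementation. The class `DelimiterSniffer`, its methods, and the scoring heuristic (the fraction of lines whose delimiter count matches the first line) are all hypothetical and chosen for illustration.

```java
import java.io.BufferedReader;
import java.io.IOException;
import java.io.StringReader;

// Hypothetical helper; not part of Apache Tika.
final class DelimiterSniffer {

    /** Reads up to markLimit characters, then rewinds the reader to where it started. */
    static String peek(BufferedReader reader, int markLimit) throws IOException {
        // Mark one character past the sample size so reset() stays valid.
        reader.mark(markLimit + 1);
        char[] buf = new char[markLimit];
        int total = 0;
        while (total < markLimit) {
            int n = reader.read(buf, total, markLimit - total);
            if (n < 0) {
                break; // end of stream before the limit was reached
            }
            total += n;
        }
        reader.reset();
        return new String(buf, 0, total);
    }

    /** Assumed heuristic: share of non-empty lines with the same non-zero delimiter count as the first line. */
    static double confidence(String sample, char delimiter) {
        int expected = -1;
        int consistent = 0;
        int considered = 0;
        for (String line : sample.split("\r?\n")) {
            if (line.isEmpty()) {
                continue;
            }
            int count = 0;
            for (int i = 0; i < line.length(); i++) {
                if (line.charAt(i) == delimiter) {
                    count++;
                }
            }
            if (expected < 0) {
                expected = count;
            }
            considered++;
            if (count > 0 && count == expected) {
                consistent++;
            }
        }
        return considered == 0 ? 0.0 : (double) consistent / considered;
    }

    /** Returns "csv", "tsv" or "txt"; falls back to "txt" when neither score reaches minConfidence. */
    static String classify(BufferedReader reader, int markLimit, double minConfidence)
            throws IOException {
        String sample = peek(reader, markLimit);
        double csv = confidence(sample, ',');
        double tsv = confidence(sample, '\t');
        if (csv >= tsv && csv >= minConfidence) {
            return "csv";
        }
        if (tsv > csv && tsv >= minConfidence) {
            return "tsv";
        }
        return "txt";
    }

    public static void main(String[] args) throws IOException {
        BufferedReader reader = new BufferedReader(new StringReader("a,b,c\n1,2,3\n4,5,6\n"));
        System.out.println(classify(reader, 1024, 0.9));
    }
}
```

Because the sample is measured in characters, the limit applies after decoding, which matches the "characters (not bytes)" wording above. A small limit can cut the last sampled line short and lower the score, so a real classifier would likely need to account for that.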
-
-
- serialVersionUID:
- 1L
-
Serialized Fields
-
inPos
int inPos
The position into the input buffer.
-
input
byte[] input
The input buffer.
-
inSize
int inSize
The number of bytes in the input buffer.
-
minSize
int minSize
The minimum size of a character sequence to be extracted.
-
outPos
int outPos
The current position into the output buffer.
-
output
byte[] output
The output buffer.
-
tmpPos
int tmpPos
The temporary position into the output buffer.
-
xhtml
org.apache.tika.sax.XHTMLContentHandler xhtml
The output content handler.
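The fields above suggest a buffer-scanning loop: bytes are copied from an input buffer to an output buffer while they look like text, and a run is kept only if it reaches the minimum size. The sketch below illustrates that technique under stated assumptions and is not the library's code. The class `Latin1Strings`, the `extract` method, and the choice of which byte values count as printable are hypothetical; the local variable names merely echo the field names listed above.

```java
import java.nio.charset.StandardCharsets;
import java.util.ArrayList;
import java.util.List;

// Hypothetical helper; not part of Apache Tika.
final class Latin1Strings {

    /** Assumed definition of "printable": tab, ASCII 0x20-0x7E, and Latin-1 0xA0-0xFF. */
    static boolean isPrintable(int b) {
        return b == '\t' || (b >= 0x20 && b <= 0x7E) || (b >= 0xA0 && b <= 0xFF);
    }

    /** Collects every run of printable bytes in input[0..inSize) that is at least minSize long. */
    static List<String> extract(byte[] input, int inSize, int minSize) {
        List<String> found = new ArrayList<>();
        byte[] output = new byte[inSize];
        int outPos = 0; // current position in the output buffer
        for (int inPos = 0; inPos < inSize; inPos++) {
            int b = input[inPos] & 0xFF; // treat the byte as unsigned
            if (isPrintable(b)) {
                output[outPos++] = (byte) b;
            } else {
                // A non-printable byte ends the current run.
                if (outPos >= minSize) {
                    found.add(new String(output, 0, outPos, StandardCharsets.ISO_8859_1));
                }
                outPos = 0;
            }
        }
        // Flush a run that reaches the end of the buffer.
        if (outPos >= minSize) {
            found.add(new String(output, 0, outPos, StandardCharsets.ISO_8859_1));
        }
        return found;
    }

    public static void main(String[] args) {
        byte[] data = {0, 'a', 'b', 'c', 'd', 0, 'x', 'y', 1, 'h', 'e', 'l', 'l', 'o'};
        System.out.println(extract(data, data.length, 4));
    }
}
```

With a minimum size of 4, the two-byte run `xy` in the sample data is discarded, while `abcd` and `hello` are kept.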
-
- serialVersionUID:
- -1465227101645003594L
-
Serialized Fields
-
encoding
StringsEncoding encoding
-
minLength
int minLength
-
stringsPath
String stringsPath
-
timeoutSeconds
int timeoutSeconds
-
- serialVersionUID:
- 802566634661575025L
-
Serialized Fields
-
defaultStringsConfig
StringsConfig defaultStringsConfig
-
fileCommandDetector
org.apache.tika.detect.FileCommandDetector fileCommandDetector
-
filePath
String filePath
-
hasEncodingOption
boolean hasEncodingOption
-
stringsPath
String stringsPath
-
stringsPresent
boolean stringsPresent
-
class TXTParser extends org.apache.tika.parser.AbstractEncodingDetectorParser implements
Serializable
- serialVersionUID:
- -6656102320836888910L
-