public class TxtCollectionReader extends AbstractTermSuiteCollectionReader
collectionType, droppedTags, PARAM_COLLECTION_TYPE, PARAM_DROPPED_TAGS, PARAM_ENCODING, PARAM_INPUTDIR, PARAM_LANGUAGE, PARAM_TXT_TAGS, txtTags
Constructor and Description |
---|
TxtCollectionReader() |
Modifier and Type | Method and Description |
---|---|
protected java.lang.String |
getDocumentText(java.lang.String absPath,
java.lang.String encoding)
Gives the document text to set from the input file URI.
|
close, fillCas, getFileFilter, getFiles, getNext, getProgress, hasNext, initialize, lastFileRead
destroy, getCasInitializer, getProcessingResourceMetaData, initialize, isConsuming, reconfigure, setCasInitializer, typeSystemInit
getConfigParameterValue, getConfigParameterValue, setConfigParameterValue, setConfigParameterValue
getCasManager, getLogger, getMetaData, getResourceManager, getUimaContext, getUimaContextAdmin, setLogger, setMetaData
clone, equals, finalize, getClass, hashCode, notify, notifyAll, toString, wait, wait, wait
protected java.lang.String getDocumentText(java.lang.String absPath, java.lang.String encoding) throws java.io.IOException
AbstractTermSuiteCollectionReader
getDocumentText
in class AbstractTermSuiteCollectionReader
java.io.IOException