- ABOUT - Static variable in interface org.apache.tika.metadata.XMP
-
Unordered text strings of advisories.
- ABS_PEAK_AUDIO_FILE_PATH - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The absolute path to the file's peak audio file."
- AbstractEmitter - Class in org.apache.tika.pipes.emitter
-
- AbstractEmitter() - Constructor for class org.apache.tika.pipes.emitter.AbstractEmitter
-
- AbstractEncodingDetectorParser - Class in org.apache.tika.parser
-
- AbstractEncodingDetectorParser() - Constructor for class org.apache.tika.parser.AbstractEncodingDetectorParser
-
- AbstractEncodingDetectorParser(EncodingDetector) - Constructor for class org.apache.tika.parser.AbstractEncodingDetectorParser
-
- AbstractFetcher - Class in org.apache.tika.pipes.fetcher
-
- AbstractFetcher() - Constructor for class org.apache.tika.pipes.fetcher.AbstractFetcher
-
- AbstractFetcher(String) - Constructor for class org.apache.tika.pipes.fetcher.AbstractFetcher
-
- AbstractMultipleParser - Class in org.apache.tika.parser.multiple
-
Abstract base class for parser wrappers which may / will
process a given stream multiple times, merging the results
of the various parsers used.
- AbstractMultipleParser(MediaTypeRegistry, Collection<? extends Parser>, Map<String, Param>) - Constructor for class org.apache.tika.parser.multiple.AbstractMultipleParser
-
- AbstractMultipleParser(MediaTypeRegistry, AbstractMultipleParser.MetadataPolicy, Parser...) - Constructor for class org.apache.tika.parser.multiple.AbstractMultipleParser
-
- AbstractMultipleParser(MediaTypeRegistry, AbstractMultipleParser.MetadataPolicy, Collection<? extends Parser>) - Constructor for class org.apache.tika.parser.multiple.AbstractMultipleParser
-
- AbstractMultipleParser.MetadataPolicy - Enum in org.apache.tika.parser.multiple
-
The various strategies for handling metadata emitted by
multiple parsers.
- AbstractParser - Class in org.apache.tika.parser
-
Abstract base class for new parsers.
- AbstractParser() - Constructor for class org.apache.tika.parser.AbstractParser
-
- AbstractRecursiveParserWrapperHandler - Class in org.apache.tika.sax
-
- AbstractRecursiveParserWrapperHandler(ContentHandlerFactory) - Constructor for class org.apache.tika.sax.AbstractRecursiveParserWrapperHandler
-
- AbstractRecursiveParserWrapperHandler(ContentHandlerFactory, int) - Constructor for class org.apache.tika.sax.AbstractRecursiveParserWrapperHandler
-
- AccessPermissionException - Exception in org.apache.tika.exception
-
Exception to be thrown when a document does not allow content extraction.
- AccessPermissionException() - Constructor for exception org.apache.tika.exception.AccessPermissionException
-
- AccessPermissionException(Throwable) - Constructor for exception org.apache.tika.exception.AccessPermissionException
-
- AccessPermissionException(String) - Constructor for exception org.apache.tika.exception.AccessPermissionException
-
- AccessPermissionException(String, Throwable) - Constructor for exception org.apache.tika.exception.AccessPermissionException
-
- AccessPermissions - Interface in org.apache.tika.metadata
-
Until we can find a common standard, we'll use these options.
- ACKNOWLEDGEMENT - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- ACRONYM_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- ACTION_TRIGGER - Static variable in interface org.apache.tika.metadata.PDF
-
This specifies where an action or destination would be found/triggered
in the document: on document open, before close, etc.
- actuallyParse() - Method in class org.apache.tika.pipes.PipesServer
-
- add(String, String) - Method in class org.apache.tika.metadata.Metadata
-
Add a metadata name/value mapping.
- add(Property, String) - Method in class org.apache.tika.metadata.Metadata
-
Add a metadata property/value mapping.
- add(Property, int) - Method in class org.apache.tika.metadata.Metadata
-
Adds the integer value of the identified metadata property.
- addAlias(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
- addData(byte[], int, int) - Method in class org.apache.tika.detect.TextStatistics
-
- addingService(ServiceReference) - Method in class org.apache.tika.config.TikaActivator
-
- ADDITIONAL_MODEL_INFO - Static variable in interface org.apache.tika.metadata.IPTC
-
Information about the ethnicity and other facets of the model(s) in a
model-released image.
- addPattern(MimeType, String) - Method in class org.apache.tika.mime.MimeTypes
-
Adds a file name pattern for the given media type.
- addPattern(MimeType, String, boolean) - Method in class org.apache.tika.mime.MimeTypes
-
Adds a file name pattern for the given media type.
- addPrefix(String, String) - Method in class org.apache.tika.sax.xpath.XPathParser
-
- addResource(Closeable) - Method in class org.apache.tika.io.TemporaryResources
-
- addSuperType(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
- addText(char[], int, int) - Method in class org.apache.tika.language.detect.LanguageDetector
-
Add statistics about this text for the current document.
- addText(CharSequence) - Method in class org.apache.tika.language.detect.LanguageDetector
-
Add to the statistics being accumulated for the current
document.
- addType(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
- advance(int) - Method in class org.apache.tika.sax.SecureContentHandler
-
Records the given number of output characters (or more accurately
UTF-16 code units).
- ADVISORY - Static variable in interface org.apache.tika.metadata.XMP
-
Unordered text strings of advisories.
- afterRead(int) - Method in class org.apache.tika.io.TikaInputStream
-
- ALBUM - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The name of the album."
- ALBUM_ARTIST - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The name of the album artist or group for compilation albums."
- ALIAS_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- ALIAS_TYPE_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- allowedPolicies - Static variable in class org.apache.tika.parser.multiple.FallbackParser
-
The different Metadata Policies we support (all)
- allowedPolicies - Static variable in class org.apache.tika.parser.multiple.SupplementingParser
-
The different Metadata Policies we support (not discard)
- ALT_TAPE_NAME - Static variable in interface org.apache.tika.metadata.XMPDM
-
"An alternative tape name, set via the project window or timecode
dialog in Premiere."
- ALTITUDE - Static variable in interface org.apache.tika.metadata.Geographic
-
The WGS84 Altitude of the Point
- ALTITUDE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- AnnotationUtils - Class in org.apache.tika.utils
-
This class contains utilities for dealing with Tika annotations.
- AnnotationUtils() - Constructor for class org.apache.tika.utils.AnnotationUtils
-
- APP_VERSION - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- APPLICATION - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- application(String) - Static method in class org.apache.tika.mime.MediaType
-
- APPLICATION_XML - Static variable in class org.apache.tika.mime.MediaType
-
- APPLICATION_ZIP - Static variable in class org.apache.tika.mime.MediaType
-
- ARCHITECTURE_BITS - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- ARTIST - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The name of the artist or artists."
- ARTWORK_OR_OBJECT - Static variable in interface org.apache.tika.metadata.IPTC
-
A set of metadata about artwork or an object in the item
- ARTWORK_OR_OBJECT_DETAIL_COPYRIGHT_NOTICE - Static variable in interface org.apache.tika.metadata.IPTC
-
Contains any necessary copyright notice for claiming the intellectual
property for artwork or an object in the image and should identify the
current owner of the copyright of this work with associated intellectual
property rights.
- ARTWORK_OR_OBJECT_DETAIL_CREATOR - Static variable in interface org.apache.tika.metadata.IPTC
-
Contains the name of the artist who has created artwork or an object in the image.
- ARTWORK_OR_OBJECT_DETAIL_DATE_CREATED - Static variable in interface org.apache.tika.metadata.IPTC
-
Designates the date and optionally the time the artwork or object in the
image was created.
- ARTWORK_OR_OBJECT_DETAIL_SOURCE - Static variable in interface org.apache.tika.metadata.IPTC
-
The organisation or body holding and registering the artwork or object in
the image for inventory purposes.
- ARTWORK_OR_OBJECT_DETAIL_SOURCE_INVENTORY_NUMBER - Static variable in interface org.apache.tika.metadata.IPTC
-
The inventory number issued by the organisation or body holding and
registering the artwork or object in the image.
- ARTWORK_OR_OBJECT_DETAIL_TITLE - Static variable in interface org.apache.tika.metadata.IPTC
-
A reference for the artwork or object in the image.
- asInputSource() - Method in class org.apache.tika.detect.AutoDetectReader
-
- ASSEMBLE_DOCUMENT - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Can the user insert/rotate/delete pages.
- assignFieldParams(Object, Map<String, Param>) - Static method in class org.apache.tika.utils.AnnotationUtils
-
Assigns the param values to the bean.
- assignValue(Object, Object) - Method in class org.apache.tika.config.ParamField
-
Sets the given value on the annotated field of the bean.
- AsyncConfig - Class in org.apache.tika.pipes.async
-
- AsyncConfig() - Constructor for class org.apache.tika.pipes.async.AsyncConfig
-
- AsyncEmitter - Class in org.apache.tika.pipes.async
-
Worker thread that takes EmitData off the queue, batches it
and tries to emit it as a batch
- AsyncEmitter(AsyncConfig, ArrayBlockingQueue<EmitData>, EmitterManager) - Constructor for class org.apache.tika.pipes.async.AsyncEmitter
-
- AsyncProcessor - Class in org.apache.tika.pipes.async
-
This is the main class for handling async requests.
- AsyncProcessor(Path) - Constructor for class org.apache.tika.pipes.async.AsyncProcessor
-
- attachExternalParsers(TikaConfig) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
-
- attachExternalParsers(List<ExternalParser>, TikaConfig) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
-
- AttributeMatcher - Class in org.apache.tika.sax.xpath
-
Final evaluation state of a .../@* XPath expression.
- AttributeMatcher() - Constructor for class org.apache.tika.sax.xpath.AttributeMatcher
-
- audio(String) - Static method in class org.apache.tika.mime.MediaType
-
- AUDIO_CHANNEL_TYPE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The audio channel type."
- AUDIO_COMPRESSOR - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The audio compression used."
- AUDIO_MOD_DATE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The date and time when the audio was last modified."
- AUDIO_SAMPLE_RATE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The audio sample rate."
- AUDIO_SAMPLE_TYPE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The audio sample type."
- AUTHOR - Static variable in interface org.apache.tika.metadata.Office
-
Name of the principal author(s) of a document
- AUTHORS_POSITION - Static variable in interface org.apache.tika.metadata.Photoshop
-
- AutoDetectParser - Class in org.apache.tika.parser
-
- AutoDetectParser() - Constructor for class org.apache.tika.parser.AutoDetectParser
-
Creates an auto-detecting parser instance using the default Tika
configuration.
- AutoDetectParser(Detector) - Constructor for class org.apache.tika.parser.AutoDetectParser
-
- AutoDetectParser(Parser...) - Constructor for class org.apache.tika.parser.AutoDetectParser
-
Creates an auto-detecting parser instance using the specified set of parsers.
- AutoDetectParser(Detector, Parser...) - Constructor for class org.apache.tika.parser.AutoDetectParser
-
- AutoDetectParser(TikaConfig) - Constructor for class org.apache.tika.parser.AutoDetectParser
-
- AutoDetectParserFactory - Class in org.apache.tika.parser
-
Factory for an AutoDetectParser
- AutoDetectParserFactory(Map<String, String>) - Constructor for class org.apache.tika.parser.AutoDetectParserFactory
-
- AutoDetectReader - Class in org.apache.tika.detect
-
An input stream reader that automatically detects the character encoding
to be used for converting bytes to characters.
- AutoDetectReader(InputStream, Metadata, EncodingDetector) - Constructor for class org.apache.tika.detect.AutoDetectReader
-
- AutoDetectReader(InputStream, Metadata, ServiceLoader) - Constructor for class org.apache.tika.detect.AutoDetectReader
-
- AutoDetectReader(InputStream, Metadata) - Constructor for class org.apache.tika.detect.AutoDetectReader
-
- AutoDetectReader(InputStream) - Constructor for class org.apache.tika.detect.AutoDetectReader
-
- available() - Method in class org.apache.tika.io.LookaheadInputStream
-
- call() - Method in class org.apache.tika.pipes.async.AsyncEmitter
-
- call() - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- CALL - Static variable in class org.apache.tika.pipes.PipesServer
-
- CAN_MODIFY - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Can any modifications be made to the document
- CAN_MODIFY_ANNOTATIONS - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Can the user modify annotations
- CAN_PRINT - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Can the user print the document
- CAN_PRINT_DEGRADED - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Can the user print an image-degraded version of the document.
- CAPTION_WRITER - Static variable in interface org.apache.tika.metadata.Photoshop
-
- cast(InputStream) - Static method in class org.apache.tika.io.TikaInputStream
-
Returns the given stream cast to a TikaInputStream, or null
if the stream is not a TikaInputStream.
- CATEGORY - Static variable in interface org.apache.tika.metadata.IPTC
-
Deprecated.
- CATEGORY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
-
A categorization of the content of this package.
- CATEGORY - Static variable in interface org.apache.tika.metadata.Photoshop
-
- CERTIFICATE - Static variable in interface org.apache.tika.metadata.XMPRights
-
A Web URL for a rights management certificate.
- ChannelTypePropertyConverter() - Constructor for class org.apache.tika.metadata.XMPDM.ChannelTypePropertyConverter
-
Deprecated.
- CHARACTER_COUNT - Static variable in interface org.apache.tika.metadata.Office
-
The number of Characters in the document
- CHARACTER_COUNT_WITH_SPACES - Static variable in interface org.apache.tika.metadata.Office
-
The number of Characters in the document, including spaces
- characters - Variable in class org.apache.tika.mime.MimeTypesReader
-
- characters(char[], int, int) - Method in class org.apache.tika.mime.MimeTypesReader
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.DIFContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.ExpandedTitleContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.LinkContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.PhoneExtractingContentHandler
-
The characters method is called whenever a Parser wants to pass raw...
- characters(char[], int, int) - Method in class org.apache.tika.sax.SafeContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.SecureContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.StandardsExtractingContentHandler
-
The characters method is called whenever a Parser wants to pass raw
characters to the ContentHandler.
- characters(char[], int, int) - Method in class org.apache.tika.sax.TeeContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.TextContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.ToTextContentHandler
-
Writes the given characters to the given character stream.
- characters(char[], int, int) - Method in class org.apache.tika.sax.ToXMLContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.WriteOutContentHandler
-
Writes the given characters to the given character stream.
- characters(char[], int, int) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
- characters(String) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
- characters(char[], int, int) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
-
- CHARACTERS_PER_PAGE - Static variable in interface org.apache.tika.metadata.PDF
-
- CharsetUtils - Class in org.apache.tika.utils
-
- CharsetUtils() - Constructor for class org.apache.tika.utils.CharsetUtils
-
- check(String, int...) - Static method in class org.apache.tika.embedder.ExternalEmbedder
-
Checks to see if the command can be run.
- check(String[], int...) - Static method in class org.apache.tika.embedder.ExternalEmbedder
-
Checks to see if the command can be run.
- check(String, int...) - Static method in class org.apache.tika.parser.external.ExternalParser
-
Checks to see if the command can be run.
- check(String[], int...) - Static method in class org.apache.tika.parser.external.ExternalParser
-
- CHECK_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- checkActive() - Method in class org.apache.tika.pipes.async.AsyncProcessor
-
- checkHasFile() - Static method in class org.apache.tika.detect.FileCommandDetector
-
- checkHasFile(String) - Static method in class org.apache.tika.detect.FileCommandDetector
-
- checkInitialization(InitializableProblemHandler) - Method in interface org.apache.tika.config.Initializable
-
- checkInitialization(InitializableProblemHandler) - Method in class org.apache.tika.pipes.fetcher.fs.FileSystemFetcher
-
- checkInitialization(InitializableProblemHandler) - Method in class org.apache.tika.pipes.pipesiterator.FileSystemPipesIterator
-
- checkInitialization(InitializableProblemHandler) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- ChildMatcher - Class in org.apache.tika.sax.xpath
-
Intermediate evaluation state of a .../*... XPath expression.
- ChildMatcher(Matcher) - Constructor for class org.apache.tika.sax.xpath.ChildMatcher
-
- CITY - Static variable in interface org.apache.tika.metadata.IPTC
-
Name of the city the content is focussing on -- either the place shown
in visual media or referenced by text or audio media.
- CITY - Static variable in interface org.apache.tika.metadata.Photoshop
-
- clean(String) - Static method in class org.apache.tika.sax.CleanPhoneText
-
- clean(String) - Static method in class org.apache.tika.utils.CharsetUtils
-
Handle various common charset name errors, and return something
that will be considered valid (and is normalized)
- CleanPhoneText - Class in org.apache.tika.sax
-
Class to help de-obfuscate phone numbers in text.
- CleanPhoneText() - Constructor for class org.apache.tika.sax.CleanPhoneText
-
- cleanSubstitutions - Static variable in class org.apache.tika.sax.CleanPhoneText
-
- ClearByMimeMetadataFilter - Class in org.apache.tika.metadata.filter
-
This class clears the entire metadata object if the
mime matches the mime filter.
- ClearByMimeMetadataFilter() - Constructor for class org.apache.tika.metadata.filter.ClearByMimeMetadataFilter
-
- ClearByMimeMetadataFilter(Set<String>) - Constructor for class org.apache.tika.metadata.filter.ClearByMimeMetadataFilter
-
- CLIENT_UNAVAILABLE_WITHIN_MS - Static variable in class org.apache.tika.pipes.PipesResult
-
- ClimateForcast - Interface in org.apache.tika.metadata
-
- cloneMetadata(Metadata) - Static method in class org.apache.tika.utils.ParserUtils
-
Does a deep clone of a Metadata object.
- close() - Method in class org.apache.tika.fork.ForkParser
-
- close() - Method in class org.apache.tika.io.LookaheadInputStream
-
- close() - Method in class org.apache.tika.io.TemporaryResources
-
Closes all tracked resources.
- close() - Method in class org.apache.tika.io.TikaInputStream
-
- close() - Method in class org.apache.tika.language.detect.LanguageWriter
-
Ignored.
- close() - Method in class org.apache.tika.parser.ParsingReader
-
Closes the read end of the pipe.
- close() - Method in class org.apache.tika.pipes.async.AsyncProcessor
-
- close() - Method in class org.apache.tika.pipes.PipesClient
-
- close() - Method in class org.apache.tika.pipes.PipesParser
-
- close() - Method in class org.apache.tika.utils.RereadableInputStream
-
Closes the input stream and removes the temporary file if one was
created.
- COLOR_MODE - Static variable in interface org.apache.tika.metadata.Photoshop
-
- COLUMN_COUNT - Static variable in interface org.apache.tika.metadata.Database
-
- COLUMN_NAME - Static variable in interface org.apache.tika.metadata.Database
-
- COMMAND_LINE - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- COMMAND_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- COMMENT - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- COMMENT_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- COMMENTS - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- COMMENTS - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- COMPANY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- compareClassName(Object, Object) - Static method in class org.apache.tika.utils.CompareUtils
-
Compare two classes by class names.
- compareTo(Property) - Method in class org.apache.tika.metadata.Property
-
- compareTo(MediaType) - Method in class org.apache.tika.mime.MediaType
-
- compareTo(MimeType) - Method in class org.apache.tika.mime.MimeType
-
- CompareUtils - Class in org.apache.tika.utils
-
- CompareUtils() - Constructor for class org.apache.tika.utils.CompareUtils
-
- COMPILATION - Static variable in interface org.apache.tika.metadata.XMPDM
-
"An album created by various artists."
- COMPLETED_SEMAPHORE - Static variable in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- COMPOSER - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The composer's name."
- composite(Property, Property[]) - Static method in class org.apache.tika.metadata.Property
-
Constructs a new composite property from the given primary and array of secondary properties.
- CompositeDetector - Class in org.apache.tika.detect
-
Content type detector that combines multiple different detection mechanisms.
- CompositeDetector(MediaTypeRegistry, List<Detector>, Collection<Class<? extends Detector>>) - Constructor for class org.apache.tika.detect.CompositeDetector
-
- CompositeDetector(MediaTypeRegistry, List<Detector>) - Constructor for class org.apache.tika.detect.CompositeDetector
-
- CompositeDetector(List<Detector>) - Constructor for class org.apache.tika.detect.CompositeDetector
-
- CompositeDetector(Detector...) - Constructor for class org.apache.tika.detect.CompositeDetector
-
- CompositeDigester - Class in org.apache.tika.parser.digest
-
- CompositeDigester(DigestingParser.Digester...) - Constructor for class org.apache.tika.parser.digest.CompositeDigester
-
- CompositeEncodingDetector - Class in org.apache.tika.detect
-
- CompositeEncodingDetector(List<EncodingDetector>, Collection<Class<? extends EncodingDetector>>) - Constructor for class org.apache.tika.detect.CompositeEncodingDetector
-
- CompositeEncodingDetector(List<EncodingDetector>) - Constructor for class org.apache.tika.detect.CompositeEncodingDetector
-
- CompositeExternalParser - Class in org.apache.tika.parser.external
-
A Composite Parser that wraps up all the available External Parsers,
and provides an easy way to access them.
- CompositeExternalParser() - Constructor for class org.apache.tika.parser.external.CompositeExternalParser
-
- CompositeExternalParser(MediaTypeRegistry) - Constructor for class org.apache.tika.parser.external.CompositeExternalParser
-
- CompositeMatcher - Class in org.apache.tika.sax.xpath
-
Composite XPath evaluation state.
- CompositeMatcher(Matcher, Matcher) - Constructor for class org.apache.tika.sax.xpath.CompositeMatcher
-
- CompositeMetadataFilter - Class in org.apache.tika.metadata.filter
-
- CompositeMetadataFilter(List<MetadataFilter>) - Constructor for class org.apache.tika.metadata.filter.CompositeMetadataFilter
-
- CompositeParser - Class in org.apache.tika.parser
-
Composite parser that delegates parsing tasks to a component parser
based on the declared content type of the incoming document.
- CompositeParser(MediaTypeRegistry, List<Parser>, Collection<Class<? extends Parser>>) - Constructor for class org.apache.tika.parser.CompositeParser
-
- CompositeParser(MediaTypeRegistry, List<Parser>) - Constructor for class org.apache.tika.parser.CompositeParser
-
- CompositeParser(MediaTypeRegistry, Parser...) - Constructor for class org.apache.tika.parser.CompositeParser
-
- CompositeParser() - Constructor for class org.apache.tika.parser.CompositeParser
-
- ConcurrentUtils - Class in org.apache.tika.utils
-
Utility Class for Concurrency in Tika
- ConcurrentUtils() - Constructor for class org.apache.tika.utils.ConcurrentUtils
-
- ConfigBase - Class in org.apache.tika.config
-
- ConfigBase() - Constructor for class org.apache.tika.config.ConfigBase
-
- ConfigurableThreadPoolExecutor - Interface in org.apache.tika.concurrent
-
Allows Thread Pool to be Configurable.
- configure(String, InputStream) - Method in class org.apache.tika.config.ConfigBase
-
Use this to configure a subclass of ConfigBase, a single known object.
- consume(String) - Method in interface org.apache.tika.parser.external.ExternalParser.LineConsumer
-
Consume a line
- CONTACT - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- CONTACT_INFO_ADDRESS - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information address part.
- CONTACT_INFO_CITY - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information city part.
- CONTACT_INFO_COUNTRY - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information country part.
- CONTACT_INFO_EMAIL - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information email address part.
- CONTACT_INFO_PHONE - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information phone number part.
- CONTACT_INFO_POSTAL_CODE - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information part denoting the local postal code.
- CONTACT_INFO_STATE_PROVINCE - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information part denoting regional information such as state or province.
- CONTACT_INFO_WEB_URL - Static variable in interface org.apache.tika.metadata.IPTC
-
The contact information web address part.
- CONTAINER_EXCEPTION - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- ContainerExtractor - Interface in org.apache.tika.extractor
-
Tika container extractor interface.
- CONTENT_DISPOSITION - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_ENCODING - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_LANGUAGE - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_LENGTH - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_LOCATION - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_MD5 - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_STATUS - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
-
The status of the content.
- CONTENT_TYPE - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- CONTENT_TYPE_HINT - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
This is currently used to identify Content-Type that may be
included within a document, such as in html documents
(e.g.
- CONTENT_TYPE_PARSER_OVERRIDE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
This is used by parsers to override detection of embedded resources
with the override detector.
- CONTENT_TYPE_USER_OVERRIDE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
This is used by users to override detection with the override detector.
- ContentHandlerDecorator - Class in org.apache.tika.sax
-
- ContentHandlerDecorator(ContentHandler) - Constructor for class org.apache.tika.sax.ContentHandlerDecorator
-
Creates a decorator for the given SAX event handler.
- ContentHandlerDecorator() - Constructor for class org.apache.tika.sax.ContentHandlerDecorator
-
Creates a decorator that by default forwards incoming SAX events to
a dummy content handler that simply ignores all the events.
- ContentHandlerFactory - Interface in org.apache.tika.sax
-
Interface to allow easier injection of code for getting a new ContentHandler
- CONTRIBUTOR - Static variable in interface org.apache.tika.metadata.DublinCore
-
An entity responsible for making contributions to the content of the
resource.
- CONTRIBUTOR - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- CONTROLLED_VOCABULARY_TERM - Static variable in interface org.apache.tika.metadata.IPTC
-
A term to describe the content of the image by a value from a Controlled
Vocabulary.
- CONVENTIONS - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- convert(Object) - Static method in class org.apache.tika.metadata.XMPDM.ChannelTypePropertyConverter
-
Deprecated.
How a standalone converter might work
- convertAndSet(Metadata, Object) - Static method in class org.apache.tika.metadata.XMPDM.ChannelTypePropertyConverter
-
Deprecated.
How convert+set might work
- COPYRIGHT - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The copyright information."
- COPYRIGHT_NOTICE - Static variable in interface org.apache.tika.metadata.IPTC
-
Contains any necessary copyright notice for claiming the intellectual
property for this item and should identify the current owner of the
copyright for the item.
- COPYRIGHT_OWNER - Static variable in interface org.apache.tika.metadata.IPTC
-
Owner or owners of the copyright in the licensed image.
- COPYRIGHT_OWNER_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
The ID of the owner or owners of the copyright in the licensed image.
- COPYRIGHT_OWNER_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
-
- COPYRIGHT_OWNER_NAME - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of the owner or owners of the copyright in the licensed image.
- CorruptedFileException - Exception in org.apache.tika.exception
-
This exception should be thrown when the parse absolutely, positively has to stop.
- CorruptedFileException(String) - Constructor for exception org.apache.tika.exception.CorruptedFileException
-
- CorruptedFileException(String, Throwable) - Constructor for exception org.apache.tika.exception.CorruptedFileException
-
- count() - Method in class org.apache.tika.detect.TextStatistics
-
Returns the total number of bytes seen so far.
- count(int) - Method in class org.apache.tika.detect.TextStatistics
-
Returns the number of occurrences of the given byte.
- countControl() - Method in class org.apache.tika.detect.TextStatistics
-
Counts control characters (i.e.
- countEightBit() - Method in class org.apache.tika.detect.TextStatistics
-
Counts eight bit characters, i.e.
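The counting methods listed above for TextStatistics can be pictured as a byte histogram. The following is a minimal plain-JDK sketch, not the library's implementation: the class name ByteHistogramSketch and the addData method are hypothetical, and treating "eight bit" as "highest bit set" is an assumption made for illustration.

```java
// Illustrative sketch only; this is not TextStatistics' actual code.
class ByteHistogramSketch {
    private final int[] counts = new int[256]; // one slot per unsigned byte value
    private int total = 0;

    /** Adds buffer[offset .. offset + length) to the histogram. */
    void addData(byte[] buffer, int offset, int length) {
        for (int i = 0; i < length; i++) {
            counts[buffer[offset + i] & 0xff]++; // mask to treat the byte as unsigned
            total++;
        }
    }

    /** Total number of bytes seen so far. */
    int count() {
        return total;
    }

    /** Number of occurrences of the given byte value. */
    int count(int b) {
        return counts[b & 0xff];
    }

    /** Assumption: an "eight bit" byte is one with its highest bit set (0x80..0xff). */
    int countEightBit() {
        int n = 0;
        for (int b = 0x80; b <= 0xff; b++) {
            n += counts[b];
        }
        return n;
    }
}
```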
- COUNTRY - Static variable in interface org.apache.tika.metadata.IPTC
-
Full name of the country the content is focussing on -- either the
country shown in visual media or referenced in text or audio media.
- COUNTRY - Static variable in interface org.apache.tika.metadata.Photoshop
-
- COUNTRY_CODE - Static variable in interface org.apache.tika.metadata.IPTC
-
Code of the country the content is focussing on -- either the country
shown in visual media or referenced in text or audio media.
- countSafeAscii() - Method in class org.apache.tika.detect.TextStatistics
-
Counts "safe" (i.e.
- COVERAGE - Static variable in interface org.apache.tika.metadata.DublinCore
-
The extent or scope of the content of the resource.
- COVERAGE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- create() - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates an empty instance; same as calling new MimeTypes().
- create(Document) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance from the specified document.
- create(InputStream...) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance from the specified input stream.
- create(InputStream) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
- create(URL...) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance from the resource
at the location specified by the URL.
- create(URL) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
- create(String) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance from the specified file path,
as interpreted by the class loader in getResource().
- create(String, String) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance.
- create(String, String, ClassLoader) - Static method in class org.apache.tika.mime.MimeTypesFactory
-
Creates and returns a MimeTypes instance.
- create() - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
-
- create(ServiceLoader) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
-
- create(String, ServiceLoader) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
-
- create(URL...) - Static method in class org.apache.tika.parser.external.ExternalParsersFactory
-
- CREATE_DATE - Static variable in interface org.apache.tika.metadata.XMP
-
The date and time the resource was created.
- CREATED - Static variable in interface org.apache.tika.metadata.DublinCore
-
Date of creation of the resource.
- CREATED - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- createTempFile() - Method in class org.apache.tika.io.TemporaryResources
-
Creates a temporary file that will automatically be deleted when
the TemporaryResources.close() method is called, returning its path.
- createTemporaryFile() - Method in class org.apache.tika.io.TemporaryResources
-
- CREATION_DATE - Static variable in interface org.apache.tika.metadata.Office
-
When was the document created?
- CreativeCommons - Interface in org.apache.tika.metadata
-
A collection of Creative Commons properties names.
- CREATOR - Static variable in interface org.apache.tika.metadata.DublinCore
-
An entity primarily responsible for making the content of the resource.
- CREATOR - Static variable in interface org.apache.tika.metadata.IPTC
-
Contains the name of the person who created the content of this item, a
photographer for photos, a graphic artist for graphics, or a writer for
textual news, but in cases where the photographer should not be
identified the name of a company or organisation may be appropriate.
- CREATOR - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- CREATOR_TOOL - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- CREATOR_TOOL - Static variable in interface org.apache.tika.metadata.XMP
-
The name of the first known tool used to create the resource.
- CREATORS_CONTACT_INFO - Static variable in interface org.apache.tika.metadata.IPTC
-
The creator's contact information provides all necessary information to
get in contact with the creator of this item and comprises a set of
sub-properties for proper addressing.
- CREATORS_JOB_TITLE - Static variable in interface org.apache.tika.metadata.IPTC
-
Contains the job title of the person who created the content of this
item.
- CREDIT - Static variable in interface org.apache.tika.metadata.Photoshop
-
- CREDIT_LINE - Static variable in interface org.apache.tika.metadata.IPTC
-
The credit to person(s) and/or organisation(s) required by the supplier
of the item to be used when published.
- CryptoParser - Class in org.apache.tika.parser
-
Decrypts the incoming document stream and delegates further parsing to
another parser instance.
- CryptoParser(String, Provider, Set<MediaType>) - Constructor for class org.apache.tika.parser.CryptoParser
-
- CryptoParser(String, Set<MediaType>) - Constructor for class org.apache.tika.parser.CryptoParser
-
- CUSTOM_MIMES_SYS_PROP - Static variable in class org.apache.tika.mime.MimeTypesFactory
-
System property to set a path to an additional external custom mimetypes
XML file to be loaded.
- Database - Interface in org.apache.tika.metadata
-
- DATE - Static variable in interface org.apache.tika.metadata.DublinCore
-
A date associated with an event in the life cycle of the resource.
- DATE_CREATED - Static variable in interface org.apache.tika.metadata.IPTC
-
Designates the date and optionally the time the intellectual content was
created rather than the date of the creation of the physical
representation.
- DATE_CREATED - Static variable in interface org.apache.tika.metadata.Photoshop
-
- DateUtils - Class in org.apache.tika.utils
-
Date related utility methods and constants
- DateUtils() - Constructor for class org.apache.tika.utils.DateUtils
-
- decode(String) - Static method in class org.apache.tika.mime.HexCoDec
-
Decode a hex string
- decode(char[]) - Static method in class org.apache.tika.mime.HexCoDec
-
Decode an array of hex chars
- decode(char[], int, int) - Static method in class org.apache.tika.mime.HexCoDec
-
Decode an array of hex chars.
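As an illustration of what decoding a hex string involves, here is a minimal plain-JDK sketch. HexSketch is a hypothetical name and the error handling is an assumption; this is not the HexCoDec implementation.

```java
// Illustrative sketch only; HexSketch is not part of the library.
class HexSketch {
    /** Decodes a string of hex digit pairs, e.g. "cafe", into bytes. */
    static byte[] decode(String hex) {
        if (hex.length() % 2 != 0) {
            throw new IllegalArgumentException("hex string must have an even length");
        }
        byte[] out = new byte[hex.length() / 2];
        for (int i = 0; i < out.length; i++) {
            int hi = Character.digit(hex.charAt(2 * i), 16);     // high nibble
            int lo = Character.digit(hex.charAt(2 * i + 1), 16); // low nibble
            if (hi < 0 || lo < 0) {
                throw new IllegalArgumentException("not a hex digit near index " + (2 * i));
            }
            out[i] = (byte) ((hi << 4) | lo);
        }
        return out;
    }
}
```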
- DEFAULT - Static variable in interface org.apache.tika.config.InitializableProblemHandler
-
- DEFAULT - Static variable in class org.apache.tika.config.ParamField
-
- DEFAULT_HANDLER_CONFIG - Static variable in class org.apache.tika.pipes.HandlerConfig
-
- DEFAULT_MAX_ENTITY_EXPANSIONS - Static variable in class org.apache.tika.utils.XMLReaderUtils
-
- DEFAULT_MAX_WAIT_MS - Static variable in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- DEFAULT_ON_PARSE_EXCEPTION - Static variable in class org.apache.tika.pipes.FetchEmitTuple
-
- DEFAULT_POOL_SIZE - Static variable in class org.apache.tika.utils.XMLReaderUtils
-
Default size for the pool of SAX Parsers
and the pool of DOM builders
- DEFAULT_QUEUE_SIZE - Static variable in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- DefaultDetector - Class in org.apache.tika.detect
-
- DefaultDetector(MimeTypes, ServiceLoader, Collection<Class<? extends Detector>>) - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultDetector(MimeTypes, ServiceLoader) - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultDetector(MimeTypes, ClassLoader) - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultDetector(ClassLoader) - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultDetector(MimeTypes) - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultDetector() - Constructor for class org.apache.tika.detect.DefaultDetector
-
- DefaultEmbeddedStreamTranslator - Class in org.apache.tika.extractor
-
Loads EmbeddedStreamTranslators via service loading.
- DefaultEmbeddedStreamTranslator() - Constructor for class org.apache.tika.extractor.DefaultEmbeddedStreamTranslator
-
- DefaultEncodingDetector - Class in org.apache.tika.detect
-
- DefaultEncodingDetector() - Constructor for class org.apache.tika.detect.DefaultEncodingDetector
-
- DefaultEncodingDetector(ServiceLoader) - Constructor for class org.apache.tika.detect.DefaultEncodingDetector
-
- DefaultEncodingDetector(ServiceLoader, Collection<Class<? extends EncodingDetector>>) - Constructor for class org.apache.tika.detect.DefaultEncodingDetector
-
- DefaultMetadataFilter - Class in org.apache.tika.metadata.filter
-
- DefaultMetadataFilter(ServiceLoader) - Constructor for class org.apache.tika.metadata.filter.DefaultMetadataFilter
-
- DefaultMetadataFilter(List<MetadataFilter>) - Constructor for class org.apache.tika.metadata.filter.DefaultMetadataFilter
-
- DefaultMetadataFilter() - Constructor for class org.apache.tika.metadata.filter.DefaultMetadataFilter
-
- DefaultParser - Class in org.apache.tika.parser
-
- DefaultParser(MediaTypeRegistry, ServiceLoader, Collection<Class<? extends Parser>>, EncodingDetector) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser(MediaTypeRegistry, ServiceLoader, Collection<Class<? extends Parser>>) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser(MediaTypeRegistry, ServiceLoader, EncodingDetector) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser(MediaTypeRegistry, ServiceLoader) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser(MediaTypeRegistry, ClassLoader) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser(ClassLoader) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser(MediaTypeRegistry) - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultParser() - Constructor for class org.apache.tika.parser.DefaultParser
-
- DefaultProbDetector - Class in org.apache.tika.detect
-
A version of DefaultDetector for probabilistic mime
detectors, which use statistical techniques to blend the
results of differing underlying detectors when attempting
to detect the type of a given file.
- DefaultProbDetector(ProbabilisticMimeDetectionSelector, ServiceLoader) - Constructor for class org.apache.tika.detect.DefaultProbDetector
-
- DefaultProbDetector(ProbabilisticMimeDetectionSelector, ClassLoader) - Constructor for class org.apache.tika.detect.DefaultProbDetector
-
- DefaultProbDetector(ClassLoader) - Constructor for class org.apache.tika.detect.DefaultProbDetector
-
- DefaultProbDetector(MimeTypes) - Constructor for class org.apache.tika.detect.DefaultProbDetector
-
- DefaultProbDetector() - Constructor for class org.apache.tika.detect.DefaultProbDetector
-
- DefaultTranslator - Class in org.apache.tika.language.translate
-
- DefaultTranslator(ServiceLoader) - Constructor for class org.apache.tika.language.translate.DefaultTranslator
-
- DefaultTranslator() - Constructor for class org.apache.tika.language.translate.DefaultTranslator
-
- DelegatingParser - Class in org.apache.tika.parser
-
Base class for parser implementations that want to delegate parts of the
task of parsing an input document to another parser.
- DelegatingParser() - Constructor for class org.apache.tika.parser.DelegatingParser
-
- DERIVED_FROM_DOCUMENTID - Static variable in interface org.apache.tika.metadata.XMPMM
-
Document id for the document that this document
was derived from
- DERIVED_FROM_INSTANCEID - Static variable in interface org.apache.tika.metadata.XMPMM
-
Instance id for the document instance that this
document was derived from
- descend(String, String) - Method in class org.apache.tika.sax.xpath.ChildMatcher
-
- descend(String, String) - Method in class org.apache.tika.sax.xpath.CompositeMatcher
-
- descend(String, String) - Method in class org.apache.tika.sax.xpath.Matcher
-
Returns the XPath evaluation state that results from descending
to a child element with the given name.
- descend(String, String) - Method in class org.apache.tika.sax.xpath.NamedElementMatcher
-
- descend(String, String) - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
-
- DESCRIPTION - Static variable in interface org.apache.tika.metadata.DublinCore
-
An account of the content of the resource.
- DESCRIPTION - Static variable in interface org.apache.tika.metadata.IPTC
-
A textual description, including captions, of the item's content,
particularly used where the object is not text.
- DESCRIPTION - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- DESCRIPTION_WRITER - Static variable in interface org.apache.tika.metadata.IPTC
-
Identifier or the name of the person involved in writing, editing or
correcting the description of the content.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.CompositeDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.CompositeEncodingDetector
-
- detect(InputStream, Metadata) - Method in interface org.apache.tika.detect.Detector
-
Detects the content type of the given input document.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.EmptyDetector
-
- detect(InputStream, Metadata) - Method in interface org.apache.tika.detect.EncodingDetector
-
Detects the character encoding of the given text document, or
null if the encoding of the document can not be detected.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.FileCommandDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.MagicDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.NameDetector
-
Detects the content type of an input document based on the document
name given in the input metadata.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.NonDetectingEncodingDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.OverrideDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.TextDetector
-
Looks at the beginning of the document input stream to determine
whether the document is text or not.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.TrainedModelDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.TypeDetector
-
Detects the content type of an input document based on a type hint
given in the input metadata.
- detect(InputStream, Metadata) - Method in class org.apache.tika.detect.ZeroSizeFileDetector
-
- detect() - Method in class org.apache.tika.language.detect.LanguageDetector
-
- detect(CharSequence) - Method in class org.apache.tika.language.detect.LanguageDetector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.mime.MimeTypes
-
Automatically detects the MIME type of a document based on magic
markers in the stream prefix and any given metadata hints.
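Detection from magic markers in the stream prefix can be sketched with plain JDK streams: peek at the first bytes using mark/reset so that the caller can still read the stream from the start. MagicSketch, the single PDF marker it checks, and the fallback type are assumptions for illustration, not the MimeTypes implementation.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.charset.StandardCharsets;
import java.util.Arrays;

// Illustrative sketch only; MagicSketch is not part of the library.
class MagicSketch {
    private static final byte[] PDF_MAGIC = "%PDF-".getBytes(StandardCharsets.US_ASCII);

    /** Peeks at the stream prefix and leaves the stream positioned where it started. */
    static String detect(InputStream in) {
        if (!in.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        try {
            in.mark(PDF_MAGIC.length);
            byte[] prefix = new byte[PDF_MAGIC.length];
            int n = 0;
            int r;
            // read() may return fewer bytes than requested, so loop until full or EOF
            while (n < prefix.length && (r = in.read(prefix, n, prefix.length - n)) != -1) {
                n += r;
            }
            in.reset(); // rewind so later readers see the whole document
            if (n == PDF_MAGIC.length && Arrays.equals(prefix, PDF_MAGIC)) {
                return "application/pdf";
            }
            return "application/octet-stream"; // generic fallback when nothing matches
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```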
- detect(InputStream, Metadata) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
-
- detect(InputStream, Metadata) - Method in class org.apache.tika.Tika
-
Detects the media type of the given document.
- detect(InputStream, String) - Method in class org.apache.tika.Tika
-
Detects the media type of the given document.
- detect(InputStream) - Method in class org.apache.tika.Tika
-
Detects the media type of the given document.
- detect(byte[], String) - Method in class org.apache.tika.Tika
-
Detects the media type of the given document.
- detect(byte[]) - Method in class org.apache.tika.Tika
-
Detects the media type of the given document.
- detect(Path) - Method in class org.apache.tika.Tika
-
Detects the media type of the file at the given path.
- detect(File) - Method in class org.apache.tika.Tika
-
Detects the media type of the given file.
- detect(URL) - Method in class org.apache.tika.Tika
-
Detects the media type of the resource at the given URL.
- detect(String) - Method in class org.apache.tika.Tika
-
Detects the media type of a document with the given file name.
- detectAll() - Method in class org.apache.tika.language.detect.LanguageDetector
-
Detect languages based on previously submitted text (via addText calls).
- detectAll(String) - Method in class org.apache.tika.language.detect.LanguageDetector
-
Utility wrapper that detects the language of a given chunk of text.
- Detector - Interface in org.apache.tika.detect
-
Content type detector.
- DIFContentHandler - Class in org.apache.tika.sax
-
- DIFContentHandler(ContentHandler, Metadata) - Constructor for class org.apache.tika.sax.DIFContentHandler
-
- digest(InputStream, Metadata, ParseContext) - Method in class org.apache.tika.parser.digest.CompositeDigester
-
- digest(InputStream, Metadata, ParseContext) - Method in class org.apache.tika.parser.digest.InputStreamDigester
-
- digest(InputStream, Metadata, ParseContext) - Method in interface org.apache.tika.parser.DigestingParser.Digester
-
Digests an InputStream and sets the appropriate value(s) in the metadata.
- DigestingParser - Class in org.apache.tika.parser
-
- DigestingParser(Parser, DigestingParser.Digester) - Constructor for class org.apache.tika.parser.DigestingParser
-
Creates a decorator for the given parser.
- DigestingParser.Digester - Interface in org.apache.tika.parser
-
Interface for digester.
- DigestingParser.Encoder - Interface in org.apache.tika.parser
-
Encodes byte array from a MessageDigest to String
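The digest-then-encode flow described by the Digester and Encoder entries can be sketched with the JDK alone: feed the stream through a MessageDigest, then turn the resulting bytes into a string. DigestSketch, the choice of SHA-256 and the lowercase hex encoding are assumptions for illustration; a real digester would also record the value in the metadata.

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.security.MessageDigest;
import java.security.NoSuchAlgorithmException;

// Illustrative sketch only; DigestSketch is not part of the library.
class DigestSketch {
    /** Digests the whole stream with SHA-256 and hex-encodes the result. */
    static String sha256Hex(InputStream in) {
        try {
            MessageDigest md = MessageDigest.getInstance("SHA-256");
            byte[] buf = new byte[8192];
            int n;
            while ((n = in.read(buf)) != -1) {
                md.update(buf, 0, n); // feed only the bytes actually read
            }
            StringBuilder sb = new StringBuilder();
            for (byte b : md.digest()) {
                sb.append(String.format("%02x", b & 0xff)); // two lowercase hex digits per byte
            }
            return sb.toString();
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        } catch (NoSuchAlgorithmException e) {
            throw new IllegalStateException(e);
        }
    }
}
```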
- DIGITAL_IMAGE_GUID - Static variable in interface org.apache.tika.metadata.IPTC
-
Globally unique identifier for the item.
- DIGITAL_SOURCE_FILE_TYPE - Static variable in interface org.apache.tika.metadata.IPTC
-
Deprecated.
- DIGITAL_SOURCE_TYPE - Static variable in interface org.apache.tika.metadata.IPTC
-
The type of the source of this digital image
- DISC_NUMBER - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The disc number for part of an album set."
- dispose() - Method in class org.apache.tika.io.TemporaryResources
-
- DOC_INFO_CREATED - Static variable in interface org.apache.tika.metadata.PDF
-
- DOC_INFO_CREATOR - Static variable in interface org.apache.tika.metadata.PDF
-
- DOC_INFO_CREATOR_TOOL - Static variable in interface org.apache.tika.metadata.PDF
-
- DOC_INFO_KEY_WORDS - Static variable in interface org.apache.tika.metadata.PDF
-
- DOC_INFO_MODIFICATION_DATE - Static variable in interface org.apache.tika.metadata.PDF
-
- DOC_INFO_PRODUCER - Static variable in interface org.apache.tika.metadata.PDF
-
- DOC_INFO_SUBJECT - Static variable in interface org.apache.tika.metadata.PDF
-
- DOC_INFO_TITLE - Static variable in interface org.apache.tika.metadata.PDF
-
- DOC_INFO_TRAPPED - Static variable in interface org.apache.tika.metadata.PDF
-
- DOC_SECURITY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- DOC_SECURITY_STRING - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- DOCUMENTID - Static variable in interface org.apache.tika.metadata.XMPMM
-
The common identifier for all versions and renditions of a resource.
- DocumentSelector - Interface in org.apache.tika.extractor
-
Interface for different document selection strategies for purposes like
embedded document extraction by a ContainerExtractor instance.
- DublinCore - Interface in org.apache.tika.metadata
-
A collection of Dublin Core metadata names.
- DURATION - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The duration of the media file."
Value is in Seconds, unless xmpDM:scale is also set.
- element(String, String) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
Emits an XHTML element with the given text content.
- ElementMappingContentHandler - Class in org.apache.tika.sax
-
Content handler decorator that maps element QNames using a Map.
- ElementMappingContentHandler(ContentHandler, Map<QName, ElementMappingContentHandler.TargetElement>) - Constructor for class org.apache.tika.sax.ElementMappingContentHandler
-
- ElementMappingContentHandler.TargetElement - Class in org.apache.tika.sax
-
- ElementMatcher - Class in org.apache.tika.sax.xpath
-
Final evaluation state of an XPath expression that targets an element.
- ElementMatcher() - Constructor for class org.apache.tika.sax.xpath.ElementMatcher
-
- EMB_APP_VERSION - Static variable in interface org.apache.tika.metadata.RTFMetadata
-
If an application and version are given as part of the
embedded object, this is the literal string.
- EMB_CLASS - Static variable in interface org.apache.tika.metadata.RTFMetadata
-
- EMB_ITEM - Static variable in interface org.apache.tika.metadata.RTFMetadata
-
- EMB_TOPIC - Static variable in interface org.apache.tika.metadata.RTFMetadata
-
- embed(Metadata, InputStream, OutputStream, ParseContext) - Method in interface org.apache.tika.embedder.Embedder
-
Embeds related document metadata from the given metadata object into the
given output stream.
- embed(Metadata, InputStream, OutputStream, ParseContext) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Executes the configured external command and passes the given document
stream as a simple XHTML document to the given SAX content handler.
- EMBEDDED_DEPTH - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- EMBEDDED_EXCEPTION - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- EMBEDDED_PARSER - Static variable in class org.apache.tika.utils.ParserUtils
-
- EMBEDDED_RELATIONSHIP_ID - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- EMBEDDED_RESOURCE_LIMIT_REACHED - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- EMBEDDED_RESOURCE_LIMIT_REACHED - Static variable in class org.apache.tika.sax.AbstractRecursiveParserWrapperHandler
-
- EMBEDDED_RESOURCE_PATH - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- EMBEDDED_RESOURCE_TYPE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
Embedded resource type property
- EMBEDDED_RESOURCE_TYPE_KEY - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- EMBEDDED_STORAGE_CLASS_ID - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- EmbeddedContentHandler - Class in org.apache.tika.sax
-
- EmbeddedContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.EmbeddedContentHandler
-
- EmbeddedDocumentExtractor - Interface in org.apache.tika.extractor
-
- EmbeddedDocumentUtil - Class in org.apache.tika.extractor
-
Utility class to handle common issues with embedded documents.
- EmbeddedDocumentUtil(ParseContext) - Constructor for class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- EmbeddedResourceHandler - Interface in org.apache.tika.extractor
-
Tika container extractor callback interface.
- EmbeddedStreamTranslator - Interface in org.apache.tika.extractor
-
Interface for different filtering of embedded streams.
- Embedder - Interface in org.apache.tika.embedder
-
Tika embedder interface
- emit(List<? extends EmitData>) - Method in class org.apache.tika.pipes.emitter.AbstractEmitter
-
- emit(String, List<Metadata>) - Method in interface org.apache.tika.pipes.emitter.Emitter
-
- emit(List<? extends EmitData>) - Method in interface org.apache.tika.pipes.emitter.Emitter
-
- emit(String, List<Metadata>) - Method in class org.apache.tika.pipes.emitter.EmptyEmitter
-
- emit(List<? extends EmitData>) - Method in class org.apache.tika.pipes.emitter.EmptyEmitter
-
- emit(String, InputStream, Metadata) - Method in interface org.apache.tika.pipes.emitter.StreamEmitter
-
- EMIT_EXCEPTION - Static variable in class org.apache.tika.pipes.PipesServer
-
- EMIT_SUCCESS - Static variable in class org.apache.tika.pipes.PipesResult
-
- EMIT_SUCCESS - Static variable in class org.apache.tika.pipes.PipesServer
-
- EMIT_SUCCESS_PARSE_EXCEPTION - Static variable in class org.apache.tika.pipes.PipesServer
-
- EmitData - Class in org.apache.tika.pipes.emitter
-
- EmitData(EmitKey, List<Metadata>) - Constructor for class org.apache.tika.pipes.emitter.EmitData
-
- EmitKey - Class in org.apache.tika.pipes.emitter
-
- EmitKey() - Constructor for class org.apache.tika.pipes.emitter.EmitKey
-
- EmitKey(String, String) - Constructor for class org.apache.tika.pipes.emitter.EmitKey
-
- Emitter - Interface in org.apache.tika.pipes.emitter
-
- EmitterManager - Class in org.apache.tika.pipes.emitter
-
Utility class that will apply the appropriate fetcher
to the fetcherString based on the prefix.
- EmitterManager(List<Emitter>) - Constructor for class org.apache.tika.pipes.emitter.EmitterManager
-
- EMPTY - Static variable in class org.apache.tika.mime.MediaType
-
- EMPTY - Static variable in class org.apache.tika.utils.StringUtils
-
The empty String "".
- EMPTY_OUTPUT - Static variable in class org.apache.tika.pipes.PipesServer
-
- EmptyDetector - Class in org.apache.tika.detect
-
Dummy detector that returns application/octet-stream for all documents.
- EmptyDetector() - Constructor for class org.apache.tika.detect.EmptyDetector
-
- EmptyEmitter - Class in org.apache.tika.pipes.emitter
-
- EmptyEmitter() - Constructor for class org.apache.tika.pipes.emitter.EmptyEmitter
-
- EmptyFetcher - Class in org.apache.tika.pipes.fetcher
-
- EmptyFetcher() - Constructor for class org.apache.tika.pipes.fetcher.EmptyFetcher
-
- EmptyParser - Class in org.apache.tika.parser
-
Dummy parser that always produces an empty XHTML document without even
attempting to parse the given document stream.
- EmptyParser() - Constructor for class org.apache.tika.parser.EmptyParser
-
- EmptyTranslator - Class in org.apache.tika.language.translate
-
Dummy translator that always declines to give any text.
- EmptyTranslator() - Constructor for class org.apache.tika.language.translate.EmptyTranslator
-
- encode(byte[]) - Static method in class org.apache.tika.mime.HexCoDec
-
Hex encode an array of bytes
- encode(byte[], int, int) - Static method in class org.apache.tika.mime.HexCoDec
-
Hex encode an array of bytes
- encode(byte[]) - Method in interface org.apache.tika.parser.DigestingParser.Encoder
-
- EncodingDetector - Interface in org.apache.tika.detect
-
Character encoding detector.
- ENCRYPTED - Static variable in interface org.apache.tika.metadata.WordPerfect
-
Is encrypted?
- EncryptedDocumentException - Exception in org.apache.tika.exception
-
- EncryptedDocumentException() - Constructor for exception org.apache.tika.exception.EncryptedDocumentException
-
- EncryptedDocumentException(Throwable) - Constructor for exception org.apache.tika.exception.EncryptedDocumentException
-
- EncryptedDocumentException(String) - Constructor for exception org.apache.tika.exception.EncryptedDocumentException
-
- EncryptedDocumentException(String, Throwable) - Constructor for exception org.apache.tika.exception.EncryptedDocumentException
-
- endDescription() - Method in class org.apache.tika.sax.XMPContentHandler
-
- endDocument(ContentHandler, Metadata) - Method in class org.apache.tika.sax.AbstractRecursiveParserWrapperHandler
-
This is called after the full parse has completed.
- endDocument() - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- endDocument() - Method in class org.apache.tika.sax.DIFContentHandler
-
- endDocument() - Method in class org.apache.tika.sax.EmbeddedContentHandler
-
Ignored.
- endDocument() - Method in class org.apache.tika.sax.EndDocumentShieldingContentHandler
-
- endDocument() - Method in class org.apache.tika.sax.PhoneExtractingContentHandler
-
This method is called whenever the Parser is done parsing the file.
- endDocument(ContentHandler, Metadata) - Method in class org.apache.tika.sax.RecursiveParserWrapperHandler
-
- endDocument() - Method in class org.apache.tika.sax.SafeContentHandler
-
- endDocument() - Method in class org.apache.tika.sax.StandardsExtractingContentHandler
-
This method is called whenever the Parser is done parsing the file.
- endDocument() - Method in class org.apache.tika.sax.TeeContentHandler
-
- endDocument() - Method in class org.apache.tika.sax.TextContentHandler
-
- endDocument() - Method in class org.apache.tika.sax.ToTextContentHandler
-
Flushes the character stream so that no characters are forgotten
in internal buffers.
- endDocument() - Method in class org.apache.tika.sax.XHTMLContentHandler
-
Ends the XHTML document by writing the following footer and
clearing the namespace mappings:
- endDocument() - Method in class org.apache.tika.sax.XMPContentHandler
-
Ends the XMP document by writing the following footer and
clearing the namespace mappings:
- EndDocumentShieldingContentHandler - Class in org.apache.tika.sax
-
- EndDocumentShieldingContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.EndDocumentShieldingContentHandler
-
Creates a decorator for the given SAX event handler.
- endElement(String, String, String) - Method in class org.apache.tika.mime.MimeTypesReader
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.DIFContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.ElementMappingContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.ExpandedTitleContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.LinkContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.SafeContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.SecureContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.TeeContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.ToHTMLContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.ToTextContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.ToXMLContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
Ends the given element.
- endElement(String) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
- endElement(String, String, String) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
-
- endEmbeddedDocument(ContentHandler, Metadata) - Method in class org.apache.tika.sax.AbstractRecursiveParserWrapperHandler
-
This is called after parsing each embedded document.
- endEmbeddedDocument(ContentHandler, Metadata) - Method in class org.apache.tika.sax.RecursiveParserWrapperHandler
-
This is called after parsing an embedded document.
- ENDIAN - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- EndianUtils - Class in org.apache.tika.io
-
General Endian Related Utilities.
- EndianUtils() - Constructor for class org.apache.tika.io.EndianUtils
-
- EndianUtils.BufferUnderrunException - Exception in org.apache.tika.io
-
- ENDLINE - Static variable in class org.apache.tika.sax.XHTMLContentHandler
-
- endPrefixMapping(String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- endPrefixMapping(String) - Method in class org.apache.tika.sax.TeeContentHandler
-
- ENGINEER - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The engineer's name."
- enqueue() - Method in class org.apache.tika.pipes.pipesiterator.FileSystemPipesIterator
-
- enqueue() - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- ensureStreamReReadable(InputStream, TemporaryResources) - Static method in class org.apache.tika.utils.ParserUtils
-
Ensures that the Stream will be able to be re-read, by buffering to
a temporary file if required.
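Buffering a stream to a temporary file so that it can be read more than once can be sketched as follows. ReReadableSketch and its methods are hypothetical names and the sketch uses only the JDK; it is not the ParserUtils or TemporaryResources implementation, and it always spools to disk rather than only "if required".

```java
import java.io.IOException;
import java.io.InputStream;
import java.io.UncheckedIOException;
import java.nio.file.Files;
import java.nio.file.Path;
import java.nio.file.StandardCopyOption;

// Illustrative sketch only; ReReadableSketch is not part of the library.
class ReReadableSketch implements AutoCloseable {
    private final Path tmp;

    /** Spools the whole stream to a temporary file. */
    ReReadableSketch(InputStream in) {
        Path p;
        try {
            p = Files.createTempFile("sketch-", ".tmp");
            Files.copy(in, p, StandardCopyOption.REPLACE_EXISTING);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
        this.tmp = p;
    }

    /** Can be called any number of times; each call re-reads the spooled file. */
    byte[] readAll() {
        try {
            return Files.readAllBytes(tmp);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }

    boolean exists() {
        return Files.exists(tmp);
    }

    /** Deletes the temporary file. */
    @Override
    public void close() {
        try {
            Files.deleteIfExists(tmp);
        } catch (IOException e) {
            throw new UncheckedIOException(e);
        }
    }
}
```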
- equals(String, String) - Static method in class org.apache.tika.language.detect.LanguageNames
-
- equals(Object) - Method in class org.apache.tika.metadata.Metadata
-
- equals(Object) - Method in class org.apache.tika.metadata.Property
-
- equals(Object) - Method in class org.apache.tika.mime.MediaType
-
- equals(Object) - Method in class org.apache.tika.mime.MimeType
-
- equals(Object) - Method in class org.apache.tika.pipes.emitter.EmitKey
-
- equals(Object) - Method in class org.apache.tika.pipes.FetchEmitTuple
-
- equals(Object) - Method in class org.apache.tika.pipes.fetcher.FetchKey
-
- equals(Object) - Method in class org.apache.tika.pipes.HandlerConfig
-
- EQUIPMENT_MAKE - Static variable in interface org.apache.tika.metadata.TIFF
-
"Manufacturer of the recording equipment."
- EQUIPMENT_MODEL - Static variable in interface org.apache.tika.metadata.TIFF
-
"Model name or number of the recording equipment."
- ERROR_CODES_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- ErrorParser - Class in org.apache.tika.parser
-
Dummy parser that always throws a TikaException without even
attempting to parse the given document stream.
- ErrorParser() - Constructor for class org.apache.tika.parser.ErrorParser
-
- escapeCommandLine(String) - Static method in class org.apache.tika.utils.ProcessUtils
-
This should correctly put double quotes around an argument in cases
where ProcessBuilder doesn't seem to work (as it doesn't
on paths with spaces on Windows).
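The quoting idea can be sketched in plain Java. This is a minimal illustration and not the ProcessUtils source: the class name QuoteSketch and the method quoteIfNeeded are hypothetical, and the sketch assumes that only arguments containing a space need quoting and that it ignores the operating system.

```java
// Hypothetical sketch of argument quoting; not the Tika implementation.
class QuoteSketch {

    // Wraps the argument in double quotes when it contains a space
    // and is not already wrapped in double quotes.
    static String quoteIfNeeded(String arg) {
        if (arg == null) {
            return null;
        }
        boolean alreadyQuoted =
                arg.length() >= 2 && arg.startsWith("\"") && arg.endsWith("\"");
        if (arg.contains(" ") && !alreadyQuoted) {
            return "\"" + arg + "\"";
        }
        return arg;
    }
}
```

Arguments without spaces, and arguments that are already quoted, pass through unchanged in this sketch.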
- EVENT - Static variable in interface org.apache.tika.metadata.IPTC
-
Names or describes the specific event the content relates to.
- ExceptionUtils - Class in org.apache.tika.utils
-
- ExceptionUtils() - Constructor for class org.apache.tika.utils.ExceptionUtils
-
- ExcludeFieldMetadataFilter - Class in org.apache.tika.metadata.filter
-
- ExcludeFieldMetadataFilter() - Constructor for class org.apache.tika.metadata.filter.ExcludeFieldMetadataFilter
-
- ExcludeFieldMetadataFilter(Set<String>) - Constructor for class org.apache.tika.metadata.filter.ExcludeFieldMetadataFilter
-
- execute(ParseContext, Runnable) - Static method in class org.apache.tika.utils.ConcurrentUtils
-
Execute a runnable using an ExecutorService from the ParseContext if possible.
- EXIF_PAGE_COUNT - Static variable in interface org.apache.tika.metadata.TIFF
-
- ExpandedTitleContentHandler - Class in org.apache.tika.sax
-
- ExpandedTitleContentHandler() - Constructor for class org.apache.tika.sax.ExpandedTitleContentHandler
-
- ExpandedTitleContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.ExpandedTitleContentHandler
-
- EXPERIMENT_ID - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- EXPOSURE_TIME - Static variable in interface org.apache.tika.metadata.TIFF
-
"Exposure time in seconds."
- extension_neg(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
-
- extension_trust(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
-
- EXTERNAL_PARSERS_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- externalBoolean(String) - Static method in class org.apache.tika.metadata.Property
-
- externalBooleanSeq(String) - Static method in class org.apache.tika.metadata.Property
-
- externalClosedChoise(String, String...) - Static method in class org.apache.tika.metadata.Property
-
- externalDate(String) - Static method in class org.apache.tika.metadata.Property
-
- ExternalEmbedder - Class in org.apache.tika.embedder
-
Embedder that uses an external program (like sed or exiftool) to embed text
content and metadata into a given document.
- ExternalEmbedder() - Constructor for class org.apache.tika.embedder.ExternalEmbedder
-
- externalInteger(String) - Static method in class org.apache.tika.metadata.Property
-
- externalOpenChoise(String, String...) - Static method in class org.apache.tika.metadata.Property
-
- ExternalParser - Class in org.apache.tika.parser.external
-
Parser that uses an external program (like catdoc or pdf2txt) to extract
text content and metadata from a given document.
- ExternalParser() - Constructor for class org.apache.tika.parser.external.ExternalParser
-
- ExternalParser.LineConsumer - Interface in org.apache.tika.parser.external
-
Consumer contract
- ExternalParsersConfigReader - Class in org.apache.tika.parser.external
-
Builds up ExternalParser instances based on XML file(s)
which define what to run, for what, and how to process
any output metadata.
- ExternalParsersConfigReader() - Constructor for class org.apache.tika.parser.external.ExternalParsersConfigReader
-
- ExternalParsersConfigReaderMetKeys - Interface in org.apache.tika.parser.external
-
- ExternalParsersFactory - Class in org.apache.tika.parser.external
-
Creates instances of ExternalParser based on XML
configuration files.
- ExternalParsersFactory() - Constructor for class org.apache.tika.parser.external.ExternalParsersFactory
-
- externalReal(String) - Static method in class org.apache.tika.metadata.Property
-
- externalRealSeq(String) - Static method in class org.apache.tika.metadata.Property
-
- externalText(String) - Static method in class org.apache.tika.metadata.Property
-
- externalTextBag(String) - Static method in class org.apache.tika.metadata.Property
-
- extract(TikaInputStream, ContainerExtractor, EmbeddedResourceHandler) - Method in interface org.apache.tika.extractor.ContainerExtractor
-
Processes a container file, and extracts all the embedded
resources from within it.
- extract(TikaInputStream, ContainerExtractor, EmbeddedResourceHandler) - Method in class org.apache.tika.extractor.ParserContainerExtractor
-
- EXTRACT_CONTENT - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Should content be extracted, generally.
- EXTRACT_FOR_ACCESSIBILITY - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
Should content be extracted for the purposes
of accessibility.
- extractLinks(String) - Static method in class org.apache.tika.utils.RegexUtils
-
Extract URLs from plain text.
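As a rough illustration of URL extraction from plain text, the sketch below uses java.util.regex. The class LinkSketch and its deliberately simple pattern are assumptions made for this example; the pattern actually used by RegexUtils is not shown in this index and is likely stricter.

```java
import java.util.ArrayList;
import java.util.List;
import java.util.regex.Matcher;
import java.util.regex.Pattern;

// Hypothetical sketch; the pattern is an assumption, not the one used by Tika.
class LinkSketch {

    // Matches http, https and ftp URLs up to the next whitespace character.
    // Trailing punctuation directly after a URL would be included in the match.
    private static final Pattern URL = Pattern.compile("(?:https?|ftp)://\\S+");

    static List<String> extractLinks(String text) {
        List<String> links = new ArrayList<>();
        if (text == null) {
            return links;
        }
        Matcher matcher = URL.matcher(text);
        while (matcher.find()) {
            links.add(matcher.group());
        }
        return links;
    }
}
```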
- extractPhoneNumbers(String) - Static method in class org.apache.tika.sax.CleanPhoneText
-
- extractRootElement(byte[]) - Method in class org.apache.tika.detect.XmlRootExtractor
-
- extractRootElement(InputStream) - Method in class org.apache.tika.detect.XmlRootExtractor
-
- extractStandardReferences(String, double) - Static method in class org.apache.tika.sax.StandardsText
-
Extracts the standard references found within the given text.
- GENRE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The name of the genre."
- Geographic - Interface in org.apache.tika.metadata
-
Geographic schema.
- get(InputStream, TemporaryResources) - Static method in class org.apache.tika.io.TikaInputStream
-
Casts or wraps the given stream to a TikaInputStream instance.
- get(InputStream) - Static method in class org.apache.tika.io.TikaInputStream
-
Casts or wraps the given stream to a TikaInputStream instance.
- get(byte[]) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the given array of bytes.
- get(byte[], Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the given array of bytes.
- get(Path) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the file at the given path.
- get(Path, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the file at the given path.
- get(Path, Metadata, TemporaryResources) - Static method in class org.apache.tika.io.TikaInputStream
-
- get(File) - Static method in class org.apache.tika.io.TikaInputStream
-
- get(File, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
- get(InputStreamFactory) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from a Factory which can create
fresh InputStreams for the same resource multiple times.
- get(InputStreamFactory, TemporaryResources) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from a Factory which can create
fresh InputStreams for the same resource multiple times.
- get(Blob) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the given database BLOB.
- get(Blob, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the given database BLOB.
- get(URI) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the resource at the given URI.
- get(URI, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the resource at the given URI.
- get(URL) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the resource at the given URL.
- get(URL, Metadata) - Static method in class org.apache.tika.io.TikaInputStream
-
Creates a TikaInputStream from the resource at the given URL.
- get(String) - Method in class org.apache.tika.metadata.Metadata
-
Get the value associated with a metadata name.
- get(Property) - Method in class org.apache.tika.metadata.Metadata
-
Returns the value (if any) of the identified metadata property.
- get(String) - Static method in class org.apache.tika.metadata.Property
-
Retrieve the property object that corresponds to the given key
- get(Class<T>) - Method in class org.apache.tika.parser.ParseContext
-
Returns the object in this context that implements the given interface.
- get(Class<T>, T) - Method in class org.apache.tika.parser.ParseContext
-
Returns the object in this context that implements the given interface,
or the given default value if such an object is not found.
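The two lookup variants above follow a type-keyed pattern that can be sketched without Tika on the classpath. ContextSketch below is a hypothetical stand-in, not the ParseContext class; storing entries under the class name is an assumption made for the illustration.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for a type-keyed context; not the Tika ParseContext.
class ContextSketch {

    // Entries are keyed by the name of the interface or class they were set under.
    private final Map<String, Object> entries = new HashMap<>();

    <T> void set(Class<T> key, T value) {
        entries.put(key.getName(), value);
    }

    // Returns the object registered for the given type, or null if none was set.
    <T> T get(Class<T> key) {
        return key.cast(entries.get(key.getName()));
    }

    // Returns the registered object, or the supplied default when none was set.
    <T> T get(Class<T> key, T defaultValue) {
        T value = get(key);
        return value != null ? value : defaultValue;
    }
}
```

With this shape, a caller can ask for a collaborator by type and still get a usable default when nothing was configured.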
- getAcronym() - Method in class org.apache.tika.mime.MimeType
-
Returns an acronym for this mime type.
- getAliases(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Returns the set of known aliases of the given canonical media type.
- getAllComponentParsers() - Method in class org.apache.tika.parser.CompositeParser
-
Returns all parsers registered with the Composite Parser,
including ones which may not currently be active.
- getAllComponentParsers() - Method in class org.apache.tika.parser.DefaultParser
-
- getAllParsers() - Method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
- getAttributesMapping() - Method in class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
-
- getAttrValue(String, Attributes) - Static method in class org.apache.tika.utils.XMLReaderUtils
-
- getBasePath() - Method in class org.apache.tika.pipes.fetcher.fs.FileSystemFetcher
-
- getBaseType() - Method in class org.apache.tika.mime.MediaType
-
Returns the base form of the MediaType, excluding
any parameters, such as "text/plain" for
"text/plain; charset=utf-8"
- getCapacity() - Method in class org.apache.tika.pipes.async.AsyncProcessor
-
- getCause() - Method in exception org.apache.tika.sax.TaggedSAXException
-
Returns the wrapped exception.
- getCharset() - Method in class org.apache.tika.detect.AutoDetectReader
-
- getCharset() - Method in class org.apache.tika.detect.NonDetectingEncodingDetector
-
- getChildTypes(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Returns the set of known children of the given canonical media type
- getChoices() - Method in class org.apache.tika.metadata.Property
-
Returns the (immutable) set of choices for the values of this property.
- getCommand() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets the command to be run.
- getCommand() - Method in class org.apache.tika.parser.external.ExternalParser
-
- getCommandAppendOperator() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets the operator to append rather than replace a value for the command
line tool, i.e.
- getCommandAssignmentDelimeter() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets the delimiter for multiple assignments for the command line tool,
i.e.
- getCommandAssignmentOperator() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets the assignment operator for the command line tool, i.e.
- getCommandMetadataSegments(Metadata) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Constructs a collection of command line arguments responsible for setting
individual metadata fields based on the given metadata.
- getConfidence() - Method in class org.apache.tika.language.detect.LanguageResult
-
- getConfig() - Method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- getContentHandlerFactory() - Method in class org.apache.tika.sax.AbstractRecursiveParserWrapperHandler
-
- getDate(Property) - Method in class org.apache.tika.metadata.Metadata
-
Returns the value of the identified Date based metadata property.
- getDecorationName() - Method in class org.apache.tika.parser.ParserDecorator
-
- getDefaultConfig() - Static method in class org.apache.tika.config.TikaConfig
-
Provides a default configuration (TikaConfig).
- getDefaultDetector(MimeTypes, ServiceLoader) - Static method in class org.apache.tika.config.TikaConfig
-
- getDefaultEncodingDetector(ServiceLoader) - Static method in class org.apache.tika.config.TikaConfig
-
- getDefaultLanguageDetector() - Static method in class org.apache.tika.language.detect.LanguageDetector
-
- getDefaultMimeTypes() - Static method in class org.apache.tika.mime.MimeTypes
-
Get the default MimeTypes.
- getDefaultMimeTypes(ClassLoader) - Static method in class org.apache.tika.mime.MimeTypes
-
Get the default MimeTypes.
- getDefaultRegistry() - Static method in class org.apache.tika.mime.MediaTypeRegistry
-
Returns the built-in media type registry included in Tika.
- getDelegateParser(ParseContext) - Method in class org.apache.tika.parser.DelegatingParser
-
Returns the parser instance to which parsing tasks should be delegated.
- getDelegatingParser() - Method in class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
-
- getDescription() - Method in class org.apache.tika.mime.MimeType
-
Returns the description of this media type.
- getDetector() - Method in class org.apache.tika.config.TikaConfig
-
Returns the configured detector instance.
- getDetector() - Method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- getDetector() - Method in class org.apache.tika.language.detect.LanguageHandler
-
Returns the language detector used by this content handler.
- getDetector() - Method in class org.apache.tika.language.detect.LanguageWriter
-
Returns the language detector used by this writer.
- getDetector() - Method in class org.apache.tika.parser.AutoDetectParser
-
Returns the type detector used by this parser to auto-detect the type
of a document.
- getDetector() - Method in class org.apache.tika.Tika
-
Returns the detector instance used by this facade.
- getDetectors() - Method in class org.apache.tika.detect.CompositeDetector
-
Returns the component detectors.
- getDetectors() - Method in class org.apache.tika.detect.CompositeEncodingDetector
-
- getDetectors() - Method in class org.apache.tika.detect.DefaultDetector
-
- getDetectors() - Method in class org.apache.tika.detect.DefaultProbDetector
-
- getDocumentBuilder() - Method in class org.apache.tika.parser.ParseContext
-
Returns the DOM builder specified in this parsing context.
- getDocumentBuilder() - Static method in class org.apache.tika.utils.XMLReaderUtils
-
Returns the DOM builder specified in this parsing context.
- getDocumentBuilderFactory() - Static method in class org.apache.tika.utils.XMLReaderUtils
-
Returns the DOM builder factory specified in this parsing context.
- getEmbeddedDocumentExtractor(ParseContext) - Static method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
This offers a uniform way to get an EmbeddedDocumentExtractor from a ParseContext.
- getEmitData() - Method in class org.apache.tika.pipes.PipesResult
-
- getEmitKey() - Method in class org.apache.tika.pipes.emitter.EmitData
-
- getEmitKey() - Method in class org.apache.tika.pipes.emitter.EmitKey
-
- getEmitKey() - Method in class org.apache.tika.pipes.FetchEmitTuple
-
- getEmitMaxEstimatedBytes() - Method in class org.apache.tika.pipes.async.AsyncConfig
-
When the emit queue hits this estimated size (sum of
estimated extract sizes), emit the batch.
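The size-triggered batching described above can be sketched as follows. EmitBatchSketch is a hypothetical class written for this illustration; it assumes the batch is flushed as soon as the running sum reaches the threshold, which may differ from how the real async emitter compares sizes.

```java
import java.util.ArrayList;
import java.util.List;

// Hypothetical sketch of size-triggered batching; not the Tika async emitter.
class EmitBatchSketch {

    private final long maxEstimatedBytes;
    private final List<Long> pending = new ArrayList<>();
    private long pendingBytes = 0;
    private int batchesEmitted = 0;

    EmitBatchSketch(long maxEstimatedBytes) {
        this.maxEstimatedBytes = maxEstimatedBytes;
    }

    // Queues one extract by its estimated size and flushes once the
    // sum of queued sizes reaches the configured threshold.
    void offer(long estimatedSizeBytes) {
        pending.add(estimatedSizeBytes);
        pendingBytes += estimatedSizeBytes;
        if (pendingBytes >= maxEstimatedBytes) {
            flush();
        }
    }

    // Stands in for emitting the batch: counts it and clears the queue.
    void flush() {
        if (pending.isEmpty()) {
            return;
        }
        batchesEmitted++;
        pending.clear();
        pendingBytes = 0;
    }

    int batchesEmitted() {
        return batchesEmitted;
    }

    int pendingCount() {
        return pending.size();
    }
}
```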
- getEmitter(String) - Method in class org.apache.tika.pipes.emitter.EmitterManager
-
- getEmitterName() - Method in class org.apache.tika.pipes.emitter.EmitKey
-
- getEmitterName() - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- getEmitWithinMillis() - Method in class org.apache.tika.pipes.async.AsyncConfig
-
- getEncodingDetector() - Method in class org.apache.tika.config.TikaConfig
-
Returns the configured encoding detector instance
- getEncodingDetector(ParseContext) - Method in class org.apache.tika.parser.AbstractEncodingDetectorParser
-
Look for an EncodingDetector in the ParseContext.
- getEncodingDetector() - Method in class org.apache.tika.parser.AbstractEncodingDetectorParser
-
- getEstimatedSizeBytes() - Method in class org.apache.tika.pipes.emitter.EmitData
-
- getExecutorService() - Method in class org.apache.tika.config.TikaConfig
-
- getExtension(TikaInputStream, Metadata) - Method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- getExtension() - Method in class org.apache.tika.mime.MimeType
-
Returns the preferred file extension of this type, or an empty string
if no extensions are known.
- getExtensions() - Method in class org.apache.tika.mime.MimeType
-
Returns the list of all known file extensions of this media type.
- getFallback() - Method in class org.apache.tika.parser.CompositeParser
-
Returns the fallback parser.
- getFetcher(String) - Method in class org.apache.tika.pipes.fetcher.FetcherManager
-
- getFetcherName() - Method in class org.apache.tika.pipes.fetcher.FetchKey
-
- getFetcherName() - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- getFetchKey() - Method in class org.apache.tika.pipes.FetchEmitTuple
-
- getFetchKey() - Method in class org.apache.tika.pipes.fetcher.FetchKey
-
- getField() - Method in class org.apache.tika.config.ParamField
-
- getFile() - Method in class org.apache.tika.io.TikaInputStream
-
- getFileChannel() - Method in class org.apache.tika.io.TikaInputStream
-
- getFilesProcessed() - Method in class org.apache.tika.pipes.PipesClient
-
- getFilteredStackTrace(Throwable) - Static method in class org.apache.tika.utils.ExceptionUtils
-
Simple util to get stack trace.
- getForkedJvmArgs() - Method in class org.apache.tika.pipes.PipesConfigBase
-
- getHandlerConfig() - Method in class org.apache.tika.pipes.FetchEmitTuple
-
- getHandlerConfig() - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- getId() - Method in class org.apache.tika.pipes.FetchEmitTuple
-
- getIdentifier() - Method in class org.apache.tika.sax.StandardReference
-
- getIgnoredLineConsumer() - Method in class org.apache.tika.parser.external.ExternalParser
-
Gets the line consumer.
- getInitializableProblemHandler() - Method in class org.apache.tika.config.ServiceLoader
-
Returns the handler for problems with initializables
- getInputStream() - Method in interface org.apache.tika.io.InputStreamFactory
-
- getInputStreamFactory() - Method in class org.apache.tika.io.TikaInputStream
-
- getInt(Property) - Method in class org.apache.tika.metadata.Metadata
-
Returns the value of the identified Integer based metadata property.
- getIntBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE int value from the beginning of a byte array
- getIntBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE int value from a byte array
- getIntLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE int value from the beginning of a byte array
- getIntLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE int value from a byte array
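The BE and LE variants above differ only in the order in which the four bytes are combined. The sketch below shows that difference in plain Java; EndianSketch and its method names are hypothetical and this is not the EndianUtils source.

```java
// Hypothetical illustration of byte order; not the Tika EndianUtils source.
class EndianSketch {

    // Big-endian: the byte at the lowest index is the most significant.
    static int intBE(byte[] data, int offset) {
        return ((data[offset] & 0xFF) << 24)
                | ((data[offset + 1] & 0xFF) << 16)
                | ((data[offset + 2] & 0xFF) << 8)
                | (data[offset + 3] & 0xFF);
    }

    // Little-endian: the byte at the lowest index is the least significant.
    static int intLE(byte[] data, int offset) {
        return (data[offset] & 0xFF)
                | ((data[offset + 1] & 0xFF) << 8)
                | ((data[offset + 2] & 0xFF) << 16)
                | ((data[offset + 3] & 0xFF) << 24);
    }
}
```

For the bytes 0x12 0x34 0x56 0x78, intBE yields 0x12345678 and intLE yields 0x78563412. The `& 0xFF` mask is needed because Java bytes are signed.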
- getIntValues(Property) - Method in class org.apache.tika.metadata.Metadata
-
Gets the array of ints of the identified "seq" integer metadata property.
- getJavaCommand() - Method in class org.apache.tika.fork.ForkParser
-
- getJavaCommandAsList() - Method in class org.apache.tika.fork.ForkParser
-
Returns the command used to start the forked server process.
- getJavaPath() - Method in class org.apache.tika.pipes.PipesConfigBase
-
- getLanguage() - Method in class org.apache.tika.language.detect.LanguageHandler
-
Returns the detected language based on text handled thus far.
- getLanguage() - Method in class org.apache.tika.language.detect.LanguageResult
-
The ISO 639-1 language code (plus optional country code)
- getLanguage() - Method in class org.apache.tika.language.detect.LanguageWriter
-
Returns the detected language based on text written thus far.
- getLanguageDetectors() - Static method in class org.apache.tika.language.detect.LanguageDetector
-
- getLanguageDetectors(ServiceLoader) - Static method in class org.apache.tika.language.detect.LanguageDetector
-
- getLength() - Method in class org.apache.tika.detect.MagicDetector
-
- getLength() - Method in class org.apache.tika.io.TikaInputStream
-
Returns the length (in bytes) of this stream.
- getLinks() - Method in class org.apache.tika.mime.MimeType
-
Get a list of links to help document this mime type
- getLinks() - Method in class org.apache.tika.sax.LinkContentHandler
-
Returns the list of collected links.
- getLoader() - Method in class org.apache.tika.config.ServiceLoader
-
- getLoadErrorHandler() - Method in class org.apache.tika.config.ServiceLoader
-
Returns the load error handler used by this loader.
- getLongLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE long value from a byte array
- getMacroLanguage(String) - Static method in class org.apache.tika.language.detect.LanguageNames
-
If language is a specific variant of a macro language (e.g.
- getMainOrganizationAcronym() - Method in class org.apache.tika.sax.StandardReference
-
- getMappedTagName() - Method in class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
-
- getMaxEmbeddedResources() - Method in class org.apache.tika.pipes.HandlerConfig
-
- getMaxEntityExpansions() - Static method in class org.apache.tika.utils.XMLReaderUtils
-
- getMaxFilesProcessed() - Method in class org.apache.tika.pipes.PipesConfigBase
-
- getMaxForEmitBatchBytes() - Method in class org.apache.tika.pipes.async.AsyncConfig
-
The maximum size in bytes per extract that
will be allowed in the emit queue.
- getMaximumCompressionRatio() - Method in class org.apache.tika.sax.SecureContentHandler
-
Returns the maximum compression ratio.
- getMaximumDepth() - Method in class org.apache.tika.sax.SecureContentHandler
-
Returns the maximum XML element nesting level.
- getMaximumPackageEntryDepth() - Method in class org.apache.tika.sax.SecureContentHandler
-
Returns the maximum package entry nesting level.
- getMaxStringLength() - Method in class org.apache.tika.Tika
-
Returns the maximum length of strings returned by the
parseToString methods.
- getMaxWaitForClientMillis() - Method in class org.apache.tika.pipes.PipesConfig
-
- getMediaTypeRegistry() - Method in class org.apache.tika.config.TikaConfig
-
- getMediaTypeRegistry() - Method in class org.apache.tika.mime.MimeTypes
-
- getMediaTypeRegistry() - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
-
- getMediaTypeRegistry() - Method in class org.apache.tika.parser.CompositeParser
-
Returns the media type registry used to infer type relationships.
- getMediaTypeRegistry() - Method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
Returns the media type registry used to infer type relationships.
- getMessage() - Method in exception org.apache.tika.exception.WriteLimitReachedException
-
- getMessage() - Method in exception org.apache.tika.pipes.async.OfferLargerThanQueueSize
-
- getMessage() - Method in class org.apache.tika.pipes.PipesResult
-
- getMetadata() - Method in class org.apache.tika.pipes.FetchEmitTuple
-
- getMetadataCommandArguments() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets the map of Metadata keys to command line parameters.
- getMetadataExtractionPatterns() - Method in class org.apache.tika.parser.external.ExternalParser
-
- getMetadataFilter() - Method in class org.apache.tika.config.TikaConfig
-
- getMetadataList() - Method in class org.apache.tika.pipes.emitter.EmitData
-
- getMetadataList() - Method in class org.apache.tika.sax.RecursiveParserWrapperHandler
-
- getMetadataPolicy(Map<String, Param>) - Static method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
- getMetadataPolicy() - Method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
- getMimeRepository() - Method in class org.apache.tika.config.TikaConfig
-
- getMimeType(String) - Method in class org.apache.tika.mime.MimeTypes
-
- getMimeType(File) - Method in class org.apache.tika.mime.MimeTypes
-
- getMimeTypes() - Method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- getMinLength() - Method in class org.apache.tika.detect.TrainedModelDetector
-
- getMinLength() - Method in class org.apache.tika.mime.MimeTypes
-
Return the minimum length of data to provide to analyzing methods based
on the document's content in order to check all the known MimeTypes.
- getMSB() - Method in class org.apache.tika.metadata.MachineMetadata.Endian
-
- getName() - Method in class org.apache.tika.config.Param
-
- getName() - Method in class org.apache.tika.config.ParamField
-
- getName(String) - Static method in class org.apache.tika.io.FilenameUtils
-
This is a duplication of the algorithm and functionality
available in commons io FilenameUtils.
- getName() - Method in class org.apache.tika.metadata.MachineMetadata.Endian
-
- getName() - Method in class org.apache.tika.metadata.Property
-
- getName() - Method in class org.apache.tika.mime.MimeType
-
Returns the name of this media type.
- getName() - Method in class org.apache.tika.pipes.emitter.AbstractEmitter
-
- getName() - Method in interface org.apache.tika.pipes.emitter.Emitter
-
- getName() - Method in class org.apache.tika.pipes.emitter.EmptyEmitter
-
- getName() - Method in class org.apache.tika.pipes.fetcher.AbstractFetcher
-
- getName() - Method in class org.apache.tika.pipes.fetcher.EmptyFetcher
-
- getName() - Method in interface org.apache.tika.pipes.fetcher.Fetcher
-
- getNewContentHandler() - Method in class org.apache.tika.sax.AbstractRecursiveParserWrapperHandler
-
- getNewContentHandler(OutputStream, Charset) - Method in class org.apache.tika.sax.AbstractRecursiveParserWrapperHandler
-
- getNewContentHandler() - Method in class org.apache.tika.sax.BasicContentHandlerFactory
-
- getNewContentHandler(OutputStream, String) - Method in class org.apache.tika.sax.BasicContentHandlerFactory
-
- getNewContentHandler(OutputStream, Charset) - Method in class org.apache.tika.sax.BasicContentHandlerFactory
-
- getNewContentHandler() - Method in interface org.apache.tika.sax.ContentHandlerFactory
-
- getNewContentHandler(OutputStream, String) - Method in interface org.apache.tika.sax.ContentHandlerFactory
-
- getNewContentHandler(OutputStream, Charset) - Method in interface org.apache.tika.sax.ContentHandlerFactory
-
- getNumClients() - Method in class org.apache.tika.pipes.PipesConfigBase
-
- getNumEmitters() - Method in class org.apache.tika.pipes.async.AsyncConfig
-
Number of emitters
- getNumOfHidden() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- getNumOfInputs() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- getNumOfOutputs() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- getOnParseException() - Method in class org.apache.tika.pipes.FetchEmitTuple
-
- getOnParseException() - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- getOpenContainer() - Method in class org.apache.tika.io.TikaInputStream
-
Returns the open container object if any, such as a
POIFS FileSystem in the event of an OLE2 document
being detected and processed by the OLE2 detector.
- getOrganizations() - Static method in class org.apache.tika.sax.StandardOrganizations
-
Returns the map containing the collection of the most important technical standard
organizations.
- getOrganzationsRegex() - Static method in class org.apache.tika.sax.StandardOrganizations
-
Returns the regular expression containing the most important technical standard
organizations.
- getOutputThreshold() - Method in class org.apache.tika.sax.SecureContentHandler
-
Returns the configured output threshold.
- getParameters() - Method in class org.apache.tika.mime.MediaType
-
Returns an immutable sorted map of the parameters of this media type.
- getParams() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- getParser(MediaType) - Method in class org.apache.tika.config.TikaConfig
-
- getParser() - Method in class org.apache.tika.config.TikaConfig
-
Returns the configured parser instance.
- getParser(Metadata) - Method in class org.apache.tika.parser.CompositeParser
-
Returns the parser that best matches the given metadata.
- getParser(Metadata, ParseContext) - Method in class org.apache.tika.parser.CompositeParser
-
- getParser() - Method in class org.apache.tika.Tika
-
Returns the parser instance used by this facade.
- getParserClassname(Parser) - Static method in class org.apache.tika.utils.ParserUtils
-
- getParsers(ParseContext) - Method in class org.apache.tika.parser.CompositeParser
-
- getParsers() - Method in class org.apache.tika.parser.CompositeParser
-
Returns the component parsers.
- getParsers(ParseContext) - Method in class org.apache.tika.parser.DefaultParser
-
- getPassword(Metadata) - Method in interface org.apache.tika.parser.PasswordProvider
-
Looks up the password for a document with the given metadata,
and returns it for the Parser.
- getPasswordProvider() - Method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- getPath() - Method in class org.apache.tika.io.TikaInputStream
-
If the user created this TikaInputStream with a file,
the original file will be returned.
- getPath(int) - Method in class org.apache.tika.io.TikaInputStream
-
- getPoolSize() - Method in class org.apache.tika.fork.ForkParser
-
Returns the size of the process pool.
- getPoolSize() - Static method in class org.apache.tika.utils.XMLReaderUtils
-
- getPosition() - Method in class org.apache.tika.io.TikaInputStream
-
Returns the current position within the stream.
- getPrimaryProperty() - Method in class org.apache.tika.metadata.Property
-
Gets the primary property for a composite property
- getProperties(String) - Static method in class org.apache.tika.metadata.Property
-
- getPropertyType(String) - Static method in class org.apache.tika.metadata.Property
-
Get the type of a property
- getPropertyType() - Method in class org.apache.tika.metadata.Property
-
- getProvider() - Method in class org.apache.tika.parser.digest.InputStreamDigester
-
When subclassing this, be careful to ensure that your provider is
thread-safe (not likely) or return a new provider with each call.
- getQNameAsString(QName) - Static method in class org.apache.tika.sax.ElementMappingContentHandler
-
- getQueueSize() - Method in class org.apache.tika.pipes.async.AsyncConfig
-
FetchEmitTuple queue size
- getQueueSize() - Method in exception org.apache.tika.pipes.async.OfferLargerThanQueueSize
-
- getRawScore() - Method in class org.apache.tika.language.detect.LanguageResult
-
- getRegisteredMimeType(String) - Method in class org.apache.tika.mime.MimeTypes
-
Returns the registered, normalised media type with the given name (or alias).
- getRel() - Method in class org.apache.tika.sax.Link
-
- getResource(Class<T>) - Method in class org.apache.tika.io.TemporaryResources
-
Returns the latest of the tracked resources that implements or
extends the given interface or class.
- getResourceAsStream(String) - Method in class org.apache.tika.config.ServiceLoader
-
Returns an input stream for reading the specified resource from the
configured class loader.
- getSAXParser() - Method in class org.apache.tika.parser.ParseContext
-
Returns the SAX parser specified in this parsing context.
- getSAXParser() - Static method in class org.apache.tika.utils.XMLReaderUtils
-
Returns the SAX parser specified in this parsing context.
- getSAXParserFactory() - Method in class org.apache.tika.parser.ParseContext
-
Returns the SAX parser factory specified in this parsing context.
- getSAXParserFactory() - Static method in class org.apache.tika.utils.XMLReaderUtils
-
Returns the SAX parser factory specified in this parsing context.
- getScore() - Method in class org.apache.tika.sax.StandardReference
-
- getSecondaryExtractProperties() - Method in class org.apache.tika.metadata.Property
-
Gets the secondary properties for a composite property
- getSecondOrganizationAcronym() - Method in class org.apache.tika.sax.StandardReference
-
- getSeparator() - Method in class org.apache.tika.sax.StandardReference
-
- getServiceClass(Class<T>, String) - Method in class org.apache.tika.config.ServiceLoader
-
Loads and returns the named service class that's expected to implement
the given interface.
- getServiceLoader() - Method in class org.apache.tika.config.TikaConfig
-
- getSetter() - Method in class org.apache.tika.config.ParamField
-
- getShortBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE short value from the beginning of a byte array
- getShortBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE short value from a byte array
- getShortLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE short value from the beginning of a byte array
- getShortLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE short value from a byte array
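The difference between the BE and LE variants above can be illustrated with a minimal sketch. The class and method names below (EndianSketch, shortBE, shortLE) are hypothetical and are not part of EndianUtils; the code only shows the byte ordering that "big-endian" and "little-endian" conventionally denote, and makes no claim about how Tika implements these methods.

```java
// Hypothetical helper for illustration only; not Tika's EndianUtils.
final class EndianSketch {

    // Big-endian: the most significant byte comes first in the array.
    static short shortBE(byte[] data, int offset) {
        return (short) (((data[offset] & 0xFF) << 8) | (data[offset + 1] & 0xFF));
    }

    // Little-endian: the least significant byte comes first in the array.
    static short shortLE(byte[] data, int offset) {
        return (short) (((data[offset + 1] & 0xFF) << 8) | (data[offset] & 0xFF));
    }

    public static void main(String[] args) {
        byte[] data = {0x12, 0x34};
        System.out.println(Integer.toHexString(shortBE(data, 0))); // prints 1234
        System.out.println(Integer.toHexString(shortLE(data, 0))); // prints 3412
    }
}
```

The `& 0xFF` mask is needed because Java bytes are signed; without it, a byte such as 0x80 would sign-extend and corrupt the upper bits of the result.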
- getShutdownClientAfterMillis() - Method in class org.apache.tika.pipes.PipesConfigBase
-
- getSizeOffered() - Method in exception org.apache.tika.pipes.async.OfferLargerThanQueueSize
-
- getStackTrace(Throwable) - Static method in class org.apache.tika.utils.ExceptionUtils
-
Get the full stacktrace as a string
- getStartupTimeoutMillis() - Method in class org.apache.tika.pipes.PipesConfigBase
-
- getStatelessParser(ParseContext) - Static method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
Utility function to get the Parser that was sent in to the
ParseContext to handle embedded documents.
- getStatus() - Method in class org.apache.tika.pipes.PipesResult
-
- getSubtype() - Method in class org.apache.tika.mime.MediaType
-
Return the Sub-Type of the MediaType,
such as "plain" for "text/plain"
- getSupertype(MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Returns the supertype of the given type.
- getSupported() - Method in class org.apache.tika.pipes.emitter.EmitterManager
-
- getSupported() - Method in class org.apache.tika.pipes.fetcher.FetcherManager
-
- getSupportedEmbedTypes(ParseContext) - Method in interface org.apache.tika.embedder.Embedder
-
Returns the set of media types supported by this embedder when used with
the given parse context.
- getSupportedEmbedTypes(ParseContext) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
- getSupportedEmbedTypes() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.fork.ForkParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.CompositeParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.CryptoParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.DelegatingParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.EmptyParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.ErrorParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.external.ExternalParser
-
- getSupportedTypes() - Method in class org.apache.tika.parser.external.ExternalParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.NetworkParser
-
- getSupportedTypes(ParseContext) - Method in interface org.apache.tika.parser.Parser
-
Returns the set of media types supported by this parser when used
with the given parse context.
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.ParserDecorator
-
Delegates the method call to the decorated parser.
- getSupportedTypes(ParseContext) - Method in class org.apache.tika.parser.RecursiveParserWrapper
-
- getTag() - Method in exception org.apache.tika.sax.TaggedSAXException
-
Returns the object reference used as the tag of this exception.
- getTail() - Method in class org.apache.tika.io.TailStream
-
Returns an array with the last data read from the underlying stream.
- getText() - Method in class org.apache.tika.sax.Link
-
- getThreshold() - Method in class org.apache.tika.sax.StandardsExtractingContentHandler
-
Gets the threshold to be used for selecting the standard references found
within the text based on their score.
- getTikaConfig() - Method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- getTikaConfig() - Method in class org.apache.tika.pipes.PipesConfigBase
-
- getTimeoutMillis() - Method in class org.apache.tika.pipes.PipesConfigBase
-
- getTitle() - Method in class org.apache.tika.sax.Link
-
- getTransformer() - Method in class org.apache.tika.parser.ParseContext
-
Returns the transformer specified in this parsing context.
- getTransformer() - Static method in class org.apache.tika.utils.XMLReaderUtils
-
Returns a new transformer
- getTranslator() - Method in class org.apache.tika.config.TikaConfig
-
Returns the configured translator instance.
- getTranslator() - Method in class org.apache.tika.language.translate.DefaultTranslator
-
Returns the current translator
- getTranslator() - Method in class org.apache.tika.Tika
-
Returns the translator instance used by this facade.
- getTranslators() - Method in class org.apache.tika.language.translate.DefaultTranslator
-
Returns all available translators
- getType() - Method in class org.apache.tika.config.Param
-
- getType() - Method in class org.apache.tika.config.ParamField
-
- getType() - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- getType() - Method in class org.apache.tika.mime.MediaType
-
Return the Type of the MediaType, such as
"text" for "text/plain"
- getType() - Method in class org.apache.tika.mime.MimeType
-
Returns the normalized media type name.
- getType() - Method in class org.apache.tika.pipes.HandlerConfig
-
- getType() - Method in class org.apache.tika.sax.BasicContentHandlerFactory
-
- getType() - Method in class org.apache.tika.sax.Link
-
- getTypes() - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Returns the set of all known canonical media types.
- getTypeString() - Method in class org.apache.tika.config.Param
-
- getUByte(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get the unsigned value of a byte.
- getUIntBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE unsigned int value from a byte array
- getUIntBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE unsigned int value from a byte array
- getUIntLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE unsigned int value from a byte array
- getUIntLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE unsigned int value from a byte array
- getUniformTypeIdentifier() - Method in class org.apache.tika.mime.MimeType
-
Get the UTI for this mime type.
- getUri() - Method in class org.apache.tika.sax.Link
-
- getUShortBE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE unsigned short value from the beginning of a byte array
- getUShortBE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE unsigned short value from a byte array
- getUShortLE(byte[]) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE unsigned short value from the beginning of a byte array
- getUShortLE(byte[], int) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE unsigned short value from a byte array
- getValue() - Method in class org.apache.tika.config.Param
-
- getValues(Property) - Method in class org.apache.tika.metadata.Metadata
-
Get the values associated to a metadata name.
- getValues(String) - Method in class org.apache.tika.metadata.Metadata
-
Get the values associated to a metadata name.
- getValueType() - Method in class org.apache.tika.metadata.Property
-
- getWrappedParser() - Method in class org.apache.tika.parser.ParserDecorator
-
Gets the parser wrapped by this ParserDecorator
- getWriteLimit() - Method in class org.apache.tika.pipes.HandlerConfig
-
- getWriteLimit() - Method in class org.apache.tika.sax.BasicContentHandlerFactory
-
- getXMLInputFactory() - Method in class org.apache.tika.parser.ParseContext
-
Returns the StAX input factory specified in this parsing context.
- getXMLInputFactory() - Static method in class org.apache.tika.utils.XMLReaderUtils
-
Returns the StAX input factory specified in this parsing context.
- getXMLReader() - Method in class org.apache.tika.parser.ParseContext
-
Returns the XMLReader specified in this parsing context.
- getXMLReader() - Static method in class org.apache.tika.utils.XMLReaderUtils
-
Returns the XMLReader specified in this parsing context.
- GLOB_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- ID - Static variable in interface org.apache.tika.metadata.QuattroPro
-
ID.
- IDENTIFIER - Static variable in interface org.apache.tika.metadata.DublinCore
-
Recommended best practice is to identify the resource by means of
a string or number conforming to a formal identification system.
- IDENTIFIER - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- IDENTIFIER - Static variable in interface org.apache.tika.metadata.XMP
-
An unordered array of text strings that unambiguously identify the resource
within a given context.
- identifyStaticServiceProviders(Class<T>) - Method in class org.apache.tika.config.ServiceLoader
-
Returns the defined static service providers of the given type, without
attempting to load them.
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.DIFContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.LinkContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.SafeContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.SecureContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.TeeContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.TextContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.ToTextContentHandler
-
Writes the given ignorable characters to the given character stream.
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.WriteOutContentHandler
-
- ignorableWhitespace(char[], int, int) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
-
- IGNORE - Static variable in interface org.apache.tika.config.InitializableProblemHandler
-
Strategy that simply ignores all problems.
- IGNORE - Static variable in interface org.apache.tika.config.LoadErrorHandler
-
Strategy that simply ignores all problems.
- IGNORE_ZERO_BYTE_FILE_EXCEPTION - Static variable in exception org.apache.tika.exception.ZeroByteFileException
-
- IgnoreZeroByteFileException() - Constructor for class org.apache.tika.exception.ZeroByteFileException.IgnoreZeroByteFileException
-
- image(String) - Static method in class org.apache.tika.mime.MediaType
-
- IMAGE_COUNT - Static variable in interface org.apache.tika.metadata.Office
-
The number of Images in the document
- IMAGE_CREATOR - Static variable in interface org.apache.tika.metadata.IPTC
-
Creator or creators of the image.
- IMAGE_CREATOR_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
The ID of the creator or creators of the image.
- IMAGE_CREATOR_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
-
- IMAGE_CREATOR_NAME - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of the creator or creators of the image.
- IMAGE_LENGTH - Static variable in interface org.apache.tika.metadata.TIFF
-
"Image height in pixels."
- IMAGE_REGISTRY_ENTRY - Static variable in interface org.apache.tika.metadata.IPTC
-
Both a Registry Item Id and a Registry Organisation Id to record any
registration of this item with a registry.
- IMAGE_SUPPLIER - Static variable in interface org.apache.tika.metadata.IPTC
-
Identifies the most recent supplier of the item, who is not necessarily
its owner or creator.
- IMAGE_SUPPLIER_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
Identifies the most recent supplier of the item, who is not necessarily
its owner or creator.
- IMAGE_SUPPLIER_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
-
- IMAGE_SUPPLIER_IMAGE_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
Optional identifier assigned by the Image Supplier to the image.
- IMAGE_SUPPLIER_NAME - Static variable in interface org.apache.tika.metadata.IPTC
-
Identifies the most recent supplier of the item, who is not necessarily
its owner or creator.
- IMAGE_WIDTH - Static variable in interface org.apache.tika.metadata.TIFF
-
"Image width in pixels."
- IncludeFieldMetadataFilter - Class in org.apache.tika.metadata.filter
-
- IncludeFieldMetadataFilter() - Constructor for class org.apache.tika.metadata.filter.IncludeFieldMetadataFilter
-
- IncludeFieldMetadataFilter(Set<String>) - Constructor for class org.apache.tika.metadata.filter.IncludeFieldMetadataFilter
-
- INFO - Static variable in interface org.apache.tika.config.InitializableProblemHandler
-
Strategy that logs warnings of all problems using a Logger
created using the given class name.
- init(DataInputStream, DataOutputStream) - Method in interface org.apache.tika.fork.ForkProxy
-
- INITIAL_AUTHOR - Static variable in interface org.apache.tika.metadata.Office
-
Name of the initial creator/author of a document
- Initializable - Interface in org.apache.tika.config
-
Components that must do special processing across multiple fields
at initialization time should implement this interface.
- InitializableProblemHandler - Interface in org.apache.tika.config
-
This is to be used to handle potential recoverable problems that
might arise during initialization.
- initialize(Map<String, Param>) - Method in interface org.apache.tika.config.Initializable
-
- initialize(Map<String, Param>) - Method in class org.apache.tika.pipes.fetcher.fs.FileSystemFetcher
-
- initialize(Map<String, Param>) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- INPUT_FILE_TOKEN - Static variable in class org.apache.tika.parser.external.ExternalParser
-
The token, which if present in the Command string, will
be replaced with the input filename.
- InputStreamDigester - Class in org.apache.tika.parser.digest
-
- InputStreamDigester(int, String, DigestingParser.Encoder) - Constructor for class org.apache.tika.parser.digest.InputStreamDigester
-
- InputStreamDigester(int, String, String, DigestingParser.Encoder) - Constructor for class org.apache.tika.parser.digest.InputStreamDigester
-
- InputStreamFactory - Interface in org.apache.tika.io
-
A factory which returns a fresh InputStream for the same resource each time.
- INSTANCE - Static variable in class org.apache.tika.detect.EmptyDetector
-
Singleton instance of this class.
- INSTANCE - Static variable in class org.apache.tika.parser.EmptyParser
-
Singleton instance of this class.
- INSTANCE - Static variable in class org.apache.tika.parser.ErrorParser
-
Singleton instance of this class.
- INSTANCE - Static variable in exception org.apache.tika.sax.StoppingEarlyException
-
- INSTANCE - Static variable in class org.apache.tika.sax.xpath.AttributeMatcher
-
- INSTANCE - Static variable in class org.apache.tika.sax.xpath.ElementMatcher
-
- INSTANCE - Static variable in class org.apache.tika.sax.xpath.NodeMatcher
-
- INSTANCE - Static variable in class org.apache.tika.sax.xpath.TextMatcher
-
- INSTANCEID - Static variable in interface org.apache.tika.metadata.XMPMM
-
An identifier for a specific incarnation of a resource, updated
each time a file is saved.
- inStartElement - Variable in class org.apache.tika.sax.ToXMLContentHandler
-
- INSTITUTION - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- INSTRUCTIONS - Static variable in interface org.apache.tika.metadata.IPTC
-
Any of a number of instructions from the provider or creator to the
receiver of the item.
- INSTRUCTIONS - Static variable in interface org.apache.tika.metadata.Photoshop
-
- INSTRUMENT - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The musical instrument."
- INTELLECTUAL_GENRE - Static variable in interface org.apache.tika.metadata.IPTC
-
Describes the nature, intellectual, artistic or journalistic
characteristic of an item, not specifically its content.
- internalBoolean(String) - Static method in class org.apache.tika.metadata.Property
-
- internalClosedChoise(String, String...) - Static method in class org.apache.tika.metadata.Property
-
- internalDate(String) - Static method in class org.apache.tika.metadata.Property
-
- internalInteger(String) - Static method in class org.apache.tika.metadata.Property
-
- internalIntegerSequence(String) - Static method in class org.apache.tika.metadata.Property
-
- internalOpenChoise(String, String...) - Static method in class org.apache.tika.metadata.Property
-
- internalRational(String) - Static method in class org.apache.tika.metadata.Property
-
- internalReal(String) - Static method in class org.apache.tika.metadata.Property
-
- internalText(String) - Static method in class org.apache.tika.metadata.Property
-
- internalTextBag(String) - Static method in class org.apache.tika.metadata.Property
-
- internalURI(String) - Static method in class org.apache.tika.metadata.Property
-
- INTERPRETED_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- INTERRUPTED_EXCEPTION - Static variable in class org.apache.tika.pipes.PipesResult
-
- IOUtils - Class in org.apache.tika.io
-
- IOUtils() - Constructor for class org.apache.tika.io.IOUtils
-
- IPTC - Interface in org.apache.tika.metadata
-
IPTC photo metadata schema.
- IPTC_LAST_EDITED - Static variable in interface org.apache.tika.metadata.IPTC
-
The date and optionally time when any of the IPTC photo metadata fields
has been last edited
- IS_ENCRYPTED - Static variable in interface org.apache.tika.metadata.PDF
-
- IS_OS_AIX - Static variable in class org.apache.tika.utils.SystemUtils
-
- IS_OS_HP_UX - Static variable in class org.apache.tika.utils.SystemUtils
-
- IS_OS_IRIX - Static variable in class org.apache.tika.utils.SystemUtils
-
- IS_OS_LINUX - Static variable in class org.apache.tika.utils.SystemUtils
-
- IS_OS_MAC - Static variable in class org.apache.tika.utils.SystemUtils
-
- IS_OS_MAC_OSX - Static variable in class org.apache.tika.utils.SystemUtils
-
- IS_OS_OS2 - Static variable in class org.apache.tika.utils.SystemUtils
-
- IS_OS_SOLARIS - Static variable in class org.apache.tika.utils.SystemUtils
-
- IS_OS_SUN_OS - Static variable in class org.apache.tika.utils.SystemUtils
-
- IS_OS_UNIX - Static variable in class org.apache.tika.utils.SystemUtils
-
- IS_OS_WINDOWS - Static variable in class org.apache.tika.utils.SystemUtils
-
- isAnchor() - Method in class org.apache.tika.sax.Link
-
- isAvailable() - Method in class org.apache.tika.language.translate.DefaultTranslator
-
- isAvailable() - Method in class org.apache.tika.language.translate.EmptyTranslator
-
- isAvailable() - Method in interface org.apache.tika.language.translate.Translator
-
- isBlank(String) - Static method in class org.apache.tika.utils.StringUtils
-
- isCauseOf(SAXException) - Method in class org.apache.tika.sax.TaggedContentHandler
-
Tests if the given exception was caused by this handler.
- isDynamic() - Method in class org.apache.tika.config.ServiceLoader
-
Returns whether the service loader is static or dynamic
- isEmpty(CharSequence) - Static method in class org.apache.tika.utils.StringUtils
-
- isEndDocumentWasCalled() - Method in class org.apache.tika.sax.EndDocumentShieldingContentHandler
-
- isExternal() - Method in class org.apache.tika.metadata.Property
-
- isIframe() - Method in class org.apache.tika.sax.Link
-
- isImage() - Method in class org.apache.tika.sax.Link
-
- isInstanceOf(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Checks whether the given media type equals the given base type or
is a specialization of it.
- isInstanceOf(String, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Parses and normalises the given media type string and checks whether
the result equals the given base type or is a specialization of it.
- isInternal() - Method in class org.apache.tika.metadata.Property
-
- isInvalid(int) - Method in class org.apache.tika.sax.SafeContentHandler
-
Checks whether the given Unicode character is an invalid XML character
and should be replaced for output.
- isInvalid(int) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
- isLanguage(String) - Method in class org.apache.tika.language.detect.LanguageResult
-
Return true if the target language matches the detected language.
- isLink() - Method in class org.apache.tika.sax.Link
-
- isMacroLanguage(String) - Static method in class org.apache.tika.language.detect.LanguageNames
-
- isMixedLanguages() - Method in class org.apache.tika.language.detect.LanguageDetector
-
- isMostlyAscii() - Method in class org.apache.tika.detect.TextStatistics
-
Checks whether at least one byte was seen and that the bytes that
were seen were mostly plain text (i.e.
- isMSB() - Method in class org.apache.tika.metadata.MachineMetadata.Endian
-
- isMultiValued(Property) - Method in class org.apache.tika.metadata.Metadata
-
Returns true if named value is multivalued.
- isMultiValued(String) - Method in class org.apache.tika.metadata.Metadata
-
Returns true if named value is multivalued.
- isMultiValuePermitted() - Method in class org.apache.tika.metadata.Property
-
Is the PropertyType one which accepts multiple values?
- ISO_SPEED_RATINGS - Static variable in interface org.apache.tika.metadata.TIFF
-
"ISO Speed and ISO Latitude of the input device as specified in ISO 12232"
- isQuoteAssignmentValues() - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Gets whether or not to quote assignment values, i.e.
- isReasonablyCertain() - Method in class org.apache.tika.language.detect.LanguageResult
-
- ISREGEX_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- isRequired() - Method in class org.apache.tika.config.ParamField
-
- isScript() - Method in class org.apache.tika.sax.Link
-
- isShortText() - Method in class org.apache.tika.language.detect.LanguageDetector
-
- isSpecializationOf(MediaType, MediaType) - Method in class org.apache.tika.mime.MediaTypeRegistry
-
Checks whether the given media type a is a specialization of a more
generic type b.
- isSupported(TikaInputStream) - Method in interface org.apache.tika.extractor.ContainerExtractor
-
Is this Container Extractor able to process the
supplied container?
- isSupported(TikaInputStream) - Method in class org.apache.tika.extractor.ParserContainerExtractor
-
- isSupported(String) - Static method in class org.apache.tika.utils.CharsetUtils
-
Safely return whether the given charset name is supported, without throwing exceptions
- isTikaInputStream(InputStream) - Static method in class org.apache.tika.io.TikaInputStream
-
Checks whether the given stream is a TikaInputStream instance.
- isUnknown() - Method in class org.apache.tika.language.detect.LanguageResult
-
- isValid(String) - Static method in class org.apache.tika.mime.MimeType
-
Checks that the given string is a valid Internet media type name
based on rules from RFC 2054 section 5.3.
- isWriteLimitReached(Throwable) - Static method in exception org.apache.tika.exception.WriteLimitReachedException
-
Checks whether the given exception (or any of its root causes) was
thrown by this handler as a signal of reaching the write limit.
- iterator() - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- LABEL - Static variable in interface org.apache.tika.metadata.XMP
-
A word or short phrase that identifies a resource as a member of a user-defined collection.
- LANGUAGE - Static variable in interface org.apache.tika.metadata.DublinCore
-
A language of the intellectual content of the resource.
- LANGUAGE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- LanguageConfidence - Enum in org.apache.tika.language.detect
-
- LanguageDetector - Class in org.apache.tika.language.detect
-
- LanguageDetector() - Constructor for class org.apache.tika.language.detect.LanguageDetector
-
- LanguageHandler - Class in org.apache.tika.language.detect
-
SAX content handler that updates a language detector based on all the
received character content.
- LanguageHandler() - Constructor for class org.apache.tika.language.detect.LanguageHandler
-
- LanguageHandler(LanguageWriter) - Constructor for class org.apache.tika.language.detect.LanguageHandler
-
- LanguageHandler(LanguageDetector) - Constructor for class org.apache.tika.language.detect.LanguageHandler
-
- LanguageNames - Class in org.apache.tika.language.detect
-
Support for language tags (as defined by https://tools.ietf.org/html/bcp47)
- LanguageNames() - Constructor for class org.apache.tika.language.detect.LanguageNames
-
- LanguageResult - Class in org.apache.tika.language.detect
-
- LanguageResult(String, LanguageConfidence, float) - Constructor for class org.apache.tika.language.detect.LanguageResult
-
- LanguageWriter - Class in org.apache.tika.language.detect
-
Writer that builds a language profile based on all the written content.
- LanguageWriter(LanguageDetector) - Constructor for class org.apache.tika.language.detect.LanguageWriter
-
- LAST_AUTHOR - Static variable in interface org.apache.tika.metadata.Office
-
Name of the last (most recent) author of a document
- LAST_MODIFIED_BY - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
-
The user who performed the last modification.
- LAST_PRINTED - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
-
The date and time of the last printing.
- LATITUDE - Static variable in interface org.apache.tika.metadata.Geographic
-
The WGS84 Latitude of the Point
- LATITUDE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- leftPad(String, int, String) - Static method in class org.apache.tika.utils.StringUtils
-
Left pad a String with a specified String.
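As a rough illustration of what left padding with a string usually means, here is a minimal sketch. The class LeftPadSketch and its null and truncation handling are assumptions chosen for the example; they are not taken from Tika's StringUtils, whose exact edge-case behaviour should be checked in its own documentation.

```java
// Hypothetical helper for illustration only; not Tika's StringUtils.
final class LeftPadSketch {

    // Prepends repetitions of pad until the result is exactly size characters long.
    // Returns the input unchanged if it is null, already long enough, or pad is unusable.
    static String leftPad(String s, int size, String pad) {
        if (s == null || pad == null || pad.isEmpty() || s.length() >= size) {
            return s;
        }
        int missing = size - s.length();
        StringBuilder sb = new StringBuilder(size + pad.length());
        while (sb.length() < missing) {
            sb.append(pad);
        }
        sb.setLength(missing); // trim a partially needed final repetition of pad
        return sb.append(s).toString();
    }

    public static void main(String[] args) {
        System.out.println(leftPad("7", 3, "0"));   // prints 007
        System.out.println(leftPad("ab", 5, "xy")); // prints xyxab
    }
}
```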
- leftPad(String, int, char) - Static method in class org.apache.tika.utils.StringUtils
-
- LICENSE_LOCATION - Static variable in interface org.apache.tika.metadata.CreativeCommons
-
- LICENSE_URL - Static variable in interface org.apache.tika.metadata.CreativeCommons
-
- LICENSOR - Static variable in interface org.apache.tika.metadata.IPTC
-
A person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_CITY - Static variable in interface org.apache.tika.metadata.IPTC
-
The city of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_COUNTRY - Static variable in interface org.apache.tika.metadata.IPTC
-
The country of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_EMAIL - Static variable in interface org.apache.tika.metadata.IPTC
-
The email of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_EXTENDED_ADDRESS - Static variable in interface org.apache.tika.metadata.IPTC
-
The extended address of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
The ID of the person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_ID_WRONG_CASE - Static variable in interface org.apache.tika.metadata.IPTC
-
- LICENSOR_NAME - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of the person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_POSTAL_CODE - Static variable in interface org.apache.tika.metadata.IPTC
-
The postal code of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_REGION - Static variable in interface org.apache.tika.metadata.IPTC
-
The region of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_STREET_ADDRESS - Static variable in interface org.apache.tika.metadata.IPTC
-
The street address of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_TELEPHONE_1 - Static variable in interface org.apache.tika.metadata.IPTC
-
The phone number of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_TELEPHONE_2 - Static variable in interface org.apache.tika.metadata.IPTC
-
The phone number of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LICENSOR_URL - Static variable in interface org.apache.tika.metadata.IPTC
-
The URL of a person or company that should be contacted to obtain a licence for
using the item or who has licensed the item.
- LINE_COUNT - Static variable in interface org.apache.tika.metadata.Office
-
The number of lines in the document
- Link - Class in org.apache.tika.sax
-
- Link(String, String, String, String) - Constructor for class org.apache.tika.sax.Link
-
- Link(String, String, String, String, String) - Constructor for class org.apache.tika.sax.Link
-
- LinkContentHandler - Class in org.apache.tika.sax
-
Content handler that collects links from an XHTML document.
- LinkContentHandler() - Constructor for class org.apache.tika.sax.LinkContentHandler
-
Default constructor
- LinkContentHandler(boolean) - Constructor for class org.apache.tika.sax.LinkContentHandler
-
Default constructor
- LITTLE - Static variable in class org.apache.tika.metadata.MachineMetadata.Endian
-
- load(InputStream) - Static method in class org.apache.tika.config.Param
-
- load(Node) - Static method in class org.apache.tika.config.Param
-
- load(Element, boolean) - Static method in class org.apache.tika.metadata.filter.MetadataFilter
-
Loads the metadata filter from the config file if it exists, otherwise returns NoOpFilter
- load(Path) - Static method in class org.apache.tika.pipes.async.AsyncConfig
-
- load(Path) - Static method in class org.apache.tika.pipes.emitter.EmitterManager
-
- load(Path) - Static method in class org.apache.tika.pipes.fetcher.FetcherManager
-
- load(Path) - Static method in class org.apache.tika.pipes.PipesConfig
-
- loadDefaultModels(InputStream) - Method in class org.apache.tika.detect.NNExampleModelDetector
-
- loadDefaultModels(ClassLoader) - Method in class org.apache.tika.detect.NNExampleModelDetector
-
This method is overridden to register and load neural network models.
- loadDefaultModels(Path) - Method in class org.apache.tika.detect.TrainedModelDetector
-
- loadDefaultModels(File) - Method in class org.apache.tika.detect.TrainedModelDetector
-
- loadDefaultModels(InputStream) - Method in class org.apache.tika.detect.TrainedModelDetector
-
- loadDefaultModels(ClassLoader) - Method in class org.apache.tika.detect.TrainedModelDetector
-
- loadDynamicServiceProviders(Class<T>) - Method in class org.apache.tika.config.ServiceLoader
-
Returns the available dynamic service providers of the given type.
- LoadErrorHandler - Interface in org.apache.tika.config
-
Interface for error handling strategies in service class loading.
- loadModels() - Method in class org.apache.tika.language.detect.LanguageDetector
-
Load (or re-load) all available language models.
- loadModels(Set<String>) - Method in class org.apache.tika.language.detect.LanguageDetector
-
Load (or re-load) the models for the languages specified in the given set.
- loadServiceProviders(Class<T>) - Method in class org.apache.tika.config.ServiceLoader
-
Returns all the available service providers of the given type.
- loadStaticServiceProviders(Class<T>) - Method in class org.apache.tika.config.ServiceLoader
-
- loadStaticServiceProviders(Class<T>, Collection<Class<? extends T>>) - Method in class org.apache.tika.config.ServiceLoader
-
Returns the available static service providers of the given type.
- LOCAL_NAME_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- LOCATION - Static variable in interface org.apache.tika.metadata.HttpHeaders
-
- LOCATION_CREATED - Static variable in interface org.apache.tika.metadata.IPTC
-
The location where the content of the item was created.
- LOCATION_CREATED_CITY - Static variable in interface org.apache.tika.metadata.IPTC
-
Name of the city of a location.
- LOCATION_CREATED_COUNTRY_CODE - Static variable in interface org.apache.tika.metadata.IPTC
-
The ISO code of a country of a location.
- LOCATION_CREATED_COUNTRY_NAME - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of a country of a location.
- LOCATION_CREATED_PROVINCE_OR_STATE - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of a subregion of a country - a province or state - of a
location.
- LOCATION_CREATED_SUBLOCATION - Static variable in interface org.apache.tika.metadata.IPTC
-
Name of a sublocation.
- LOCATION_CREATED_WORLD_REGION - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of a world region of a location.
- LOCATION_SHOWN - Static variable in interface org.apache.tika.metadata.IPTC
-
A location the content of the item is about.
- LOCATION_SHOWN_CITY - Static variable in interface org.apache.tika.metadata.IPTC
-
Name of the city of a location.
- LOCATION_SHOWN_COUNTRY_CODE - Static variable in interface org.apache.tika.metadata.IPTC
-
The ISO code of a country of a location.
- LOCATION_SHOWN_COUNTRY_NAME - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of a country of a location.
- LOCATION_SHOWN_PROVINCE_OR_STATE - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of a subregion of a country - a province or state - of a
location.
- LOCATION_SHOWN_SUBLOCATION - Static variable in interface org.apache.tika.metadata.IPTC
-
Name of a sublocation.
- LOCATION_SHOWN_WORLD_REGION - Static variable in interface org.apache.tika.metadata.IPTC
-
The name of a world region of a location.
- LOG_COMMENT - Static variable in interface org.apache.tika.metadata.XMPDM
-
"User's log comments."
- LogGobbler(InputStream) - Constructor for class org.apache.tika.pipes.PipesClient.LogGobbler
-
- LONGITUDE - Static variable in interface org.apache.tika.metadata.Geographic
-
The WGS84 Longitude of the Point
- LONGITUDE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- LookaheadInputStream - Class in org.apache.tika.io
-
Stream wrapper that makes it easy to read up to n bytes ahead from
a stream that supports the mark feature.
- LookaheadInputStream(InputStream, int) - Constructor for class org.apache.tika.io.LookaheadInputStream
-
Creates a lookahead wrapper for the given input stream.
- looksLikeUTF8() - Method in class org.apache.tika.detect.TextStatistics
-
Checks whether the observed byte stream looks like UTF-8 encoded text.
- LOOP - Static variable in interface org.apache.tika.metadata.XMPDM
-
"When true, the clip can be looped seamlessly."
- LOWEST_VERSION - Static variable in interface org.apache.tika.metadata.QuattroPro
-
Lowest version.
- MACHINE_ALPHA - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_ARM - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_EFI - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_IA_64 - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_M32R - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_M68K - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_M88K - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_MIPS - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_PPC - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_S370 - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_S390 - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_SH3 - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_SH4 - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_SH5 - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_SPARC - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_TYPE - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_UNKNOWN - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_VAX - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_x86_32 - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MACHINE_x86_64 - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- MachineMetadata - Interface in org.apache.tika.metadata
-
Metadata for describing machines, such as their
architecture, type and endian-ness
- MachineMetadata.Endian - Class in org.apache.tika.metadata
-
- magic_neg(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
-
- MAGIC_PRIORITY_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- MAGIC_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- magic_trust(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
-
- MagicDetector - Class in org.apache.tika.detect
-
Content type detection based on magic bytes.
- MagicDetector(MediaType, byte[]) - Constructor for class org.apache.tika.detect.MagicDetector
-
Creates a detector for input documents that have the exact given byte
pattern at the beginning of the document stream.
- MagicDetector(MediaType, byte[], int) - Constructor for class org.apache.tika.detect.MagicDetector
-
Creates a detector for input documents that have the exact given byte
pattern at the given offset of the document stream.
- MagicDetector(MediaType, byte[], byte[], int, int) - Constructor for class org.apache.tika.detect.MagicDetector
-
Creates a detector for input documents that meet the specified magic
match.
- MagicDetector(MediaType, byte[], byte[], boolean, int, int) - Constructor for class org.apache.tika.detect.MagicDetector
-
Creates a detector for input documents that meet the specified
magic match.
- MagicDetector(MediaType, byte[], byte[], boolean, boolean, int, int) - Constructor for class org.apache.tika.detect.MagicDetector
-
Creates a detector for input documents that meet the specified
magic match.
- main(String[]) - Static method in class org.apache.tika.pipes.PipesServer
-
- MAJOR_VERSION - Static variable in interface org.apache.tika.metadata.WordPerfect
-
Major version.
- makeName(String, String, String) - Static method in class org.apache.tika.language.detect.LanguageNames
-
- MANAGER - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- mapAttributes(Attributes) - Method in class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
-
- MAPI_FROM_REPRESENTING_EMAIL - Static variable in interface org.apache.tika.metadata.Office
-
- MAPI_FROM_REPRESENTING_NAME - Static variable in interface org.apache.tika.metadata.Office
-
- MAPI_MESSAGE_CLASS - Static variable in interface org.apache.tika.metadata.Office
-
MAPI message class.
- MAPI_MESSAGE_CLIENT_SUBMIT_TIME - Static variable in interface org.apache.tika.metadata.Office
-
- MAPI_SENT_BY_SERVER_TYPE - Static variable in interface org.apache.tika.metadata.Office
-
- MappedBufferCleaner - Class in org.apache.tika.io
-
Copied/pasted from the Apache Lucene/Solr project.
- MappedBufferCleaner() - Constructor for class org.apache.tika.io.MappedBufferCleaner
-
- mark(int) - Method in class org.apache.tika.io.BoundedInputStream
-
- mark(int) - Method in class org.apache.tika.io.LookaheadInputStream
-
- mark(int) - Method in class org.apache.tika.io.TailStream
-
This implementation saves the internal state including the
content of the tail buffer so that it can be restored when reset() is
called later.
- mark(int) - Method in class org.apache.tika.io.TikaInputStream
-
- MARKED - Static variable in interface org.apache.tika.metadata.XMPRights
-
When true, indicates that this is a rights-managed resource.
- markSupported() - Method in class org.apache.tika.io.LookaheadInputStream
-
- markSupported() - Method in class org.apache.tika.io.TikaInputStream
-
- MATCH_MASK_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- MATCH_MINSHOULDMATCH_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- MATCH_OFFSET_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- MATCH_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- MATCH_TYPE_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- MATCH_VALUE_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- Matcher - Class in org.apache.tika.sax.xpath
-
XPath element matcher.
- Matcher() - Constructor for class org.apache.tika.sax.xpath.Matcher
-
- matches(byte[]) - Method in class org.apache.tika.mime.MimeType
-
- matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.AttributeMatcher
-
- matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.CompositeMatcher
-
- matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.Matcher
-
Returns true if the XPath expression matches the named
attribute of the element associated with this evaluation state.
- matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.NamedAttributeMatcher
-
- matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.NodeMatcher
-
- matchesAttribute(String, String) - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
-
- matchesElement() - Method in class org.apache.tika.sax.xpath.CompositeMatcher
-
- matchesElement() - Method in class org.apache.tika.sax.xpath.ElementMatcher
-
- matchesElement() - Method in class org.apache.tika.sax.xpath.Matcher
-
Returns true if the XPath expression matches
the element associated with this evaluation state.
- matchesElement() - Method in class org.apache.tika.sax.xpath.NodeMatcher
-
- matchesElement() - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
-
- matchesMagic(byte[]) - Method in class org.apache.tika.mime.MimeType
-
- matchesText() - Method in class org.apache.tika.sax.xpath.CompositeMatcher
-
- matchesText() - Method in class org.apache.tika.sax.xpath.Matcher
-
Returns true if the XPath expression matches all text
nodes whose parent is the element associated with this evaluation
state.
- matchesText() - Method in class org.apache.tika.sax.xpath.NodeMatcher
-
- matchesText() - Method in class org.apache.tika.sax.xpath.SubtreeMatcher
-
- matchesText() - Method in class org.apache.tika.sax.xpath.TextMatcher
-
- MatchingContentHandler - Class in org.apache.tika.sax.xpath
-
Content handler decorator that only passes the elements, attributes,
and text nodes that match the given XPath expression.
- MatchingContentHandler(ContentHandler, Matcher) - Constructor for class org.apache.tika.sax.xpath.MatchingContentHandler
-
- MAX_AVAIL_HEIGHT - Static variable in interface org.apache.tika.metadata.IPTC
-
The maximum available height in pixels of the original photo from which
this photo has been derived by downsizing.
- MAX_AVAIL_WIDTH - Static variable in interface org.apache.tika.metadata.IPTC
-
The maximum available width in pixels of the original photo from which
this photo has been derived by downsizing.
- MediaType - Class in org.apache.tika.mime
-
Internet media type.
- MediaType(String, String, Map<String, String>) - Constructor for class org.apache.tika.mime.MediaType
-
- MediaType(String, String) - Constructor for class org.apache.tika.mime.MediaType
-
- MediaType(MediaType, Map<String, String>) - Constructor for class org.apache.tika.mime.MediaType
-
- MediaType(MediaType, String, String) - Constructor for class org.apache.tika.mime.MediaType
-
Creates a media type by adding a parameter to a base type.
- MediaType(MediaType, Charset) - Constructor for class org.apache.tika.mime.MediaType
-
Creates a media type by adding the "charset" parameter to a base type.
- MediaTypeRegistry - Class in org.apache.tika.mime
-
Registry of known Internet media types.
- MediaTypeRegistry() - Constructor for class org.apache.tika.mime.MediaTypeRegistry
-
- mergeMetadata(Metadata, Metadata, AbstractMultipleParser.MetadataPolicy) - Static method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
- Message - Interface in org.apache.tika.metadata
-
A collection of Message related property names.
- MESSAGE_BCC - Static variable in interface org.apache.tika.metadata.Message
-
- MESSAGE_BCC_DISPLAY_NAME - Static variable in interface org.apache.tika.metadata.Message
-
- MESSAGE_BCC_EMAIL - Static variable in interface org.apache.tika.metadata.Message
-
Where possible, this records the email value in the bcc field.
- MESSAGE_BCC_NAME - Static variable in interface org.apache.tika.metadata.Message
-
In Outlook messages, there are sometimes separate fields for "bcc-name" and
"bcc-display-name".
- MESSAGE_CC - Static variable in interface org.apache.tika.metadata.Message
-
- MESSAGE_CC_DISPLAY_NAME - Static variable in interface org.apache.tika.metadata.Message
-
- MESSAGE_CC_EMAIL - Static variable in interface org.apache.tika.metadata.Message
-
Where possible, this records the email value in the cc field.
- MESSAGE_CC_NAME - Static variable in interface org.apache.tika.metadata.Message
-
In Outlook messages, there are sometimes separate fields for "cc-name" and
"cc-display-name".
- MESSAGE_FROM - Static variable in interface org.apache.tika.metadata.Message
-
- MESSAGE_FROM_EMAIL - Static variable in interface org.apache.tika.metadata.Message
-
Where possible, this records the email value in the from field.
- MESSAGE_FROM_NAME - Static variable in interface org.apache.tika.metadata.Message
-
Where possible, this records the value from the name field.
- MESSAGE_PREFIX - Static variable in interface org.apache.tika.metadata.Message
-
- MESSAGE_RAW_HEADER_PREFIX - Static variable in interface org.apache.tika.metadata.Message
-
- MESSAGE_RECIPIENT_ADDRESS - Static variable in interface org.apache.tika.metadata.Message
-
- MESSAGE_TO - Static variable in interface org.apache.tika.metadata.Message
-
- MESSAGE_TO_DISPLAY_NAME - Static variable in interface org.apache.tika.metadata.Message
-
- MESSAGE_TO_EMAIL - Static variable in interface org.apache.tika.metadata.Message
-
Where possible, this records the email value in the to field.
- MESSAGE_TO_NAME - Static variable in interface org.apache.tika.metadata.Message
-
In Outlook messages, there are sometimes separate fields for "to-name" and
"to-display-name".
- meta_neg(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
-
- meta_trust(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
-
- Metadata - Class in org.apache.tika.metadata
-
A multi-valued metadata container.
- Metadata() - Constructor for class org.apache.tika.metadata.Metadata
-
Constructs a new, empty metadata.
- metadata(Metadata) - Method in class org.apache.tika.sax.XMPContentHandler
-
- METADATA_COMMAND_ARGUMENTS_SERIALIZED_TOKEN - Static variable in class org.apache.tika.embedder.ExternalEmbedder
-
Token to be replaced with a String array of metadata assignment command
arguments
- METADATA_COMMAND_ARGUMENTS_TOKEN - Static variable in class org.apache.tika.embedder.ExternalEmbedder
-
Token to be replaced with a String array of metadata assignment command
arguments
- METADATA_DATE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- METADATA_DATE - Static variable in interface org.apache.tika.metadata.XMP
-
The date and time that any metadata for this resource was last changed.
- METADATA_KEY_ATTR - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- METADATA_MATCH_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- METADATA_MOD_DATE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The date and time when the metadata was last modified."
- METADATA_POLICY_CONFIG_KEY - Static variable in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
- METADATA_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- MetadataFilter - Class in org.apache.tika.metadata.filter
-
Filters the metadata in place after the parse
- MetadataFilter() - Constructor for class org.apache.tika.metadata.filter.MetadataFilter
-
- metadataList - Variable in class org.apache.tika.sax.RecursiveParserWrapperHandler
-
- MIDDAY - Static variable in class org.apache.tika.utils.DateUtils
-
Custom time zone used to interpret date values without a time
component in a way that most likely falls within the same day
regardless of the time zone in which it is later interpreted.
- MIME_INFO_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- MIME_TYPE_MAGIC - Static variable in interface org.apache.tika.metadata.TikaMimeKeys
-
- MIME_TYPE_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- MIME_TYPE_TYPE_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- MimeType - Class in org.apache.tika.mime
-
Internet media type.
- MIMETYPE_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- MimeTypeException - Exception in org.apache.tika.mime
-
A class to encapsulate MimeType related exceptions.
- MimeTypeException(String) - Constructor for exception org.apache.tika.mime.MimeTypeException
-
Constructs a MimeTypeException with the specified detail message.
- MimeTypeException(String, Throwable) - Constructor for exception org.apache.tika.mime.MimeTypeException
-
Constructs a MimeTypeException with the specified detail message
and root cause.
- MimeTypes - Class in org.apache.tika.mime
-
This class is a MimeType repository.
- MimeTypes() - Constructor for class org.apache.tika.mime.MimeTypes
-
- MIMETYPES_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- MimeTypesFactory - Class in org.apache.tika.mime
-
Creates instances of MimeTypes.
- MimeTypesFactory() - Constructor for class org.apache.tika.mime.MimeTypesFactory
-
- MimeTypesReader - Class in org.apache.tika.mime
-
A reader for XML files compliant with the freedesktop MIME-info DTD.
- MimeTypesReader(MimeTypes) - Constructor for class org.apache.tika.mime.MimeTypesReader
-
- MimeTypesReaderMetKeys - Interface in org.apache.tika.mime
-
- MINOR_MODEL_AGE_DISCLOSURE - Static variable in interface org.apache.tika.metadata.IPTC
-
Age of the youngest model pictured in the image, at the time that the
image was made.
- MINOR_VERSION - Static variable in interface org.apache.tika.metadata.WordPerfect
-
Minor version.
- mixedLanguages - Variable in class org.apache.tika.language.detect.LanguageDetector
-
- MODEL_AGE - Static variable in interface org.apache.tika.metadata.IPTC
-
Age of the human model(s) at the time this image was taken in a model
released image.
- MODEL_NAME_ENGLISH - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- MODEL_RELEASE_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
Optional identifier associated with each Model Release.
- MODEL_RELEASE_STATUS - Static variable in interface org.apache.tika.metadata.IPTC
-
Summarizes the availability and scope of model releases authorizing usage
of the likenesses of persons appearing in the photograph.
- MODIFIED - Static variable in interface org.apache.tika.metadata.DublinCore
-
Date on which the resource was changed.
- MODIFIED - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- modifiedService(ServiceReference, Object) - Method in class org.apache.tika.config.TikaActivator
-
- MODIFIER - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- MODIFY_DATE - Static variable in interface org.apache.tika.metadata.XMP
-
The date and time the resource was last modified.
- MULTIPART_BOUNDARY - Static variable in interface org.apache.tika.metadata.Message
-
- MULTIPART_SUBTYPE - Static variable in interface org.apache.tika.metadata.Message
-
- mustNotBeEmpty(String, String) - Static method in class org.apache.tika.config.TikaConfig
-
- mustNotBeEmpty(String, Path) - Static method in class org.apache.tika.config.TikaConfig
-
- PAGE_COUNT - Static variable in interface org.apache.tika.metadata.Office
-
The number of pages in the (paged) document
- PagedText - Interface in org.apache.tika.metadata
-
XMP Paged-text schema.
- PARAGRAPH_COUNT - Static variable in interface org.apache.tika.metadata.Office
-
The number of individual Paragraphs in the document
- Param<T> - Class in org.apache.tika.config
-
This is a serializable model class for parameters from a configuration file.
- Param() - Constructor for class org.apache.tika.config.Param
-
- Param(String, Class<T>, T) - Constructor for class org.apache.tika.config.Param
-
- Param(String, T) - Constructor for class org.apache.tika.config.Param
-
- ParamField - Class in org.apache.tika.config
-
This class stores metadata for Field annotations, which are used to map
annotated members to Param at runtime.
- ParamField(AccessibleObject) - Constructor for class org.apache.tika.config.ParamField
-
Creates a ParamField object
- parse(MediaType, String, String, String, String) - Static method in class org.apache.tika.detect.MagicDetector
-
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.fork.ForkParser
-
This sends the objects to the server for parsing, and the server via
the proxies acts on the handler as if it were updating it directly.
- parse(String) - Static method in class org.apache.tika.mime.MediaType
-
Parses the given string to a media type.
- parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.AbstractParser
-
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.AutoDetectParser
-
- parse(InputStream, ContentHandler, Metadata) - Method in class org.apache.tika.parser.AutoDetectParser
-
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.CompositeParser
-
Delegates the call to the matching component parser.
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.CryptoParser
-
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.DelegatingParser
-
Looks up the delegate parser from the parsing context and
delegates the parse operation to it.
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.DigestingParser
-
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.EmptyParser
-
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.ErrorParser
-
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.external.ExternalParser
-
Executes the configured external command and passes the given document
stream as a simple XHTML document to the given SAX content handler.
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
Processes the given Stream through one or more parsers,
resetting things between parsers as requested by policy.
- parse(InputStream, ContentHandlerFactory, Metadata, ParseContext) - Method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.NetworkParser
-
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in interface org.apache.tika.parser.Parser
-
Parses a document stream into a sequence of XHTML SAX events.
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.ParserDecorator
-
Delegates the method call to the decorated parser.
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.ParserPostProcessor
-
Forwards the call to the delegated parser and post-processes the
results as described above.
- parse(InputStream, ContentHandler, Metadata, ParseContext) - Method in class org.apache.tika.parser.RecursiveParserWrapper
-
- parse(FetchEmitTuple) - Method in class org.apache.tika.pipes.PipesParser
-
- parse(String) - Method in class org.apache.tika.sax.xpath.XPathParser
-
Parses the given simple XPath expression to an evaluation state
initialized at the document node.
- parse(InputStream, Metadata) - Method in class org.apache.tika.Tika
-
Parses the given document and returns the extracted text content.
- parse(InputStream) - Method in class org.apache.tika.Tika
-
Parses the given document and returns the extracted text content.
- parse(Path, Metadata) - Method in class org.apache.tika.Tika
-
Parses the file at the given path and returns the extracted text content.
- parse(Path) - Method in class org.apache.tika.Tika
-
Parses the file at the given path and returns the extracted text content.
- parse(File, Metadata) - Method in class org.apache.tika.Tika
-
Parses the given file and returns the extracted text content.
- parse(File) - Method in class org.apache.tika.Tika
-
Parses the given file and returns the extracted text content.
- parse(URL) - Method in class org.apache.tika.Tika
-
Parses the resource at the given URL and returns the extracted
text content.
- PARSE_EXCEPTION_EMIT - Static variable in class org.apache.tika.pipes.PipesServer
-
This will return the metadata list
- PARSE_EXCEPTION_NO_EMIT - Static variable in class org.apache.tika.pipes.PipesServer
-
This will return the parse exception stack trace
- PARSE_SUCCESS - Static variable in class org.apache.tika.pipes.PipesServer
-
- PARSE_TIME_MILLIS - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- ParseContext - Class in org.apache.tika.parser
-
Parse context.
- ParseContext() - Constructor for class org.apache.tika.parser.ParseContext
-
- parseEmbedded(InputStream, ContentHandler, Metadata, boolean) - Method in interface org.apache.tika.extractor.EmbeddedDocumentExtractor
-
Processes the supplied embedded resource, calling the delegating
parser with the appropriate details.
- parseEmbedded(InputStream, ContentHandler, Metadata, boolean) - Method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- parseEmbedded(InputStream, ContentHandler, Metadata, boolean) - Method in class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
-
- parseHandlerType(String, BasicContentHandlerFactory.HANDLER_TYPE) - Static method in class org.apache.tika.sax.BasicContentHandlerFactory
-
Tries to parse string into handler type.
- Parser - Interface in org.apache.tika.parser
-
Tika parser interface.
- PARSER_TAG - Static variable in interface org.apache.tika.parser.external.ExternalParsersConfigReaderMetKeys
-
- parserCompleted(Parser, Metadata, ContentHandler, ParseContext, Exception) - Method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
Used to notify implementations that a Parser has Finished
or Failed, and to allow them to decide to continue or
abort further parsing
- parserCompleted(Parser, Metadata, ContentHandler, ParseContext, Exception) - Method in class org.apache.tika.parser.multiple.FallbackParser
-
- parserCompleted(Parser, Metadata, ContentHandler, ParseContext, Exception) - Method in class org.apache.tika.parser.multiple.SupplementingParser
-
- ParserContainerExtractor - Class in org.apache.tika.extractor
-
- ParserContainerExtractor() - Constructor for class org.apache.tika.extractor.ParserContainerExtractor
-
- ParserContainerExtractor(TikaConfig) - Constructor for class org.apache.tika.extractor.ParserContainerExtractor
-
- ParserContainerExtractor(Parser, Detector) - Constructor for class org.apache.tika.extractor.ParserContainerExtractor
-
- ParserDecorator - Class in org.apache.tika.parser
-
Decorator base class for the Parser interface.
- ParserDecorator(Parser) - Constructor for class org.apache.tika.parser.ParserDecorator
-
Creates a decorator for the given parser.
- ParserFactory - Class in org.apache.tika.parser
-
- ParserFactory(Map<String, String>) - Constructor for class org.apache.tika.parser.ParserFactory
-
- ParserFactoryFactory - Class in org.apache.tika.fork
-
Lightweight, easily serializable class that contains enough information
to build a ParserFactory.
- ParserFactoryFactory(String, Map<String, String>) - Constructor for class org.apache.tika.fork.ParserFactoryFactory
-
- ParserPostProcessor - Class in org.apache.tika.parser
-
Parser decorator that post-processes the results from a decorated parser.
- ParserPostProcessor(Parser) - Constructor for class org.apache.tika.parser.ParserPostProcessor
-
Creates a post-processing decorator for the given parser.
- parserPrepare(Parser, Metadata, ParseContext) - Method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
Used to allow implementations to prepare or change things
before parsing occurs
- ParserUtils - Class in org.apache.tika.utils
-
Helper util methods for Parsers themselves.
- ParserUtils() - Constructor for class org.apache.tika.utils.ParserUtils
-
- parseSAX(InputStream, DefaultHandler, ParseContext) - Static method in class org.apache.tika.utils.XMLReaderUtils
-
This checks context for a user-specified SAXParser.
- parseToString(InputStream, Metadata) - Method in class org.apache.tika.Tika
-
Parses the given document and returns the extracted text content.
- parseToString(InputStream, Metadata, int) - Method in class org.apache.tika.Tika
-
Parses the given document and returns the extracted text content.
- parseToString(InputStream) - Method in class org.apache.tika.Tika
-
Parses the given document and returns the extracted text content.
- parseToString(Path) - Method in class org.apache.tika.Tika
-
Parses the file at the given path and returns the extracted text content.
- parseToString(File) - Method in class org.apache.tika.Tika
-
Parses the given file and returns the extracted text content.
- parseToString(URL) - Method in class org.apache.tika.Tika
-
Parses the resource at the given URL and returns the extracted
text content.
- ParsingEmbeddedDocumentExtractor - Class in org.apache.tika.extractor
-
Helper class for parsers of package archives or other compound document
formats that support embedded or attached component documents.
- ParsingEmbeddedDocumentExtractor(ParseContext) - Constructor for class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
-
- ParsingReader - Class in org.apache.tika.parser
-
Reader for the text content from a given binary stream.
- ParsingReader(InputStream) - Constructor for class org.apache.tika.parser.ParsingReader
-
Creates a reader for the text content of the given binary stream.
- ParsingReader(InputStream, String) - Constructor for class org.apache.tika.parser.ParsingReader
-
Creates a reader for the text content of the given binary stream
with the given name.
- ParsingReader(Path) - Constructor for class org.apache.tika.parser.ParsingReader
-
Creates a reader for the text content of the file at the given path.
- ParsingReader(File) - Constructor for class org.apache.tika.parser.ParsingReader
-
Creates a reader for the text content of the given file.
- ParsingReader(Parser, InputStream, Metadata, ParseContext) - Constructor for class org.apache.tika.parser.ParsingReader
-
Creates a reader for the text content of the given binary stream
with the given document metadata.
- ParsingReader(Parser, InputStream, Metadata, ParseContext, Executor) - Constructor for class org.apache.tika.parser.ParsingReader
-
Creates a reader for the text content of the given binary stream
with the given document metadata.
- PasswordProvider - Interface in org.apache.tika.parser
-
Interface for providing a password to a Parser for handling Encrypted
and Password Protected Documents.
- PATTERN_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- PDF - Interface in org.apache.tika.metadata
-
PDF properties collection.
- PDF_DOC_INFO_CUSTOM_PREFIX - Static variable in interface org.apache.tika.metadata.PDF
-
- PDF_DOC_INFO_PREFIX - Static variable in interface org.apache.tika.metadata.PDF
-
Prefix to be used for properties that record what was stored
in the docinfo section (as opposed to XMP)
- PDF_EXTENSION_VERSION - Static variable in interface org.apache.tika.metadata.PDF
-
- PDF_PREFIX - Static variable in interface org.apache.tika.metadata.PDF
-
- PDF_PREFLIGHT_PREFIX - Static variable in interface org.apache.tika.metadata.PDF
-
- PDF_VERSION - Static variable in interface org.apache.tika.metadata.PDF
-
- PDFA_PREFIX - Static variable in interface org.apache.tika.metadata.PDF
-
- PDFA_VERSION - Static variable in interface org.apache.tika.metadata.PDF
-
- PDFAID_CONFORMANCE - Static variable in interface org.apache.tika.metadata.PDF
-
- PDFAID_PART - Static variable in interface org.apache.tika.metadata.PDF
-
- PDFAID_PREFIX - Static variable in interface org.apache.tika.metadata.PDF
-
- peek(byte[]) - Method in class org.apache.tika.io.TikaInputStream
-
Fills the given buffer with upcoming bytes from this stream without
advancing the current stream position.
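The peek behaviour described above can be approximated with the standard mark/reset contract of java.io.InputStream. The following is a minimal sketch under that assumption, not Tika's implementation: PeekSketch and its peek helper are hypothetical names, and the wrapped stream is assumed to support marking.

```java
import java.io.IOException;
import java.io.InputStream;

// Hypothetical helper, not part of Tika: illustrates peeking via mark/reset.
final class PeekSketch {
    private PeekSketch() {}

    // Fills buffer with upcoming bytes, then restores the stream position.
    // Returns the number of bytes actually peeked, which may be fewer than
    // buffer.length if the stream ends early.
    static int peek(InputStream in, byte[] buffer) throws IOException {
        if (!in.markSupported()) {
            throw new IllegalArgumentException("stream must support mark/reset");
        }
        in.mark(buffer.length);
        int total = 0;
        try {
            while (total < buffer.length) {
                int n = in.read(buffer, total, buffer.length - total);
                if (n == -1) {
                    break; // end of stream reached before the buffer was full
                }
                total += n;
            }
        } finally {
            in.reset(); // rewind so the caller sees the same bytes again
        }
        return total;
    }
}
```

After a call such as PeekSketch.peek(stream, new byte[8]), the next read on the same stream should still return the first peeked byte.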
- PERSON - Static variable in interface org.apache.tika.metadata.IPTC
-
Name of a person the content of the item is about.
- PhoneExtractingContentHandler - Class in org.apache.tika.sax
-
Class used to extract phone numbers while parsing.
- PhoneExtractingContentHandler(ContentHandler, Metadata) - Constructor for class org.apache.tika.sax.PhoneExtractingContentHandler
-
Creates a decorator for the given SAX event handler and Metadata object.
- PhoneExtractingContentHandler() - Constructor for class org.apache.tika.sax.PhoneExtractingContentHandler
-
Creates a decorator that by default forwards incoming SAX events to
a dummy content handler that simply ignores all the events.
- Photoshop - Interface in org.apache.tika.metadata
-
XMP Photoshop metadata schema.
- PING - Static variable in class org.apache.tika.pipes.PipesServer
-
- PipesClient - Class in org.apache.tika.pipes
-
- PipesClient(PipesConfigBase) - Constructor for class org.apache.tika.pipes.PipesClient
-
- PipesClient.LogGobbler - Class in org.apache.tika.pipes
-
- PipesConfig - Class in org.apache.tika.pipes
-
- PipesConfigBase - Class in org.apache.tika.pipes
-
- PipesConfigBase() - Constructor for class org.apache.tika.pipes.PipesConfigBase
-
- PipesException - Exception in org.apache.tika.pipes
-
Fatal exception that means that something went seriously wrong.
- PipesException(Throwable) - Constructor for exception org.apache.tika.pipes.PipesException
-
- PipesIterator - Class in org.apache.tika.pipes.pipesiterator
-
Abstract class that handles the testing for timeouts/thread safety
issues.
- PipesIterator() - Constructor for class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- PipesParser - Class in org.apache.tika.pipes
-
- PipesParser(PipesConfig) - Constructor for class org.apache.tika.pipes.PipesParser
-
- PipesResult - Class in org.apache.tika.pipes
-
- PipesResult(PipesResult.STATUS) - Constructor for class org.apache.tika.pipes.PipesResult
-
- PipesResult(PipesResult.STATUS, String) - Constructor for class org.apache.tika.pipes.PipesResult
-
- PipesResult(EmitData) - Constructor for class org.apache.tika.pipes.PipesResult
-
- PipesResult.STATUS - Enum in org.apache.tika.pipes
-
- PipesServer - Class in org.apache.tika.pipes
-
- PipesServer(Path, InputStream, PrintStream, long, long, long) - Constructor for class org.apache.tika.pipes.PipesServer
-
- PLAIN_TEXT - Static variable in class org.apache.tika.mime.MimeTypes
-
Name of the text type, text/plain.
- PLATFORM - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_AIX - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_ARM - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_EMBEDDED - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_FREEBSD - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_HPUX - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_IRIX - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_LINUX - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_NETBSD - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_SOLARIS - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_SYSV - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_TRU64 - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLATFORM_WINDOWS - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PLUS_VERSION - Static variable in interface org.apache.tika.metadata.IPTC
-
The version number of the PLUS standards in place at the time of the
transaction.
- predict(double[]) - Method in class org.apache.tika.detect.NNTrainedModel
-
- predict(float[]) - Method in class org.apache.tika.detect.NNTrainedModel
-
Given an unseen input vector with m = (256 + 1) and n = 1, this returns a
prediction probability.
- predict(double[]) - Method in class org.apache.tika.detect.TrainedModel
-
- predict(float[]) - Method in class org.apache.tika.detect.TrainedModel
-
- PREFIX - Static variable in interface org.apache.tika.metadata.AccessPermissions
-
- PREFIX - Static variable in interface org.apache.tika.metadata.Database
-
- PREFIX - Static variable in interface org.apache.tika.metadata.MachineMetadata
-
- PREFIX - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
-
- PREFIX - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- PREFIX - Static variable in interface org.apache.tika.metadata.XMP
-
- PREFIX - Static variable in interface org.apache.tika.metadata.XMPIdq
-
- PREFIX - Static variable in interface org.apache.tika.metadata.XMPMM
-
- PREFIX_ - Static variable in interface org.apache.tika.metadata.XMP
-
The xmp prefix followed by the colon delimiter
- PREFIX_ - Static variable in interface org.apache.tika.metadata.XMPIdq
-
The xmpidq prefix followed by the colon delimiter
- PREFIX_ - Static variable in interface org.apache.tika.metadata.XMPMM
-
The xmpMM prefix followed by the colon delimiter
- PREFIX_ - Static variable in interface org.apache.tika.metadata.XMPRights
-
The xmpRights prefix followed by the colon delimiter
- PREFIX_DC - Static variable in interface org.apache.tika.metadata.DublinCore
-
- PREFIX_DC_TERMS - Static variable in interface org.apache.tika.metadata.DublinCore
-
- PREFIX_DOC_META - Static variable in interface org.apache.tika.metadata.Office
-
- PREFIX_FONT_META - Static variable in interface org.apache.tika.metadata.Font
-
- PREFIX_HTML_META - Static variable in interface org.apache.tika.metadata.HTML
-
- PREFIX_IPTC_CORE - Static variable in interface org.apache.tika.metadata.IPTC
-
- PREFIX_IPTC_EXT - Static variable in interface org.apache.tika.metadata.IPTC
-
- PREFIX_PHOTOSHOP - Static variable in interface org.apache.tika.metadata.Photoshop
-
- PREFIX_PLUS - Static variable in interface org.apache.tika.metadata.IPTC
-
- PREFIX_RTF_META - Static variable in interface org.apache.tika.metadata.RTFMetadata
-
- PREFIX_XMP_RIGHTS - Static variable in interface org.apache.tika.metadata.XMPRights
-
- PREFLIGHT_ICC_PROFILE - Static variable in interface org.apache.tika.metadata.PDF
-
- PREFLIGHT_INCREMENTAL_UPDATES - Static variable in interface org.apache.tika.metadata.PDF
-
- PREFLIGHT_IS_LINEARIZED - Static variable in interface org.apache.tika.metadata.PDF
-
- PREFLIGHT_IS_VALID - Static variable in interface org.apache.tika.metadata.PDF
-
- PREFLIGHT_PARSE_EXCEPTION - Static variable in interface org.apache.tika.metadata.PDF
-
- PREFLIGHT_SPECIFICATION - Static variable in interface org.apache.tika.metadata.PDF
-
- PREFLIGHT_TRAILER_COUNT - Static variable in interface org.apache.tika.metadata.PDF
-
- PREFLIGHT_VALIDATION_ERRORS - Static variable in interface org.apache.tika.metadata.PDF
-
- PREFLIGHT_XREF_TYPE - Static variable in interface org.apache.tika.metadata.PDF
-
- PRESENTATION_FORMAT - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- PRINT_DATE - Static variable in interface org.apache.tika.metadata.Office
-
When was the document last printed?
- PRINT_DATE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- priorExtensionFileType(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
-
- priority - Variable in class org.apache.tika.mime.MimeTypesReader
-
- priorMagicFileType(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
-
- priorMetaFileType(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
-
- ProbabilisticMimeDetectionSelector - Class in org.apache.tika.mime
-
Selector for combining different mime detection results
based on probability
- ProbabilisticMimeDetectionSelector() - Constructor for class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
-
- ProbabilisticMimeDetectionSelector(ProbabilisticMimeDetectionSelector.Builder) - Constructor for class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
-
- ProbabilisticMimeDetectionSelector(MimeTypes) - Constructor for class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
-
- ProbabilisticMimeDetectionSelector(MimeTypes, ProbabilisticMimeDetectionSelector.Builder) - Constructor for class org.apache.tika.mime.ProbabilisticMimeDetectionSelector
-
- ProbabilisticMimeDetectionSelector.Builder - Class in org.apache.tika.mime
-
Builder class for setting the probability parameters.
- process(DataInputStream, DataOutputStream) - Method in interface org.apache.tika.fork.ForkResource
-
- process(FetchEmitTuple) - Method in class org.apache.tika.pipes.PipesClient
-
- processingInstruction(String, String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- processingInstruction(String, String) - Method in class org.apache.tika.sax.TeeContentHandler
-
- processingInstruction(String, String) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
-
- processRequests() - Method in class org.apache.tika.pipes.PipesServer
-
- ProcessUtils - Class in org.apache.tika.utils
-
- ProcessUtils() - Constructor for class org.apache.tika.utils.ProcessUtils
-
- PRODUCER - Static variable in interface org.apache.tika.metadata.PDF
-
- PRODUCT_TYPE - Static variable in interface org.apache.tika.metadata.WordPerfect
-
Product type.
- PROGRAM_ID - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- PROJECT_ID - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- Property - Class in org.apache.tika.metadata
-
XMP property definition.
- property(String, String) - Method in class org.apache.tika.sax.XMPContentHandler
-
- Property.PropertyType - Enum in org.apache.tika.metadata
-
- Property.ValueType - Enum in org.apache.tika.metadata
-
- PROPERTY_GROUP_IPTC_CORE - Static variable in interface org.apache.tika.metadata.IPTC
-
- PROPERTY_GROUP_IPTC_EXT - Static variable in interface org.apache.tika.metadata.IPTC
-
- PROPERTY_RELEASE_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
Optional identifier associated with each Property Release.
- PROPERTY_RELEASE_STATUS - Static variable in interface org.apache.tika.metadata.IPTC
-
Summarises the availability and scope of property releases authorizing
usage of the properties appearing in the photograph.
- PropertyTypeException - Exception in org.apache.tika.metadata
-
XMP property definition violation exception.
- PropertyTypeException(String) - Constructor for exception org.apache.tika.metadata.PropertyTypeException
-
- PropertyTypeException(Property.PropertyType, Property.PropertyType) - Constructor for exception org.apache.tika.metadata.PropertyTypeException
-
- PropertyTypeException(Property.ValueType, Property.ValueType) - Constructor for exception org.apache.tika.metadata.PropertyTypeException
-
- PropertyTypeException(Property.PropertyType) - Constructor for exception org.apache.tika.metadata.PropertyTypeException
-
- PROTECTED - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- PROVINCE_OR_STATE - Static variable in interface org.apache.tika.metadata.IPTC
-
Name of the subregion of a country -- either called province or state or
anything else -- the content is focussing on -- either the subregion
shown in visual media or referenced by text or audio media.
- PUBLISHER - Static variable in interface org.apache.tika.metadata.DublinCore
-
An entity responsible for making the resource available.
- PUBLISHER - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- PULL_DOWN - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The sampling phase of film to be converted to video (pull-down)."
- RATING - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- RATING - Static variable in interface org.apache.tika.metadata.XMP
-
A user-assigned rating for this file.
- RDF - Static variable in class org.apache.tika.sax.XMPContentHandler
-
The RDF namespace URI
- read() - Method in class org.apache.tika.io.BoundedInputStream
-
- read(byte[]) - Method in class org.apache.tika.io.BoundedInputStream
-
Invokes the delegate's read(byte[])
method.
- read(byte[], int, int) - Method in class org.apache.tika.io.BoundedInputStream
-
Invokes the delegate's read(byte[], int, int)
method.
- read() - Method in class org.apache.tika.io.LookaheadInputStream
-
- read(byte[], int, int) - Method in class org.apache.tika.io.LookaheadInputStream
-
- read() - Method in class org.apache.tika.io.TailStream
-
This implementation adds the read byte to the internal tail
buffer.
- read(byte[]) - Method in class org.apache.tika.io.TailStream
-
This implementation delegates to the underlying stream and
then adds the correct portion of the read buffer to the internal tail
buffer.
- read(byte[], int, int) - Method in class org.apache.tika.io.TailStream
-
This implementation delegates to the underlying stream and
then adds the correct portion of the read buffer to the internal tail
buffer.
- read(InputStream) - Method in class org.apache.tika.mime.MimeTypesReader
-
- read(Document) - Method in class org.apache.tika.mime.MimeTypesReader
-
- read(InputStream) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
-
- read(Document) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
-
- read(Element) - Static method in class org.apache.tika.parser.external.ExternalParsersConfigReader
-
- read(char[], int, int) - Method in class org.apache.tika.parser.ParsingReader
-
Reads parsed text from the pipe connected to the parsing thread.
- read() - Method in class org.apache.tika.utils.RereadableInputStream
-
Reads a byte from the stream, saving it in the store if it is being
read from the original stream.
- readByteFrequencies(InputStream) - Method in class org.apache.tika.detect.TrainedModelDetector
-
Reads the input stream and builds a byte frequency histogram.
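A byte frequency histogram of the kind mentioned above can be sketched with the JDK alone. This is an illustration only: ByteHistogramSketch and countBytes are hypothetical names, and the choice of raw counts in a 256-slot array is an assumption, since the index does not say how the detector sizes or normalises its histogram.

```java
import java.io.IOException;
import java.io.InputStream;

// Hypothetical helper, not part of Tika: counts how often each byte value occurs.
final class ByteHistogramSketch {
    private ByteHistogramSketch() {}

    // Returns an array where slot i holds the number of bytes with unsigned value i.
    // Assumption: raw counts are returned; any normalisation is left to the caller.
    static long[] countBytes(InputStream in) throws IOException {
        long[] counts = new long[256];
        byte[] chunk = new byte[8192];
        int n;
        while ((n = in.read(chunk)) != -1) {
            for (int i = 0; i < n; i++) {
                counts[chunk[i] & 0xFF]++; // mask to treat the byte as unsigned
            }
        }
        return counts;
    }
}
```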
- readIntBE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE int value from an InputStream
- readIntLE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE int value from an InputStream
- readLongBE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE long value from an InputStream
- readLongLE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE long value from an InputStream
- readShortBE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE short value from an InputStream
- readShortLE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE short value from an InputStream
- readUE7(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
Gets the integer value that is stored in a UTF-8-like fashion, in Big Endian,
with the high bit of each byte indicating whether the value continues
- readUIntBE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
Get a BE unsigned int value from an InputStream
- readUIntLE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
Get a LE unsigned int value from an InputStream
- readUShortBE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
- readUShortLE(InputStream) - Static method in class org.apache.tika.io.EndianUtils
-
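The byte-order conventions used by the EndianUtils readers listed above can be illustrated with a small JDK-only sketch. It is not Tika's code: EndianSketch and its methods are hypothetical stand-ins, and the end-of-stream handling (an EOFException for the fixed-width readers, a partial value for the variable-length reader) is an assumption.

```java
import java.io.EOFException;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical helpers, not part of Tika: illustrate BE, LE and 7-bit encodings.
final class EndianSketch {
    private EndianSketch() {}

    // Big Endian: the first byte read is the most significant.
    static int readIntBE(InputStream in) throws IOException {
        int b1 = next(in), b2 = next(in), b3 = next(in), b4 = next(in);
        return (b1 << 24) | (b2 << 16) | (b3 << 8) | b4;
    }

    // Little Endian: the first byte read is the least significant.
    static int readIntLE(InputStream in) throws IOException {
        int b1 = next(in), b2 = next(in), b3 = next(in), b4 = next(in);
        return (b4 << 24) | (b3 << 16) | (b2 << 8) | b1;
    }

    // Variable-length value: 7 payload bits per byte, most significant group
    // first, with the high bit set on every byte except the last one.
    static long readUE7(InputStream in) throws IOException {
        long value = 0;
        int b;
        while ((b = in.read()) != -1) {
            value = (value << 7) | (b & 0x7F);
            if ((b & 0x80) == 0) {
                break; // high bit clear: this was the final byte
            }
        }
        return value;
    }

    // Reads one unsigned byte or fails if the stream has ended (assumed behaviour).
    private static int next(InputStream in) throws IOException {
        int b = in.read();
        if (b == -1) {
            throw new EOFException("unexpected end of stream");
        }
        return b;
    }
}
```

For instance, the bytes 00 00 01 02 read as a BE int and the bytes 02 01 00 00 read as an LE int both decode to 258 in this sketch.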
- READY - Static variable in class org.apache.tika.pipes.PipesServer
-
- REALIZATION - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- reallyEndDocument() - Method in class org.apache.tika.sax.EndDocumentShieldingContentHandler
-
- recordEmbeddedStreamException(Throwable, Metadata) - Static method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- recordException(Throwable, Metadata) - Static method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- recordParserDetails(Parser, Metadata) - Static method in class org.apache.tika.utils.ParserUtils
-
Records details of the Parser used to the Metadata, typically wanted
where multiple parsers could be picked between or used.
- recordParserFailure(Parser, Throwable, Metadata) - Static method in class org.apache.tika.utils.ParserUtils
-
Records details of a Parser's failure to the Metadata, so you can check
what went wrong even if the Exception wasn't immediately thrown (e.g. when
several different Parsers are used)
- RecursiveParserWrapper - Class in org.apache.tika.parser
-
This is a helper class that wraps a parser in a recursive handler.
- RecursiveParserWrapper(Parser) - Constructor for class org.apache.tika.parser.RecursiveParserWrapper
-
- RecursiveParserWrapper(Parser, boolean) - Constructor for class org.apache.tika.parser.RecursiveParserWrapper
-
- RecursiveParserWrapperHandler - Class in org.apache.tika.sax
-
- RecursiveParserWrapperHandler(ContentHandlerFactory) - Constructor for class org.apache.tika.sax.RecursiveParserWrapperHandler
-
Create a handler with no limit on the number of embedded resources
- RecursiveParserWrapperHandler(ContentHandlerFactory, int) - Constructor for class org.apache.tika.sax.RecursiveParserWrapperHandler
-
Create a handler that limits the number of embedded resources that will be
parsed
- RecursiveParserWrapperHandler(ContentHandlerFactory, int, MetadataFilter) - Constructor for class org.apache.tika.sax.RecursiveParserWrapperHandler
-
- REFERENCES - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- RegexUtils - Class in org.apache.tika.utils
-
Inspired from Nutch code class OutlinkExtractor.
- RegexUtils() - Constructor for class org.apache.tika.utils.RegexUtils
-
- registerModels(MediaType, TrainedModel) - Method in class org.apache.tika.detect.TrainedModelDetector
-
- REGISTRY_ENTRY_CREATED_ITEM_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
A unique identifier created by a registry and applied by the creator of
the item.
- REGISTRY_ENTRY_CREATED_ORGANISATION_ID - Static variable in interface org.apache.tika.metadata.IPTC
-
An identifier for the registry which issued the corresponding Registry Image Id.
- RELATION - Static variable in interface org.apache.tika.metadata.DublinCore
-
A reference to a related resource.
- RELATION - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- RELATIVE_PEAK_AUDIO_FILE_PATH - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The relative path to the file's peak audio file.
- RELEASE_DATE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The date the title was released."
- remove(String) - Method in class org.apache.tika.metadata.Metadata
-
Removes a metadata entry and all its associated values.
- removedService(ServiceReference, Object) - Method in class org.apache.tika.config.TikaActivator
-
- RENDITION_CLASS - Static variable in interface org.apache.tika.metadata.XMPMM
-
The rendition class name for this resource.
- RENDITION_PARAMS - Static variable in interface org.apache.tika.metadata.XMPMM
-
Can be used to provide additional rendition parameters that
are too complex or verbose to encode in xmpMM:RenditionClass
- repeat(char, int) - Static method in class org.apache.tika.utils.StringUtils
-
Returns padding using the specified delimiter repeated
to a given length.
- repeat(String, int) - Static method in class org.apache.tika.utils.StringUtils
-
Repeat a String repeat times to form a new String.
- RereadableInputStream - Class in org.apache.tika.utils
-
Wraps an input stream, reading it only once, but making it available
for rereading an arbitrary number of times.
- RereadableInputStream(InputStream) - Constructor for class org.apache.tika.utils.RereadableInputStream
-
Creates a rereadable input stream with defaults of 512*1024*1024 bytes (512 MiB) for
maxBytesInMemory and both readToEndOfStreamOnFirstRewind and closeOriginalStreamOnClose
set to true
- RereadableInputStream(InputStream, boolean) - Constructor for class org.apache.tika.utils.RereadableInputStream
-
Creates a rereadable input stream defaulting to 512*1024*1024 bytes (512 MiB) for
maxBytesInMemory
- RereadableInputStream(InputStream, int) - Constructor for class org.apache.tika.utils.RereadableInputStream
-
Creates a rereadable input stream with closeOriginalStreamOnClose set to true
- RereadableInputStream(InputStream, int, boolean) - Constructor for class org.apache.tika.utils.RereadableInputStream
-
Creates a rereadable input stream.
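The read-once, reread-many idea behind RereadableInputStream can be sketched as follows. This is a simplified illustration rather than the Tika class: RereadableSketch is a hypothetical name, it keeps everything in memory (there is no maxBytesInMemory limit here), and it always drains the original stream on the first rewind, which is an assumption modelled on the readToEndOfStreamOnFirstRewind option named above.

```java
import java.io.ByteArrayInputStream;
import java.io.ByteArrayOutputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical, memory-only sketch: not the Tika implementation.
final class RereadableSketch extends InputStream {
    private final InputStream original;
    private final ByteArrayOutputStream store = new ByteArrayOutputStream();
    private InputStream current;
    private boolean readingOriginal = true;

    RereadableSketch(InputStream original) {
        this.original = original;
        this.current = original;
    }

    // Reads one byte, saving it in the store while the original is being read.
    @Override
    public int read() throws IOException {
        int b = current.read();
        if (b != -1 && readingOriginal) {
            store.write(b);
        }
        return b;
    }

    // Repositions at the beginning; later reads are served from the store.
    void rewind() throws IOException {
        if (readingOriginal) {
            while (read() != -1) {
                // drain the remainder of the original stream into the store
            }
            readingOriginal = false;
        }
        current = new ByteArrayInputStream(store.toByteArray());
    }

    @Override
    public void close() throws IOException {
        original.close();
    }
}
```

A caller can read a few bytes, call rewind(), and then read the full content again from the start, as many times as needed.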
- RESERVED_FILENAME_CHARACTERS - Static variable in class org.apache.tika.io.FilenameUtils
-
Reserved characters
- reset() - Method in class org.apache.tika.io.BoundedInputStream
-
- reset() - Method in class org.apache.tika.io.LookaheadInputStream
-
- reset() - Method in class org.apache.tika.io.TailStream
-
This implementation restores this stream's state to the
state when mark() was called the last time.
- reset() - Method in class org.apache.tika.io.TikaInputStream
-
- reset() - Method in class org.apache.tika.language.detect.LanguageDetector
-
Reset statistics about the current document being processed
- reset() - Method in class org.apache.tika.language.detect.LanguageWriter
-
- RESOLUTION_HORIZONTAL - Static variable in interface org.apache.tika.metadata.TIFF
-
"Horizontal resolution in pixels per unit."
- RESOLUTION_UNIT - Static variable in interface org.apache.tika.metadata.TIFF
-
"Units used for Horizontal and Vertical Resolutions."
One of "Inch" or "cm"
- RESOLUTION_VERTICAL - Static variable in interface org.apache.tika.metadata.TIFF
-
"Vertical resolution in pixels per unit."
- resolveEntity(String, String) - Method in class org.apache.tika.mime.MimeTypesReader
-
- resolveEntity(String, String) - Method in class org.apache.tika.sax.OfflineContentHandler
-
Returns an empty stream.
- RESOURCE_NAME_KEY - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- REVISION - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
-
The revision number.
- rewind() - Method in class org.apache.tika.utils.RereadableInputStream
-
"Rewinds" the stream to the beginning for rereading.
- RichTextContentHandler - Class in org.apache.tika.sax
-
Content handler for Rich Text; it extracts the alt attribute of XHTML
<img/> tags and the name attribute of XHTML <a/> tags
into the output.
- RichTextContentHandler(Writer) - Constructor for class org.apache.tika.sax.RichTextContentHandler
-
Creates a content handler that writes XHTML body character events to
the given writer.
- RIGHTS - Static variable in interface org.apache.tika.metadata.DublinCore
-
Information about rights held in and over the resource.
- RIGHTS - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- RIGHTS_USAGE_TERMS - Static variable in interface org.apache.tika.metadata.IPTC
-
The licensing parameters of the item expressed in free-text.
- ROOT_XML_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- ROW_COUNT - Static variable in interface org.apache.tika.metadata.Database
-
- RTF_PICT_META_PREFIX - Static variable in interface org.apache.tika.metadata.RTFMetadata
-
- RTFMetadata - Interface in org.apache.tika.metadata
-
- run() - Method in class org.apache.tika.pipes.PipesClient.LogGobbler
-
- run() - Method in class org.apache.tika.pipes.PipesServer
-
- RuntimeSAXException - Exception in org.apache.tika.exception
-
Use this to throw a SAXException in subclassed methods that don't throw SAXExceptions
- RuntimeSAXException(SAXException) - Constructor for exception org.apache.tika.exception.RuntimeSAXException
-
- SafeContentHandler - Class in org.apache.tika.sax
-
- SafeContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.SafeContentHandler
-
- SafeContentHandler.Output - Interface in org.apache.tika.sax
-
Internal interface that allows both character and
ignorable whitespace content to be filtered the same way.
- SAMPLES_PER_PIXEL - Static variable in interface org.apache.tika.metadata.TIFF
-
"Number of components per pixel."
- save(OutputStream) - Method in class org.apache.tika.config.Param
-
- save(Document, Node) - Method in class org.apache.tika.config.Param
-
- SAVE_DATE - Static variable in interface org.apache.tika.metadata.Office
-
When was the document last saved?
- SCALE_TYPE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The musical scale used in the music.
- SCENE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The name of the scene."
- SCENE_CODE - Static variable in interface org.apache.tika.metadata.IPTC
-
Describes the scene of a news content.
- SCHEME - Static variable in interface org.apache.tika.metadata.XMPIdq
-
A qualifier providing the name of the formal identification
scheme used for an item in the xmp:Identifier array.
- SCRIPT_SOURCE - Static variable in interface org.apache.tika.metadata.HTML
-
If a script element contains a src value, this value
is set in the embedded document's metadata
- SecureContentHandler - Class in org.apache.tika.sax
-
Content handler decorator that attempts to prevent denial of service
attacks against Tika parsers.
- SecureContentHandler(ContentHandler, TikaInputStream) - Constructor for class org.apache.tika.sax.SecureContentHandler
-
Decorates the given content handler with zip bomb prevention based
on the count of bytes read from the given counting input stream.
- SECURITY_LOCKED_FOR_ANNOTATIONS - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- SECURITY_NONE - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- SECURITY_PASSWORD_PROTECTED - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- SECURITY_READ_ONLY_ENFORCED - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- SECURITY_READ_ONLY_RECOMMENDED - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- SECURITY_UNKNOWN - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- select(Metadata) - Method in interface org.apache.tika.extractor.DocumentSelector
-
Checks if a document with the given metadata matches the specified
selection criteria.
- serialize(TikaConfig, TikaConfigSerializer.Mode, Writer, Charset) - Static method in class org.apache.tika.config.TikaConfigSerializer
-
- serializeMetadata(List<String>) - Static method in class org.apache.tika.embedder.ExternalEmbedder
-
Serializes a collection of metadata command line arguments into a single
string.
- ServiceLoader - Class in org.apache.tika.config
-
Internal utility class that Tika uses to look up service providers.
- ServiceLoader(ClassLoader, LoadErrorHandler, InitializableProblemHandler, boolean) - Constructor for class org.apache.tika.config.ServiceLoader
-
- ServiceLoader(ClassLoader, LoadErrorHandler, boolean) - Constructor for class org.apache.tika.config.ServiceLoader
-
- ServiceLoader(ClassLoader, LoadErrorHandler) - Constructor for class org.apache.tika.config.ServiceLoader
-
- ServiceLoader(ClassLoader) - Constructor for class org.apache.tika.config.ServiceLoader
-
- ServiceLoader() - Constructor for class org.apache.tika.config.ServiceLoader
-
- ServiceLoaderUtils - Class in org.apache.tika.utils
-
Service Loading and Ordering related utils
- ServiceLoaderUtils() - Constructor for class org.apache.tika.utils.ServiceLoaderUtils
-
- set(String, String) - Method in class org.apache.tika.metadata.Metadata
-
Set metadata name/value.
- set(Property, String) - Method in class org.apache.tika.metadata.Metadata
-
Sets the value of the identified metadata property.
- set(Property, String[]) - Method in class org.apache.tika.metadata.Metadata
-
Sets the values of the identified metadata property.
- set(Property, int) - Method in class org.apache.tika.metadata.Metadata
-
Sets the integer value of the identified metadata property.
- set(Property, double) - Method in class org.apache.tika.metadata.Metadata
-
Sets the real or rational value of the identified metadata property.
- set(Property, Date) - Method in class org.apache.tika.metadata.Metadata
-
Sets the date value of the identified metadata property.
- set(Property, Calendar) - Method in class org.apache.tika.metadata.Metadata
-
Sets the date value of the identified metadata property.
- set(MediaType...) - Static method in class org.apache.tika.mime.MediaType
-
Convenience method that returns an unmodifiable set that contains
all the given media types.
- set(String...) - Static method in class org.apache.tika.mime.MediaType
-
Convenience method that parses the given media type strings and
returns an unmodifiable set that contains all the parsed types.
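The "unmodifiable set from varargs" idea behind the two set(...) convenience methods can be sketched with plain JDK collections. This is a minimal illustration, not Tika's code: the class TypeSets and the method setOf are hypothetical names, and strings stand in for media types.

```java
import java.util.Collections;
import java.util.HashSet;
import java.util.Set;

// Hypothetical helper for illustration; not part of Tika.
class TypeSets {
    @SafeVarargs
    static <T> Set<T> setOf(T... types) {
        Set<T> set = new HashSet<>();
        for (T type : types) {
            set.add(type); // duplicates collapse, as in any Set
        }
        // Callers get a read-only view; add/remove throw UnsupportedOperationException.
        return Collections.unmodifiableSet(set);
    }
}
```

A caller that tries to add to the returned set gets an UnsupportedOperationException, which is the point of handing out an unmodifiable view.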
- set(Class<T>, T) - Method in class org.apache.tika.parser.ParseContext
-
Adds the given value to the context as an implementation of the given
interface.
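Registering a value "as an implementation of the given interface" is the typed-container pattern. The sketch below assumes a simple map keyed by class name; TypedContext is a hypothetical stand-in and says nothing about how ParseContext is actually implemented.

```java
import java.util.HashMap;
import java.util.Map;

// Hypothetical stand-in for illustration; not Tika's ParseContext.
class TypedContext {
    // Values are stored under the name of the interface they implement.
    private final Map<String, Object> values = new HashMap<>();

    <T> void set(Class<T> key, T value) {
        if (value != null) {
            values.put(key.getName(), value);
        } else {
            values.remove(key.getName());
        }
    }

    <T> T get(Class<T> key) {
        // Class.cast returns null for a missing entry, so no extra check is needed.
        return key.cast(values.get(key.getName()));
    }
}
```

The Class<T> key gives compile-time type safety: a value can only be registered under an interface it actually implements.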
- setAll(Properties) - Method in class org.apache.tika.metadata.Metadata
-
Copies all key-value pairs from properties.
- setBasePath(String) - Method in class org.apache.tika.pipes.fetcher.fs.FileSystemFetcher
-
If clients will send in relative paths, this
must be set to allow this fetcher to fetch the
full path.
- setBasePath(String) - Method in class org.apache.tika.pipes.pipesiterator.FileSystemPipesIterator
-
- setCommand(String...) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Sets the command to be run.
- setCommand(String...) - Method in class org.apache.tika.parser.external.ExternalParser
-
Sets the command to be run.
- setCommandAppendOperator(String) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Sets the operator to append rather than replace a value for the command
line tool, i.e.
- setCommandAssignmentDelimeter(String) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Sets the delimiter for multiple assignments for the command line tool,
i.e.
- setCommandAssignmentOperator(String) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Sets the assignment operator for the command line tool, i.e.
- setContentHandler(ContentHandler) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
Sets the underlying content handler.
- setContextClassLoader(ClassLoader) - Static method in class org.apache.tika.config.ServiceLoader
-
Sets the context class loader to use for all threads that access
this class.
- setCorePoolSize(int) - Method in interface org.apache.tika.concurrent.ConfigurableThreadPoolExecutor
-
- setDescription(String) - Method in class org.apache.tika.mime.MimeType
-
Set the description of this media type.
- setDetector(Detector) - Method in class org.apache.tika.parser.AutoDetectParser
-
Sets the type detector used by this parser to auto-detect the type
of a document.
- setDocumentLocator(Locator) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- setDocumentLocator(Locator) - Method in class org.apache.tika.sax.DIFContentHandler
-
- setDocumentLocator(Locator) - Method in class org.apache.tika.sax.TeeContentHandler
-
- setDocumentLocator(Locator) - Method in class org.apache.tika.sax.TextContentHandler
-
- setEmitKey(EmitKey) - Method in class org.apache.tika.pipes.FetchEmitTuple
-
- setEmitMaxEstimatedBytes(long) - Method in class org.apache.tika.pipes.async.AsyncConfig
-
- setEmitterName(String) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- setEmitWithinMillis(long) - Method in class org.apache.tika.pipes.async.AsyncConfig
-
- setEncodingDetector(EncodingDetector) - Method in class org.apache.tika.parser.AbstractEncodingDetectorParser
-
- setExclude(List<String>) - Method in class org.apache.tika.metadata.filter.ExcludeFieldMetadataFilter
-
- setExcludeUnmapped(boolean) - Method in class org.apache.tika.metadata.filter.FieldNameMappingFilter
-
If this is true (default), this means that only the fields that
have a "from" value in the mapper will be passed through.
- setFallback(Parser) - Method in class org.apache.tika.parser.CompositeParser
-
Sets the fallback parser.
- setFetcherName(String) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- setFilePath(String) - Method in class org.apache.tika.detect.FileCommandDetector
-
- setForkedJvmArgs(List<String>) - Method in class org.apache.tika.pipes.PipesConfigBase
-
- setHandlerConfig(HandlerConfig) - Method in class org.apache.tika.pipes.FetchEmitTuple
-
- setHandlerType(String) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- setIdentifier(String) - Method in class org.apache.tika.sax.StandardReference
-
- setIgnoredLineConsumer(ExternalParser.LineConsumer) - Method in class org.apache.tika.parser.external.ExternalParser
-
Set a consumer for the lines ignored by the parse functions
- setInclude(List<String>) - Method in class org.apache.tika.metadata.filter.IncludeFieldMetadataFilter
-
- setJavaCommand(List<String>) - Method in class org.apache.tika.fork.ForkParser
-
Sets the command used to start the forked server process.
- setJavaCommand(String) - Method in class org.apache.tika.fork.ForkParser
-
- setJavaPath(String) - Method in class org.apache.tika.pipes.PipesConfigBase
-
- setMainOrganizationAcronym(String) - Method in class org.apache.tika.sax.StandardReference
-
- setMappings(Map<String, String>) - Method in class org.apache.tika.metadata.filter.FieldNameMappingFilter
-
- setMaxBytes(int) - Method in class org.apache.tika.detect.FileCommandDetector
-
If this is not called on a TikaInputStream, this detector
will spool up to this many bytes to a file to be detected
by the 'file' command.
- setMaxEntityExpansions(int) - Static method in class org.apache.tika.utils.XMLReaderUtils
-
Set the maximum number of entity expansions allowable in SAX/DOM/StAX parsing.
- setMaxFilesProcessed(int) - Method in class org.apache.tika.pipes.PipesConfigBase
-
- setMaxFilesProcessedPerServer(int) - Method in class org.apache.tika.fork.ForkParser
-
If there is a slowly building memory leak in one of the parsers,
it is useful to set a limit on the number of files processed
by a server before it is shutdown and restarted.
- setMaxForEmitBatchBytes(long) - Method in class org.apache.tika.pipes.async.AsyncConfig
-
- setMaximumCompressionRatio(long) - Method in class org.apache.tika.sax.SecureContentHandler
-
Sets the ratio between output characters and input bytes.
- setMaximumDepth(int) - Method in class org.apache.tika.sax.SecureContentHandler
-
Sets the maximum XML element nesting level.
- setMaximumPackageEntryDepth(int) - Method in class org.apache.tika.sax.SecureContentHandler
-
Sets the maximum package entry nesting level.
- setMaximumPoolSize(int) - Method in interface org.apache.tika.concurrent.ConfigurableThreadPoolExecutor
-
- setMaxStringLength(int) - Method in class org.apache.tika.Tika
-
Sets the maximum length of strings returned by the parseToString
methods.
- setMaxWaitForClientMillis(long) - Method in class org.apache.tika.pipes.PipesConfig
-
- setMaxWaitMs(long) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- setMediaTypeRegistry(MediaTypeRegistry) - Method in class org.apache.tika.parser.CompositeParser
-
Sets the media type registry used to infer type relationships.
- setMediaTypeRegistry(MediaTypeRegistry) - Method in class org.apache.tika.parser.multiple.AbstractMultipleParser
-
Sets the media type registry used to infer type relationships.
- setMetadataCommandArguments(Map<Property, String[]>) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Sets the map of Metadata keys to command line parameters.
- setMetadataExtractionPatterns(Map<Pattern, String>) - Method in class org.apache.tika.parser.external.ExternalParser
-
Sets the map of regular expression patterns and Metadata
keys.
- setMimes(List<String>) - Method in class org.apache.tika.metadata.filter.ClearByMimeMetadataFilter
-
- setMixedLanguages(boolean) - Method in class org.apache.tika.language.detect.LanguageDetector
-
- setName(String) - Method in class org.apache.tika.config.Param
-
- setName(String) - Method in class org.apache.tika.pipes.emitter.AbstractEmitter
-
- setName(String) - Method in class org.apache.tika.pipes.fetcher.AbstractFetcher
-
- setNumClients(int) - Method in class org.apache.tika.pipes.PipesConfigBase
-
- setNumEmitters(int) - Method in class org.apache.tika.pipes.async.AsyncConfig
-
- setNumOfHidden(int) - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- setNumOfInputs(int) - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- setNumOfOutputs(int) - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- setOnParseException(String) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- setOnParseException(FetchEmitTuple.ON_PARSE_EXCEPTION) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- setOpenContainer(Object) - Method in class org.apache.tika.io.TikaInputStream
-
Stores the open container object against
the stream, eg after a Zip contents
detector has loaded the file to decide
what it contains.
- setOutputThreshold(long) - Method in class org.apache.tika.sax.SecureContentHandler
-
Sets the threshold for output characters before the zip bomb prevention
is activated.
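One plausible reading of how the output threshold interacts with the ratio set by setMaximumCompressionRatio(long) is sketched below. The check itself is an assumption made for illustration, and RatioCheck and looksLikeZipBomb are hypothetical names, not SecureContentHandler's API.

```java
// Hypothetical helper for illustration; not part of Tika.
class RatioCheck {
    static boolean looksLikeZipBomb(long inputBytes, long outputChars,
                                    long threshold, long maxRatio) {
        // Assumption: below the threshold the ratio is not checked at all,
        // so small documents never trigger the protection.
        if (outputChars <= threshold) {
            return false;
        }
        // Past the threshold, flag output that exceeds inputBytes * maxRatio.
        return outputChars > inputBytes * maxRatio;
    }
}
```

Under this reading, the threshold avoids false alarms on tiny inputs, where a handful of bytes can legitimately expand into many characters.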
- setParams(float[]) - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- setParsers(Map<MediaType, Parser>) - Method in class org.apache.tika.parser.CompositeParser
-
Sets the component parsers.
- setPoolSize(int) - Method in class org.apache.tika.fork.ForkParser
-
Sets the size of the process pool.
- setPoolSize(int) - Static method in class org.apache.tika.mime.MimeTypesReader
-
Set the pool size for cached XML parsers.
- setPoolSize(int) - Static method in class org.apache.tika.utils.XMLReaderUtils
-
Set the pool size for cached XML parsers.
- setPriors(Map<String, Float>) - Method in class org.apache.tika.language.detect.LanguageDetector
-
Set the a-priori probabilities for these languages.
- setQueueSize(int) - Method in class org.apache.tika.pipes.async.AsyncConfig
-
- setQueueSize(int) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- setQuoteAssignmentValues(boolean) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
Sets whether or not to quote assignment values, i.e.
- setScore(double) - Method in class org.apache.tika.sax.StandardReference
-
- setScore(double) - Method in class org.apache.tika.sax.StandardReference.StandardReferenceBuilder
-
- setSecondOrganization(String, String) - Method in class org.apache.tika.sax.StandardReference.StandardReferenceBuilder
-
- setSecondOrganizationAcronym(String) - Method in class org.apache.tika.sax.StandardReference
-
- setSeparator(String) - Method in class org.apache.tika.sax.StandardReference
-
- setServerParseTimeoutMillis(long) - Method in class org.apache.tika.fork.ForkParser
-
The maximum amount of time allowed for the server to try to parse a file.
- setServerPulseMillis(long) - Method in class org.apache.tika.fork.ForkParser
-
The amount of time in milliseconds that the server
should wait before checking to see if the parse has timed out
or if the wait has timed out.
The default is 5 seconds.
- setServerWaitTimeoutMillis(long) - Method in class org.apache.tika.fork.ForkParser
-
The maximum amount of time allowed for the server to wait for a new request to parse
a file.
- setShortText(boolean) - Method in class org.apache.tika.language.detect.LanguageDetector
-
- setShutdownClientAfterMillis(long) - Method in class org.apache.tika.pipes.PipesConfigBase
-
- setSuperType(MimeType, MediaType) - Method in class org.apache.tika.mime.MimeTypes
-
- setSupportedEmbedTypes(Set<MediaType>) - Method in class org.apache.tika.embedder.ExternalEmbedder
-
- setSupportedTypes(Set<MediaType>) - Method in class org.apache.tika.parser.external.ExternalParser
-
- setTemporaryFileDirectory(Path) - Method in class org.apache.tika.io.TemporaryResources
-
- setTemporaryFileDirectory(File) - Method in class org.apache.tika.io.TemporaryResources
-
- setThreshold(double) - Method in class org.apache.tika.sax.StandardsExtractingContentHandler
-
Sets the score to be used as threshold.
- setTikaConfig(Path) - Method in class org.apache.tika.pipes.PipesConfigBase
-
- setTikaConfig(String) - Method in class org.apache.tika.pipes.PipesConfigBase
-
- setTimeoutMillis(long) - Method in class org.apache.tika.pipes.PipesConfigBase
-
- setTimeoutMs(long) - Method in class org.apache.tika.detect.FileCommandDetector
-
- setType(Class<T>) - Method in class org.apache.tika.config.Param
-
- setType(MediaType) - Method in class org.apache.tika.detect.NNTrainedModelBuilder
-
- setTypeString(String) - Method in class org.apache.tika.config.Param
-
- setWriteLimit(int) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- shortText - Variable in class org.apache.tika.language.detect.LanguageDetector
-
- SHOT_DATE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The date and time when the video was shot."
- SHOT_LOCATION - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The name of the location where the video was shot.
- SHOT_NAME - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The name of the shot or take."
- shouldParseEmbedded(Metadata) - Method in interface org.apache.tika.extractor.EmbeddedDocumentExtractor
-
- shouldParseEmbedded(Metadata) - Method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
- shouldParseEmbedded(Metadata) - Method in class org.apache.tika.extractor.ParsingEmbeddedDocumentExtractor
-
- shouldTranslate(InputStream, Metadata) - Method in class org.apache.tika.extractor.DefaultEmbeddedStreamTranslator
-
This should sniff the stream to determine if it needs to be translated.
- shouldTranslate(InputStream, Metadata) - Method in interface org.apache.tika.extractor.EmbeddedStreamTranslator
-
- SimpleThreadPoolExecutor - Class in org.apache.tika.concurrent
-
Simple Thread Pool Executor
- SimpleThreadPoolExecutor() - Constructor for class org.apache.tika.concurrent.SimpleThreadPoolExecutor
-
- size() - Method in class org.apache.tika.metadata.Metadata
-
Returns the number of metadata names in this metadata.
- skip(long) - Method in class org.apache.tika.io.BoundedInputStream
-
Invokes the delegate's skip(long) method.
- skip(InputStream, long, byte[]) - Static method in class org.apache.tika.io.IOUtils
-
- skip(long) - Method in class org.apache.tika.io.LookaheadInputStream
-
- skip(long) - Method in class org.apache.tika.io.TailStream
-
This implementation delegates to the read() method
to ensure that the tail buffer is also filled if data is skipped.
- skip(long) - Method in class org.apache.tika.io.TikaInputStream
-
- skippedEntity(String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- skippedEntity(String) - Method in class org.apache.tika.sax.TeeContentHandler
-
- skippedEntity(String) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
-
- SLIDE_COUNT - Static variable in interface org.apache.tika.metadata.Office
-
The number of slides in the (presentation) document
- SOFTWARE - Static variable in interface org.apache.tika.metadata.TIFF
-
"Software or firmware used to generate the image."
- sortLoadedClasses(List<T>) - Static method in class org.apache.tika.utils.ServiceLoaderUtils
-
Sorts a list of loaded classes, so that non-Tika ones come
before Tika ones, and otherwise in reverse alphabetical order
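The ordering described for sortLoadedClasses can be expressed as a two-level comparator. The sketch works on class names rather than loaded instances, and it assumes that a "Tika" class is one whose name starts with org.apache.tika.; LoadOrder and sortClassNames are hypothetical names.

```java
import java.util.ArrayList;
import java.util.Comparator;
import java.util.List;

// Hypothetical helper for illustration; not part of Tika.
class LoadOrder {
    // Assumption: a "Tika" class is identified by this package prefix.
    static final String TIKA_PREFIX = "org.apache.tika.";

    static List<String> sortClassNames(List<String> names) {
        List<String> sorted = new ArrayList<>(names);
        sorted.sort(
            // false sorts before true, so non-Tika names come first
            Comparator.comparing((String n) -> n.startsWith(TIKA_PREFIX))
                // within each group, reverse alphabetical order
                .thenComparing(Comparator.reverseOrder()));
        return sorted;
    }
}
```

Putting non-Tika classes first lets third-party implementations take priority over the bundled ones when the first match wins.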
- SOURCE - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- SOURCE - Static variable in interface org.apache.tika.metadata.DublinCore
-
A reference to a resource from which the present resource is derived.
- SOURCE - Static variable in interface org.apache.tika.metadata.IPTC
-
Identifies the original owner of the copyright for the intellectual
content of the item.
- SOURCE - Static variable in interface org.apache.tika.metadata.Photoshop
-
- SOURCE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- SOURCE_PATH - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
This should be used to store the path (relative or full)
of the source file, including the file name,
e.g.
- SPACE - Static variable in class org.apache.tika.utils.StringUtils
-
A String for a space character.
- SPEAKER_PLACEMENT - Static variable in interface org.apache.tika.metadata.XMPDM
-
"A description of the speaker angles from center front in degrees.
- STANDARD_REFERENCES - Static variable in class org.apache.tika.sax.StandardsExtractingContentHandler
-
- StandardOrganizations - Class in org.apache.tika.sax
-
This class provides a collection of the most important technical standard organizations.
- StandardOrganizations() - Constructor for class org.apache.tika.sax.StandardOrganizations
-
- StandardReference - Class in org.apache.tika.sax
-
Class that represents a standard reference.
- StandardReference.StandardReferenceBuilder - Class in org.apache.tika.sax
-
- StandardReferenceBuilder(String, String) - Constructor for class org.apache.tika.sax.StandardReference.StandardReferenceBuilder
-
- StandardsExtractingContentHandler - Class in org.apache.tika.sax
-
StandardsExtractingContentHandler is a Content Handler used to extract
standard references while parsing.
- StandardsExtractingContentHandler(ContentHandler, Metadata) - Constructor for class org.apache.tika.sax.StandardsExtractingContentHandler
-
Creates a decorator for the given SAX event handler and Metadata object.
- StandardsExtractingContentHandler() - Constructor for class org.apache.tika.sax.StandardsExtractingContentHandler
-
Creates a decorator that by default forwards incoming SAX events to a
dummy content handler that simply ignores all the events.
- StandardsText - Class in org.apache.tika.sax
-
StandardsText relies on regular expressions to extract standard references
from text.
- StandardsText() - Constructor for class org.apache.tika.sax.StandardsText
-
- start(BundleContext) - Method in class org.apache.tika.config.TikaActivator
-
- startDescription(String, String, String) - Method in class org.apache.tika.sax.XMPContentHandler
-
- startDocument() - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- startDocument() - Method in class org.apache.tika.sax.DIFContentHandler
-
- startDocument() - Method in class org.apache.tika.sax.EmbeddedContentHandler
-
Ignored.
- startDocument() - Method in class org.apache.tika.sax.ExpandedTitleContentHandler
-
- startDocument() - Method in class org.apache.tika.sax.TeeContentHandler
-
- startDocument() - Method in class org.apache.tika.sax.TextContentHandler
-
- startDocument() - Method in class org.apache.tika.sax.ToHTMLContentHandler
-
- startDocument() - Method in class org.apache.tika.sax.ToXMLContentHandler
-
Writes the XML prefix.
- startDocument() - Method in class org.apache.tika.sax.XHTMLContentHandler
-
Starts an XHTML document by setting up the namespace mappings
when called for the first time.
- startDocument() - Method in class org.apache.tika.sax.XMPContentHandler
-
Starts an XMP document by setting up the namespace mappings and
writing out the following header:
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.mime.MimeTypesReader
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.DIFContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.ElementMappingContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.ExpandedTitleContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.LinkContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.RichTextContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.SafeContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.SecureContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.TeeContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.TextAndAttributeContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.TextContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.ToTextContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.ToXMLContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
Starts the given element.
- startElement(String) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
- startElement(String, String, String) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
- startElement(String, AttributesImpl) - Method in class org.apache.tika.sax.XHTMLContentHandler
-
- startElement(String, String, String, Attributes) - Method in class org.apache.tika.sax.xpath.MatchingContentHandler
-
- startEmbeddedDocument(ContentHandler, Metadata) - Method in class org.apache.tika.sax.AbstractRecursiveParserWrapperHandler
-
This is called before parsing each embedded document.
- startEmbeddedDocument(ContentHandler, Metadata) - Method in class org.apache.tika.sax.RecursiveParserWrapperHandler
-
This is called before parsing an embedded document
- startPrefixMapping(String, String) - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- startPrefixMapping(String, String) - Method in class org.apache.tika.sax.TeeContentHandler
-
- startPrefixMapping(String, String) - Method in class org.apache.tika.sax.ToXMLContentHandler
-
- STATE - Static variable in interface org.apache.tika.metadata.Photoshop
-
- StatefulParser - Class in org.apache.tika.parser
-
The RecursiveParserWrapper wraps the parser sent
into the parsecontext and then uses that parser
to store state (among many other things).
- StatefulParser(Parser) - Constructor for class org.apache.tika.parser.StatefulParser
-
Creates a decorator for the given parser.
- stop(BundleContext) - Method in class org.apache.tika.config.TikaActivator
-
- StoppingEarlyException - Exception in org.apache.tika.sax
-
Sentinel exception used to stop SAX parsing of XML once the target
is found.
- StoppingEarlyException() - Constructor for exception org.apache.tika.sax.StoppingEarlyException
-
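The sentinel-exception technique can be tried with the JDK's own SAX parser. In this sketch, StopParsing is a hypothetical stand-in for the sentinel and FindElement is a hypothetical handler; neither is a Tika class.

```java
import java.io.ByteArrayInputStream;
import java.nio.charset.StandardCharsets;
import javax.xml.parsers.SAXParserFactory;
import org.xml.sax.Attributes;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical classes for illustration; not part of Tika.
class EarlyStop {
    // Sentinel: thrown on purpose, carries no error.
    static class StopParsing extends SAXException {
        StopParsing() { super("target found"); }
    }

    static class FindElement extends DefaultHandler {
        final String target;
        int elementsSeen = 0;
        boolean found = false;

        FindElement(String target) { this.target = target; }

        @Override
        public void startElement(String uri, String localName, String qName,
                                 Attributes atts) throws SAXException {
            elementsSeen++;
            if (qName.equals(target)) {
                found = true;
                throw new StopParsing(); // abort the rest of the parse
            }
        }
    }

    static FindElement find(String xml, String target) {
        FindElement handler = new FindElement(target);
        try {
            SAXParserFactory.newInstance().newSAXParser().parse(
                new ByteArrayInputStream(xml.getBytes(StandardCharsets.UTF_8)), handler);
        } catch (StopParsing expected) {
            // Sentinel reached: the remainder of the document is deliberately skipped.
        } catch (Exception e) {
            throw new IllegalStateException(e); // genuine parse or I/O failure
        }
        return handler;
    }
}
```

Catching the sentinel separately from other exceptions keeps a deliberate early exit distinguishable from a real parse failure.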
- StreamEmitter - Interface in org.apache.tika.pipes.emitter
-
- streamResetForReRead(InputStream, TemporaryResources) - Static method in class org.apache.tika.utils.ParserUtils
-
- STRETCH_MODE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The audio stretch mode."
- StringUtils - Class in org.apache.tika.utils
-
- StringUtils() - Constructor for class org.apache.tika.utils.StringUtils
-
- SUB_CLASS_OF_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- SUB_CLASS_TYPE_ATTR - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- SUBJECT - Static variable in interface org.apache.tika.metadata.DublinCore
-
The topic of the content of the resource.
- SUBJECT - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLCore
-
The document's subject.
- SUBJECT - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
DublinCore.SUBJECT; should include both subject and keywords
if a document format has both.
- SUBJECT_CODE - Static variable in interface org.apache.tika.metadata.IPTC
-
Specifies one or more Subjects from the IPTC Subject-NewsCodes taxonomy
to categorise the content.
- SUBLOCATION - Static variable in interface org.apache.tika.metadata.IPTC
-
Name of a sublocation the content is focussing on -- either the
location shown in visual media or referenced by text or audio media.
- SubtreeMatcher - Class in org.apache.tika.sax.xpath
-
Evaluation state of a ...//... XPath expression.
- SubtreeMatcher(Matcher) - Constructor for class org.apache.tika.sax.xpath.SubtreeMatcher
-
- SUPPLEMENTAL_CATEGORIES - Static variable in interface org.apache.tika.metadata.IPTC
-
Deprecated.
- SUPPLEMENTAL_CATEGORIES - Static variable in interface org.apache.tika.metadata.Photoshop
-
- SupplementingParser - Class in org.apache.tika.parser.multiple
-
- SupplementingParser(MediaTypeRegistry, Collection<? extends Parser>, Map<String, Param>) - Constructor for class org.apache.tika.parser.multiple.SupplementingParser
-
- SupplementingParser(MediaTypeRegistry, AbstractMultipleParser.MetadataPolicy, Parser...) - Constructor for class org.apache.tika.parser.multiple.SupplementingParser
-
- SupplementingParser(MediaTypeRegistry, AbstractMultipleParser.MetadataPolicy, Collection<? extends Parser>) - Constructor for class org.apache.tika.parser.multiple.SupplementingParser
-
- SystemUtils - Class in org.apache.tika.utils
-
Copied from commons-lang to avoid requiring the dependency
- SystemUtils() - Constructor for class org.apache.tika.utils.SystemUtils
-
- TABLE_COUNT - Static variable in interface org.apache.tika.metadata.Office
-
The number of Tables in the document
- TABLE_ID - Static variable in interface org.apache.tika.metadata.ClimateForcast
-
- TABLE_NAME - Static variable in interface org.apache.tika.metadata.Database
-
- TaggedContentHandler - Class in org.apache.tika.sax
-
A content handler decorator that tags potential exceptions so that the
handler that caused the exception can easily be identified.
- TaggedContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.TaggedContentHandler
-
Creates a tagging decorator for the given content handler.
- TaggedSAXException - Exception in org.apache.tika.sax
-
A SAXException wrapper that tags the wrapped exception with
a given object reference.
- TaggedSAXException(SAXException, Object) - Constructor for exception org.apache.tika.sax.TaggedSAXException
-
Creates a tagged wrapper for the given exception.
- TailStream - Class in org.apache.tika.io
-
A specialized input stream implementation which records the last portion read
from an underlying stream.
- TailStream(InputStream, int) - Constructor for class org.apache.tika.io.TailStream
-
Creates a new instance of TailStream.
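Recording "the last portion read" is commonly done with a ring buffer. The sketch below is a hypothetical TailRecorder built on java.io.FilterInputStream, not Tika's TailStream; it assumes a tail size greater than zero and routes skip(long) through read() so that skipped bytes are recorded too.

```java
import java.io.FilterInputStream;
import java.io.IOException;
import java.io.InputStream;

// Hypothetical stand-in for illustration; not Tika's TailStream.
class TailRecorder extends FilterInputStream {
    private final byte[] ring; // assumes tailSize > 0
    private long count = 0;    // total bytes seen so far

    TailRecorder(InputStream in, int tailSize) {
        super(in);
        this.ring = new byte[tailSize];
    }

    @Override
    public int read() throws IOException {
        int b = super.read();
        if (b != -1) {
            ring[(int) (count++ % ring.length)] = (byte) b;
        }
        return b;
    }

    @Override
    public int read(byte[] buf, int off, int len) throws IOException {
        int n = super.read(buf, off, len);
        for (int i = 0; i < n; i++) { // loop is skipped when n is -1
            ring[(int) (count++ % ring.length)] = buf[off + i];
        }
        return n;
    }

    // Skipping goes through read() so skipped bytes still reach the ring buffer.
    @Override
    public long skip(long n) throws IOException {
        long skipped = 0;
        while (skipped < n && read() != -1) {
            skipped++;
        }
        return skipped;
    }

    // Returns the most recent bytes, oldest first.
    byte[] getTail() {
        int size = (int) Math.min(count, ring.length);
        byte[] tail = new byte[size];
        for (int i = 0; i < size; i++) {
            tail[i] = ring[(int) ((count - size + i) % ring.length)];
        }
        return tail;
    }
}
```

Reading byte by byte inside skip is slower than a native skip, but it keeps the tail correct regardless of how the caller consumes the stream.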
- TAPE_NAME - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The name of the tape from which the clip was captured, as set during
the capture process."
- TargetElement(QName, Map<QName, QName>) - Constructor for class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
-
Creates a TargetElement; attributes of this element will
be mapped as specified
- TargetElement(String, String, Map<QName, QName>) - Constructor for class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
-
A shortcut that automatically creates the QName object
- TargetElement(QName) - Constructor for class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
-
Creates a TargetElement with no attributes; all attributes
will be deleted from the SAX stream
- TargetElement(String, String) - Constructor for class org.apache.tika.sax.ElementMappingContentHandler.TargetElement
-
A shortcut that automatically creates the QName object
- TeeContentHandler - Class in org.apache.tika.sax
-
Content handler proxy that forwards the received SAX events to zero or
more underlying content handlers.
- TeeContentHandler(ContentHandler...) - Constructor for class org.apache.tika.sax.TeeContentHandler
-
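Forwarding SAX events to several handlers amounts to a loop per callback. The sketch below forwards only startDocument and characters to keep it short; FanOutHandler is a hypothetical name, and a complete proxy would repeat the same loop for every ContentHandler method.

```java
import org.xml.sax.ContentHandler;
import org.xml.sax.SAXException;
import org.xml.sax.helpers.DefaultHandler;

// Hypothetical stand-in for illustration; forwards only two event types.
class FanOutHandler extends DefaultHandler {
    private final ContentHandler[] targets;

    FanOutHandler(ContentHandler... targets) {
        this.targets = targets; // zero targets is valid: events are simply dropped
    }

    @Override
    public void startDocument() throws SAXException {
        for (ContentHandler h : targets) {
            h.startDocument();
        }
    }

    @Override
    public void characters(char[] ch, int start, int length) throws SAXException {
        for (ContentHandler h : targets) {
            h.characters(ch, start, length);
        }
    }
}
```

This allows one parse to feed, for example, a text extractor and a link collector at the same time without reading the document twice.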
- TEMPLATE - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- TEMPO - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The audio's tempo."
- TemporaryResources - Class in org.apache.tika.io
-
Utility class for tracking and ultimately closing or otherwise disposing
a collection of temporary resources.
- TemporaryResources() - Constructor for class org.apache.tika.io.TemporaryResources
-
- text(String) - Static method in class org.apache.tika.mime.MediaType
-
- TEXT_HTML - Static variable in class org.apache.tika.mime.MediaType
-
- TEXT_PLAIN - Static variable in class org.apache.tika.mime.MediaType
-
- TextAndAttributeContentHandler - Class in org.apache.tika.sax
-
- TextAndAttributeContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.TextAndAttributeContentHandler
-
- TextAndAttributeContentHandler(ContentHandler, boolean) - Constructor for class org.apache.tika.sax.TextAndAttributeContentHandler
-
- TextContentHandler - Class in org.apache.tika.sax
-
- TextContentHandler(ContentHandler) - Constructor for class org.apache.tika.sax.TextContentHandler
-
- TextContentHandler(ContentHandler, boolean) - Constructor for class org.apache.tika.sax.TextContentHandler
-
- TextDetector - Class in org.apache.tika.detect
-
Content type detection of plain text documents.
- TextDetector() - Constructor for class org.apache.tika.detect.TextDetector
-
Constructs a TextDetector which will look at the default number
of bytes from the beginning of the document.
- TextDetector(int) - Constructor for class org.apache.tika.detect.TextDetector
-
Constructs a TextDetector which will look at a given number of
bytes from the beginning of the document.
- TextMatcher - Class in org.apache.tika.sax.xpath
-
Final evaluation state of a .../text() XPath expression.
- TextMatcher() - Constructor for class org.apache.tika.sax.xpath.TextMatcher
-
- TextStatistics - Class in org.apache.tika.detect
-
Utility class for computing a histogram of the bytes seen in a stream.
- TextStatistics() - Constructor for class org.apache.tika.detect.TextStatistics
-
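A byte histogram of the kind described for TextStatistics needs little more than a 256-slot counter array. ByteHistogram below is a hypothetical stand-in with assumed method names, not the Tika class.

```java
// Hypothetical stand-in for illustration; not Tika's TextStatistics.
class ByteHistogram {
    private final int[] counts = new int[256];
    private int total = 0;

    void addData(byte[] buffer, int offset, int length) {
        for (int i = 0; i < length; i++) {
            counts[buffer[offset + i] & 0xff]++; // mask so the byte is treated as unsigned
            total++;
        }
    }

    int count(int b) {
        return counts[b & 0xff];
    }

    int total() {
        return total;
    }
}
```

A detector could then compare, say, the share of control bytes against the share of printable ASCII to decide whether a stream looks like plain text.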
- threshold(float) - Method in class org.apache.tika.mime.ProbabilisticMimeDetectionSelector.Builder
-
- THROW - Static variable in interface org.apache.tika.config.InitializableProblemHandler
-
- THROW - Static variable in interface org.apache.tika.config.LoadErrorHandler
-
Strategy that throws a RuntimeException with the given
throwable as the root cause, thus interrupting the entire service
loading operation.
- throwIfCauseOf(SAXException) - Method in class org.apache.tika.sax.SecureContentHandler
-
- throwIfCauseOf(Exception) - Method in class org.apache.tika.sax.TaggedContentHandler
-
Re-throws the original exception thrown by this handler.
- throwIfWriteLimitReached(Exception) - Static method in exception org.apache.tika.exception.WriteLimitReachedException
-
- THUMBNAIL - Static variable in interface org.apache.tika.metadata.RTFMetadata
-
If set to true, this means that an image file is probably a "thumbnail"
any time a pict/emf/wmf is in an object
- TIFF - Interface in org.apache.tika.metadata
-
XMP Exif TIFF schema.
- Tika - Class in org.apache.tika
-
Facade class for accessing Tika functionality.
- Tika(Detector, Parser) - Constructor for class org.apache.tika.Tika
-
Creates a Tika facade using the given detector and parser instances, but the default
Translator.
- Tika(Detector, Parser, Translator) - Constructor for class org.apache.tika.Tika
-
Creates a Tika facade using the given detector, parser, and translator instances.
- Tika(TikaConfig) - Constructor for class org.apache.tika.Tika
-
Creates a Tika facade using the given configuration.
- Tika() - Constructor for class org.apache.tika.Tika
-
Creates a Tika facade using the default configuration.
- Tika(Detector) - Constructor for class org.apache.tika.Tika
-
Creates a Tika facade using the given detector instance, the
default parser configuration, and the default Translator.
- TIKA_CONFIG_PATH - Static variable in class org.apache.tika.parser.AutoDetectParserFactory
-
Path to a tika-config file.
- TIKA_CONTENT - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- TIKA_CONTENT_HANDLER - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
Simple class name of the content handler
- TIKA_LINK_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- TIKA_META_EXCEPTION_EMBEDDED_STREAM - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
Use this to store exceptions caught while trying to read the
stream of an embedded resource.
- TIKA_META_EXCEPTION_PREFIX - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
Use this to store parse exception information in the Metadata object.
- TIKA_META_EXCEPTION_WARNING - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
Use this to store non-fatal exceptions caught during a parse.
- TIKA_META_PREFIX - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
Use this to prefix metadata properties that store information
about the parsing process.
- TIKA_MIME_FILE - Static variable in interface org.apache.tika.metadata.TikaMimeKeys
-
- TIKA_PARSED_BY - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- TIKA_UTI_TAG - Static variable in interface org.apache.tika.mime.MimeTypesReaderMetKeys
-
- TikaActivator - Class in org.apache.tika.config
-
Bundle activator that adjusts the class loading mechanism of the
ServiceLoader class to work correctly in an OSGi environment.
- TikaActivator() - Constructor for class org.apache.tika.config.TikaActivator
-
- TikaConfig - Class in org.apache.tika.config
-
Parses the XML config file.
- TikaConfig(String) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(Path) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(Path, ServiceLoader) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(File) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(File, ServiceLoader) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(URL) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(URL, ClassLoader) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(URL, ServiceLoader) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(InputStream) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(Document) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(Document, ServiceLoader) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(Element) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(Element, ClassLoader) - Constructor for class org.apache.tika.config.TikaConfig
-
- TikaConfig(ClassLoader) - Constructor for class org.apache.tika.config.TikaConfig
-
Creates a Tika configuration from the built-in media type rules
and all the Parser implementations available through the
service provider mechanism in the given class loader.
- TikaConfig() - Constructor for class org.apache.tika.config.TikaConfig
-
Creates a default Tika configuration.
- TikaConfigException - Exception in org.apache.tika.exception
-
Thrown when there is an error in the Tika config file and/or one or more
of the parsers failed to initialize from that erroneous config.
- TikaConfigException(String) - Constructor for exception org.apache.tika.exception.TikaConfigException
-
Creates an instance of the exception.
- TikaConfigException(String, Throwable) - Constructor for exception org.apache.tika.exception.TikaConfigException
-
- TikaConfigSerializer - Class in org.apache.tika.config
-
- TikaConfigSerializer() - Constructor for class org.apache.tika.config.TikaConfigSerializer
-
- TikaConfigSerializer.Mode - Enum in org.apache.tika.config
-
- TikaCoreProperties - Interface in org.apache.tika.metadata
-
Contains a core set of basic Tika metadata properties, which all parsers
will attempt to supply (where the file format permits).
- TikaCoreProperties.EmbeddedResourceType - Enum in org.apache.tika.metadata
-
A file might contain different types of embedded documents.
- TikaEmitterException - Exception in org.apache.tika.pipes.emitter
-
- TikaEmitterException(String) - Constructor for exception org.apache.tika.pipes.emitter.TikaEmitterException
-
- TikaEmitterException(String, Throwable) - Constructor for exception org.apache.tika.pipes.emitter.TikaEmitterException
-
- TikaException - Exception in org.apache.tika.exception
-
Tika exception
- TikaException(String) - Constructor for exception org.apache.tika.exception.TikaException
-
- TikaException(String, Throwable) - Constructor for exception org.apache.tika.exception.TikaException
-
- TikaInputStream - Class in org.apache.tika.io
-
Input stream with extended capabilities.
- TikaMemoryLimitException - Exception in org.apache.tika.exception
-
- TikaMemoryLimitException(String) - Constructor for exception org.apache.tika.exception.TikaMemoryLimitException
-
- TikaMemoryLimitException(long, long) - Constructor for exception org.apache.tika.exception.TikaMemoryLimitException
-
- TikaMimeKeys - Interface in org.apache.tika.metadata
-
A collection of Tika metadata keys used in MIME type resolution.
- TikaTimeoutException - Exception in org.apache.tika.exception
-
- TikaTimeoutException(String) - Constructor for exception org.apache.tika.exception.TikaTimeoutException
-
- TIME_SIGNATURE - Static variable in interface org.apache.tika.metadata.XMPDM
-
"The time signature of the music."
- TIMEOUT - Static variable in class org.apache.tika.pipes.PipesResult
-
- TIMEOUT - Static variable in class org.apache.tika.pipes.PipesServer
-
- TIMEOUT_EXIT_CODE - Static variable in class org.apache.tika.pipes.PipesServer
-
- TIMES_INSTANTIATED - Static variable in class org.apache.tika.config.TikaConfig
-
- TITLE - Static variable in interface org.apache.tika.metadata.DublinCore
-
A name given to the resource.
- TITLE - Static variable in interface org.apache.tika.metadata.IPTC
-
A shorthand reference for the item.
- TITLE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- ToHTMLContentHandler - Class in org.apache.tika.sax
-
SAX event handler that serializes the HTML document to a character stream.
- ToHTMLContentHandler(OutputStream, String) - Constructor for class org.apache.tika.sax.ToHTMLContentHandler
-
- ToHTMLContentHandler() - Constructor for class org.apache.tika.sax.ToHTMLContentHandler
-
- toString() - Method in class org.apache.tika.config.Param
-
- toString() - Method in class org.apache.tika.config.ParamField
-
- toString() - Method in class org.apache.tika.detect.MagicDetector
-
Returns a string representation of the Detection Rule.
- toString() - Method in class org.apache.tika.io.TikaInputStream
-
- toString() - Method in class org.apache.tika.language.detect.LanguageResult
-
- toString() - Method in class org.apache.tika.metadata.Metadata
-
- toString() - Method in class org.apache.tika.mime.MediaType
-
- toString() - Method in class org.apache.tika.mime.MimeType
-
Returns the name of this media type.
- toString() - Method in class org.apache.tika.pipes.emitter.EmitData
-
- toString() - Method in class org.apache.tika.pipes.emitter.EmitKey
-
- toString() - Method in class org.apache.tika.pipes.FetchEmitTuple
-
- toString() - Method in class org.apache.tika.pipes.fetcher.FetchKey
-
- toString() - Method in class org.apache.tika.pipes.HandlerConfig
-
- toString() - Method in class org.apache.tika.pipes.PipesResult
-
- toString() - Method in class org.apache.tika.sax.ContentHandlerDecorator
-
- toString() - Method in class org.apache.tika.sax.DIFContentHandler
-
- toString() - Method in class org.apache.tika.sax.Link
-
- toString() - Method in class org.apache.tika.sax.StandardReference
-
- toString() - Method in class org.apache.tika.sax.TextContentHandler
-
- toString() - Method in class org.apache.tika.sax.ToTextContentHandler
-
Returns the contents of the internal string buffer where
all the received characters have been collected.
- toString() - Method in class org.apache.tika.Tika
-
- TOTAL_TIME - Static variable in interface org.apache.tika.metadata.OfficeOpenXMLExtended
-
- ToTextContentHandler - Class in org.apache.tika.sax
-
SAX event handler that writes all character content out to a character
stream.
- ToTextContentHandler(Writer) - Constructor for class org.apache.tika.sax.ToTextContentHandler
-
Creates a content handler that writes character events to
the given writer.
- ToTextContentHandler(OutputStream) - Constructor for class org.apache.tika.sax.ToTextContentHandler
-
Creates a content handler that writes character events to
the given output stream using the platform default encoding.
- ToTextContentHandler(OutputStream, String) - Constructor for class org.apache.tika.sax.ToTextContentHandler
-
Creates a content handler that writes character events to
the given output stream using the given encoding.
- ToTextContentHandler() - Constructor for class org.apache.tika.sax.ToTextContentHandler
-
Creates a content handler that writes character events
to an internal string buffer.
- ToXMLContentHandler - Class in org.apache.tika.sax
-
SAX event handler that serializes the XML document to a character stream.
- ToXMLContentHandler(OutputStream, String) - Constructor for class org.apache.tika.sax.ToXMLContentHandler
-
Creates an XML serializer that writes to the given byte stream
using the given character encoding.
- ToXMLContentHandler(String) - Constructor for class org.apache.tika.sax.ToXMLContentHandler
-
- ToXMLContentHandler() - Constructor for class org.apache.tika.sax.ToXMLContentHandler
-
- TRACK_NUMBER - Static variable in interface org.apache.tika.metadata.XMPDM
-
"A numeric value indicating the order of the audio file within its
original recording."
- TrainedModel - Class in org.apache.tika.detect
-
- TrainedModel() - Constructor for class org.apache.tika.detect.TrainedModel
-
- TrainedModelDetector - Class in org.apache.tika.detect
-
- TrainedModelDetector() - Constructor for class org.apache.tika.detect.TrainedModelDetector
-
- translate(InputStream, Metadata) - Method in class org.apache.tika.extractor.DefaultEmbeddedStreamTranslator
-
This will consume the InputStream and return a new stream of translated bytes.
- translate(InputStream, Metadata) - Method in interface org.apache.tika.extractor.EmbeddedStreamTranslator
-
- translate(String, String, String) - Method in class org.apache.tika.language.translate.DefaultTranslator
-
Translate, using the first available service-loaded translator
- translate(String, String) - Method in class org.apache.tika.language.translate.DefaultTranslator
-
Translate, using the first available service-loaded translator
- translate(String, String, String) - Method in class org.apache.tika.language.translate.EmptyTranslator
-
- translate(String, String) - Method in class org.apache.tika.language.translate.EmptyTranslator
-
- translate(String, String, String) - Method in interface org.apache.tika.language.translate.Translator
-
Translate text between given languages.
- translate(String, String) - Method in interface org.apache.tika.language.translate.Translator
-
Translate text to the given language.
This method attempts to auto-detect the source language of the text.
- translate(String, String, String) - Method in class org.apache.tika.Tika
-
Translate the given text String from the given source language to the given target language.
- translate(String, String) - Method in class org.apache.tika.Tika
-
Translate the given text String to the given language, attempting to auto-detect the
source language.
- Translator - Interface in org.apache.tika.language.translate
-
Interface for Translator services.
- TRANSMISSION_REFERENCE - Static variable in interface org.apache.tika.metadata.Photoshop
-
- trimMessage(String) - Static method in class org.apache.tika.utils.ExceptionUtils
-
Utility method to trim the message from a stack trace
string.
- tryToAdd(FetchEmitTuple) - Method in class org.apache.tika.pipes.pipesiterator.PipesIterator
-
- tryToFindExistingLeafParser(Class, ParseContext) - Static method in class org.apache.tika.extractor.EmbeddedDocumentUtil
-
Tries to find an existing parser within the ParseContext.
- tryToParse(String) - Method in class org.apache.tika.utils.DateUtils
-
Tries to parse the date string; returns null if no parse was possible.
- TYPE - Static variable in interface org.apache.tika.metadata.DublinCore
-
The nature or genre of the content of the resource.
- TYPE - Static variable in interface org.apache.tika.metadata.TikaCoreProperties
-
- type - Variable in class org.apache.tika.mime.MimeTypesReader
-
Current type
- TypeDetector - Class in org.apache.tika.detect
-
Content type detection based on a content type hint.
- TypeDetector() - Constructor for class org.apache.tika.detect.TypeDetector
-
- types - Variable in class org.apache.tika.mime.MimeTypesReader
-