Package | Description |
---|---|
org.apache.tika.config |
Tika configuration tools.
|
org.apache.tika.detect |
Media type detection.
|
org.apache.tika.embedder | |
org.apache.tika.extractor |
Extraction of component documents.
|
org.apache.tika.fork |
Forked parser.
|
org.apache.tika.mime |
Media type information.
|
org.apache.tika.parser |
Tika parsers.
|
org.apache.tika.parser.external |
External parser process.
|
org.apache.tika.parser.multiple |
Modifier and Type | Method and Description |
---|---|
Parser |
TikaConfig.getParser(MediaType mimeType)
Deprecated.
Use the
TikaConfig.getParser() method instead |
Modifier and Type | Method and Description |
---|---|
MediaType |
EmptyDetector.detect(InputStream input,
Metadata metadata) |
MediaType |
FileCommandDetector.detect(InputStream input,
Metadata metadata) |
MediaType |
MagicDetector.detect(InputStream input,
Metadata metadata) |
MediaType |
TrainedModelDetector.detect(InputStream input,
Metadata metadata) |
MediaType |
TypeDetector.detect(InputStream input,
Metadata metadata)
Detects the content type of an input document based on a type hint
given in the input metadata.
|
MediaType |
NameDetector.detect(InputStream input,
Metadata metadata)
Detects the content type of an input document based on the document
name given in the input metadata.
|
MediaType |
Detector.detect(InputStream input,
Metadata metadata)
Detects the content type of the given input document.
|
MediaType |
ZeroSizeFileDetector.detect(InputStream stream,
Metadata metadata) |
MediaType |
TextDetector.detect(InputStream input,
Metadata metadata)
Looks at the beginning of the document input stream to determine
whether the document is text or not.
|
MediaType |
CompositeDetector.detect(InputStream input,
Metadata metadata) |
MediaType |
OverrideDetector.detect(InputStream input,
Metadata metadata) |
MediaType |
NNTrainedModelBuilder.getType() |
Modifier and Type | Method and Description |
---|---|
static MagicDetector |
MagicDetector.parse(MediaType mediaType,
String type,
String offset,
String value,
String mask) |
protected void |
TrainedModelDetector.registerModels(MediaType type,
TrainedModel model) |
void |
NNTrainedModelBuilder.setType(MediaType type) |
Constructor and Description |
---|
MagicDetector(MediaType type,
byte[] pattern)
Creates a detector for input documents that have the exact given byte
pattern at the beginning of the document stream.
|
MagicDetector(MediaType type,
byte[] pattern,
byte[] mask,
boolean isRegex,
boolean isStringIgnoreCase,
int offsetRangeBegin,
int offsetRangeEnd)
Creates a detector for input documents that meet the specified
magic match.
|
MagicDetector(MediaType type,
byte[] pattern,
byte[] mask,
boolean isRegex,
int offsetRangeBegin,
int offsetRangeEnd)
Creates a detector for input documents that meet the specified
magic match.
|
MagicDetector(MediaType type,
byte[] pattern,
byte[] mask,
int offsetRangeBegin,
int offsetRangeEnd)
Creates a detector for input documents that meet the specified magic
match.
|
MagicDetector(MediaType type,
byte[] pattern,
int offset)
Creates a detector for input documents that have the exact given byte
pattern at the given offset of the document stream.
|
Constructor and Description |
---|
NameDetector(Map<Pattern,MediaType> patterns)
Creates a new content type detector based on the given name patterns.
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ExternalEmbedder.getSupportedEmbedTypes() |
Set<MediaType> |
ExternalEmbedder.getSupportedEmbedTypes(ParseContext context) |
Set<MediaType> |
Embedder.getSupportedEmbedTypes(ParseContext context)
Returns the set of media types supported by this embedder when used with
the given parse context.
|
Modifier and Type | Method and Description |
---|---|
void |
ExternalEmbedder.setSupportedEmbedTypes(Set<MediaType> supportedEmbedTypes) |
Modifier and Type | Method and Description |
---|---|
void |
EmbeddedResourceHandler.handle(String filename,
MediaType mediaType,
InputStream stream)
Called to process an embedded resource within the container.
|
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ForkParser.getSupportedTypes(ParseContext context) |
Modifier and Type | Field and Description |
---|---|
static MediaType |
MediaType.APPLICATION_XML |
static MediaType |
MediaType.APPLICATION_ZIP |
static MediaType |
MediaType.EMPTY |
static MediaType |
MediaType.OCTET_STREAM |
static MediaType |
MediaType.TEXT_HTML |
static MediaType |
MediaType.TEXT_PLAIN |
Modifier and Type | Method and Description |
---|---|
static MediaType |
MediaType.application(String type) |
static MediaType |
MediaType.audio(String type) |
MediaType |
ProbabilisticMimeDetectionSelector.detect(InputStream input,
Metadata metadata) |
MediaType |
MimeTypes.detect(InputStream input,
Metadata metadata)
Automatically detects the MIME type of a document based on magic
markers in the stream prefix and any given metadata hints.
|
MediaType |
MediaType.getBaseType()
Returns the base form of the MediaType, excluding
any parameters, such as "text/plain" for
"text/plain; charset=utf-8"
|
MediaType |
MediaTypeRegistry.getSupertype(MediaType type)
Returns the supertype of the given type.
|
MediaType |
MimeType.getType()
Returns the normalized media type name.
|
static MediaType |
MediaType.image(String type) |
MediaType |
MediaTypeRegistry.normalize(MediaType type) |
static MediaType |
MediaType.parse(String string)
Parses the given string to a media type.
|
static MediaType |
MediaType.text(String type) |
static MediaType |
MediaType.video(String type) |
Modifier and Type | Method and Description |
---|---|
SortedSet<MediaType> |
MediaTypeRegistry.getAliases(MediaType type)
Returns the set of known aliases of the given canonical media type.
|
SortedSet<MediaType> |
MediaTypeRegistry.getChildTypes(MediaType type)
Returns the set of known children of the given canonical media type
|
SortedSet<MediaType> |
MediaTypeRegistry.getTypes()
Returns the set of all known canonical media types.
|
static Set<MediaType> |
MediaType.set(MediaType... types)
Convenience method that returns an unmodifiable set that contains
all the given media types.
|
static Set<MediaType> |
MediaType.set(String... types)
Convenience method that parses the given media type strings and
returns an unmodifiable set that contains all the parsed types.
|
Modifier and Type | Method and Description |
---|---|
void |
MediaTypeRegistry.addAlias(MediaType type,
MediaType alias) |
void |
MediaTypeRegistry.addSuperType(MediaType type,
MediaType supertype) |
void |
MediaTypeRegistry.addType(MediaType type) |
int |
MediaType.compareTo(MediaType that) |
SortedSet<MediaType> |
MediaTypeRegistry.getAliases(MediaType type)
Returns the set of known aliases of the given canonical media type.
|
SortedSet<MediaType> |
MediaTypeRegistry.getChildTypes(MediaType type)
Returns the set of known children of the given canonical media type
|
MediaType |
MediaTypeRegistry.getSupertype(MediaType type)
Returns the supertype of the given type.
|
boolean |
MediaTypeRegistry.isInstanceOf(MediaType a,
MediaType b)
Checks whether the given media type equals the given base type or
is a specialization of it.
|
boolean |
MediaTypeRegistry.isInstanceOf(String a,
MediaType b)
Parses and normalises the given media type string and checks whether
the result equals the given base type or is a specialization of it.
|
boolean |
MediaTypeRegistry.isSpecializationOf(MediaType a,
MediaType b)
Checks whether the given media type a is a specialization of a more
generic type b.
|
MediaType |
MediaTypeRegistry.normalize(MediaType type) |
static Set<MediaType> |
MediaType.set(MediaType... types)
Convenience method that returns an unmodifiable set that contains
all the given media types.
|
void |
MimeTypes.setSuperType(MimeType type,
MediaType parent) |
Constructor and Description |
---|
MediaType(MediaType type,
Charset charset)
Creates a media type by adding the "charset" parameter to a base type.
|
MediaType(MediaType type,
Map<String,String> parameters) |
MediaType(MediaType type,
String name,
String value)
Creates a media type by adding a parameter to a base type.
|
Modifier and Type | Method and Description |
---|---|
Map<MediaType,List<Parser>> |
CompositeParser.findDuplicateParsers(ParseContext context)
Utility method that goes through all the component parsers and finds
all media types for which more than one parser declares support.
|
Map<MediaType,Parser> |
CompositeParser.getParsers()
Returns the component parsers.
|
Map<MediaType,Parser> |
CompositeParser.getParsers(ParseContext context) |
Map<MediaType,Parser> |
DefaultParser.getParsers(ParseContext context) |
Set<MediaType> |
CryptoParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
CompositeParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
DelegatingParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
NetworkParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
ErrorParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
Parser.getSupportedTypes(ParseContext context)
Returns the set of media types supported by this parser when used
with the given parse context.
|
Set<MediaType> |
EmptyParser.getSupportedTypes(ParseContext context) |
Set<MediaType> |
RecursiveParserWrapper.getSupportedTypes(ParseContext context) |
Set<MediaType> |
ParserDecorator.getSupportedTypes(ParseContext context)
Delegates the method call to the decorated parser.
|
Modifier and Type | Method and Description |
---|---|
void |
CompositeParser.setParsers(Map<MediaType,Parser> parsers)
Sets the component parsers.
|
static Parser |
ParserDecorator.withFallbacks(Collection<? extends Parser> parsers,
Set<MediaType> types)
Deprecated.
This has been replaced by
FallbackParser |
static Parser |
ParserDecorator.withoutTypes(Parser parser,
Set<MediaType> excludeTypes)
Decorates the given parser so that it never claims to support
parsing of the given media types, but will work for all others.
|
static Parser |
ParserDecorator.withTypes(Parser parser,
Set<MediaType> types)
Decorates the given parser so that it always claims to support
parsing of the given media types.
|
Constructor and Description |
---|
CryptoParser(String transformation,
Provider provider,
Set<MediaType> types) |
CryptoParser(String transformation,
Set<MediaType> types) |
NetworkParser(URI uri,
Set<MediaType> supportedTypes) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
ExternalParser.getSupportedTypes() |
Set<MediaType> |
ExternalParser.getSupportedTypes(ParseContext context) |
Modifier and Type | Method and Description |
---|---|
void |
ExternalParser.setSupportedTypes(Set<MediaType> supportedTypes) |
Modifier and Type | Method and Description |
---|---|
Set<MediaType> |
AbstractMultipleParser.getSupportedTypes(ParseContext context) |
Copyright © 2007–2021 The Apache Software Foundation. All rights reserved.