CollectionTagger |
Assigns one or more tags to the metadata of a document based on its URL matching patterns defined
in a JSON resource file.
|
CommaSeparatedToMultivaluedMetadata |
Rewrites single metadata containing comma separated values into multiple values for the same key,
useful for instance for keyword tags.
|
DebugParseFilter |
Dumps the DOM representation of a document into a file
|
DomainParseFilter |
Adds domain (or host) to metadata - can be used later on for indexing *
|
LDJsonParseFilter |
Extracts data from JSON-LD representation (https://json-ld.org/)
|
LinkParseFilter |
ParseFilter to extract additional links with Xpath can be configured with e.g.
|
MD5SignatureParseFilter |
Computes a signature for a page, based on the binary content or text.
|
MimeTypeNormalization |
Normalises the MimeType value e.g.
|
XPathFilter |
Simple ParseFilter to illustrate and test the interface.
|