class |
CollectionTagger |
Assigns one or more tags to the metadata of a document based on its URL matching patterns defined
in a JSON resource file.
|
class |
CommaSeparatedToMultivaluedMetadata |
Rewrites single metadata containing comma separated values into multiple values for the same key,
useful for instance for keyword tags.
|
class |
DebugParseFilter |
Dumps the DOM representation of a document into a file
|
class |
DomainParseFilter |
Adds domain (or host) to metadata - can be used later on for indexing *
|
class |
LDJsonParseFilter |
Extracts data from JSON-LD representation (https://json-ld.org/)
|
class |
LinkParseFilter |
ParseFilter to extract additional links with Xpath can be configured with e.g.
|
class |
MD5SignatureParseFilter |
Computes a signature for a page, based on the binary content or text.
|
class |
MimeTypeNormalization |
Normalises the MimeType value e.g.
|
class |
XPathFilter |
Simple ParseFilter to illustrate and test the interface.
|