Class ParseFilter

    • Constructor Detail

      • ParseFilter

        public ParseFilter()
    • Method Detail

      • filter

        public abstract void filter​(String URL,
                                    byte[] content,
                                    DocumentFragment doc,
                                    ParseResult parse)
        Called when parsing a specific page
        Parameters:
        URL - the URL of the page being parsed
        content - the content being parsed
        doc - the DOM tree resulting of the parsing of the content or null if needsDOM() returns false
        parse - the metadata to be updated with the resulting of the parsing
      • needsDOM

        public boolean needsDOM()
        Specifies whether this filter requires a DOM representation of the document
        Returns:
        trueif this needs a DOM representation of the document, false otherwise.