Class LDJsonParseFilter

    • Field Detail

      • LOG

        public static final org.slf4j.Logger LOG
    • Constructor Detail

      • LDJsonParseFilter

        public LDJsonParseFilter()
    • Method Detail

      • filterJson

        public static com.fasterxml.jackson.databind.JsonNode filterJson​(org.jsoup.nodes.Document doc)
                                                                  throws Exception
        Throws:
        Exception
      • configure

        public void configure​(@NotNull
                              @NotNull Map<String,​Object> stormConf,
                              @NotNull
                              @NotNull com.fasterxml.jackson.databind.JsonNode filterParams)
        Description copied from interface: Configurable
        Called when this filter is being initialized
        Specified by:
        configure in interface Configurable
        Parameters:
        stormConf - The Storm configuration used for the configurable
        filterParams - the filter specific configuration. Never null
      • filter

        public void filter​(@NotNull
                           @NotNull String url,
                           byte[] content,
                           @NotNull
                           @NotNull org.jsoup.nodes.Document doc,
                           @NotNull
                           @NotNull ParseResult parse)
        Description copied from interface: JSoupFilter
        Called when parsing a specific page
        Specified by:
        filter in interface JSoupFilter
        Parameters:
        url - the URL of the page being parsed
        content - the content being parsed
        doc - document produced by JSoup's parsingF
        parse - the metadata to be updated with the resulting of the parsing