Class RegexURLFilter
- java.lang.Object
  - com.digitalpebble.stormcrawler.util.AbstractConfigurable
    - com.digitalpebble.stormcrawler.filtering.URLFilter
      - com.digitalpebble.stormcrawler.filtering.regex.RegexURLFilterBase
        - com.digitalpebble.stormcrawler.filtering.regex.RegexURLFilter

All Implemented Interfaces: Configurable
public class RegexURLFilter extends RegexURLFilterBase
Filters URLs based on a file of regular expressions using the Java Regex implementation. Adapted from Apache Nutch 1.9.
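The sketch below illustrates how a file of regular expressions could be turned into signed rules. It is not StormCrawler code: the class RuleFileSketch, its nested Rule type, and the parse method are hypothetical names. The line format (a leading + or - followed by the regex, with blank lines and # comments skipped) is an assumption based on the Apache Nutch convention this class is adapted from.

```java
import java.util.ArrayList;
import java.util.Arrays;
import java.util.List;
import java.util.regex.Pattern;

// Hypothetical sketch; not part of StormCrawler.
final class RuleFileSketch {

    /** One parsed rule: include (true) or exclude (false), plus the compiled pattern. */
    static final class Rule {
        final boolean sign;
        final Pattern pattern;

        Rule(boolean sign, String regex) {
            this.sign = sign;
            this.pattern = Pattern.compile(regex);
        }
    }

    /**
     * Parses lines assumed to look like "+regex" or "-regex".
     * Blank lines and lines starting with '#' are skipped (assumed format).
     */
    static List<Rule> parse(List<String> lines) {
        List<Rule> rules = new ArrayList<>();
        for (String line : lines) {
            String trimmed = line.trim();
            if (trimmed.isEmpty() || trimmed.startsWith("#")) {
                continue;
            }
            char first = trimmed.charAt(0);
            if (first != '+' && first != '-') {
                throw new IllegalArgumentException("Rule must start with + or -: " + trimmed);
            }
            // '+' maps to sign == true (include), '-' to sign == false (exclude).
            rules.add(new Rule(first == '+', trimmed.substring(1)));
        }
        return rules;
    }

    public static void main(String[] args) {
        List<Rule> rules = parse(Arrays.asList(
                "# skip common image suffixes",
                "-\\.(gif|jpg|png)$",
                "+."));
        System.out.println(rules.size() + " rules parsed");
    }
}
```

In the real class, each parsed pair would presumably be handed to createRule(boolean sign, String regex) rather than to a local constructor.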
Constructor Summary
Constructors
- RegexURLFilter()
Method Summary
Modifier and Type: protected RegexRule
Method: createRule(boolean sign, String regex)
Description: Creates a new RegexRule.
Methods inherited from class com.digitalpebble.stormcrawler.filtering.regex.RegexURLFilterBase
configure, filter
Methods inherited from class com.digitalpebble.stormcrawler.util.AbstractConfigurable
configure, getName
Method Detail
createRule
protected RegexRule createRule(boolean sign, String regex)
Description copied from class: RegexURLFilterBase
Creates a new RegexRule.
Specified by: createRule in class RegexURLFilterBase
Parameters:
sign - the sign of the regular expression. A true value means that any URL matching this rule must be included, whereas a false value means that any URL matching this rule must be excluded.
regex - the regular expression associated with this rule.
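To make the meaning of the sign parameter concrete, here is a minimal, self-contained sketch of how signed rules might decide the fate of a URL. The class SignedRuleSketch and its decide method are hypothetical and do not belong to StormCrawler. The sketch assumes that rules are tried in order, that the first matching rule decides, that matching uses Matcher.find() (a partial match), and that a URL matching no rule is rejected. These assumptions follow the Nutch behaviour and are not confirmed by this page.

```java
import java.util.regex.Pattern;

// Hypothetical sketch; not part of StormCrawler.
final class SignedRuleSketch {

    /** A rule pairing a sign (true = include, false = exclude) with a regex. */
    static final class Rule {
        final boolean sign;
        final Pattern pattern;

        Rule(boolean sign, String regex) {
            this.sign = sign;
            this.pattern = Pattern.compile(regex);
        }

        /** Partial match: the regex may match anywhere in the URL. */
        boolean matches(String url) {
            return pattern.matcher(url).find();
        }
    }

    /**
     * Returns the URL if the first matching rule has sign == true,
     * or null if that rule has sign == false or no rule matches.
     */
    static String decide(String url, Rule... rules) {
        for (Rule rule : rules) {
            if (rule.matches(url)) {
                return rule.sign ? url : null;
            }
        }
        return null; // assumed default: reject URLs that match no rule
    }

    public static void main(String[] args) {
        Rule excludeImages = new Rule(false, "\\.(gif|jpg|png)$");
        Rule includeAll = new Rule(true, ".");

        // Matches the exclude rule first, so it is dropped: prints "null"
        System.out.println(decide("http://example.com/logo.png", excludeImages, includeAll));
        // Falls through to the include rule: prints the URL
        System.out.println(decide("http://example.com/index.html", excludeImages, includeAll));
    }
}
```

Under the first-match assumption, rule order matters: placing the catch-all include rule before the exclude rule would let image URLs through.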