Class StdOutIndexer
- java.lang.Object
-
- org.apache.storm.topology.base.BaseComponent
-
- org.apache.storm.topology.base.BaseRichBolt
-
- com.digitalpebble.stormcrawler.indexing.AbstractIndexerBolt
-
- com.digitalpebble.stormcrawler.indexing.StdOutIndexer
-
- All Implemented Interfaces:
Serializable
,org.apache.storm.task.IBolt
,org.apache.storm.topology.IComponent
,org.apache.storm.topology.IRichBolt
public class StdOutIndexer extends AbstractIndexerBolt
Indexer which generates fields for indexing and sends them to the standard output. Useful for debugging and as an illustration of what AbstractIndexerBolt provides.- See Also:
- Serialized Form
-
-
Field Summary
-
Fields inherited from class com.digitalpebble.stormcrawler.indexing.AbstractIndexerBolt
canonicalMetadataParamName, ignoreEmptyFieldValueParamName, metadata2fieldParamName, metadataFilterParamName, textFieldParamName, textLengthParamName, urlFieldParamName
-
-
Constructor Summary
Constructors Constructor Description StdOutIndexer()
-
Method Summary
All Methods Instance Methods Concrete Methods Modifier and Type Method Description void
execute(org.apache.storm.tuple.Tuple tuple)
void
prepare(Map<String,Object> conf, org.apache.storm.task.TopologyContext context, org.apache.storm.task.OutputCollector collector)
-
Methods inherited from class com.digitalpebble.stormcrawler.indexing.AbstractIndexerBolt
declareOutputFields, fieldNameForText, fieldNameForURL, filterDocument, filterMetadata, getDocumentID, ignoreEmptyFields, trimText, valueForURL
-
-
-
-
Method Detail
-
prepare
public void prepare(Map<String,Object> conf, org.apache.storm.task.TopologyContext context, org.apache.storm.task.OutputCollector collector)
- Specified by:
prepare
in interfaceorg.apache.storm.task.IBolt
- Overrides:
prepare
in classAbstractIndexerBolt
-
execute
public void execute(org.apache.storm.tuple.Tuple tuple)
-
-