Package org.archive.modules.recrawl
-
Interface Summary Interface Description RecrawlAttributeConstants -
Class Summary Class Description AbstractContentDigestHistory Represents a store of information, presumably persistent, keyed by content digest.AbstractPersistProcessor BdbContentDigestHistory Bdb content digest history store.ContentDigestHistoryLoader ContentDigestHistoryStorer FetchHistoryProcessor Maintain a history of fetch information inside the CrawlURI's attributes.PersistLoadProcessor Loads CrawlURI attributes from previous fetch from persistent storage for consultation by a later recrawl.PersistLogProcessor Log CrawlURI attributes from latest fetch for consultation by a later recrawl.PersistOnlineProcessor Common superclass for persisting Processors which directly store/load to persistence (as opposed to logging for batch load later).PersistProcessor Superclass for Processors which utilize BDB-JE for URI state (including most notably history) persistence.PersistStoreProcessor Store CrawlURI attributes from latest fetch to persistent storage for consultation by a later recrawl.