Class MetadataRecordBuilder

java.lang.Object
org.archive.modules.warc.BaseWARCRecordBuilder
org.archive.modules.warc.MetadataRecordBuilder
All Implemented Interfaces:
WARCRecordBuilder

public class MetadataRecordBuilder extends BaseWARCRecordBuilder
  • Constructor Details

    • MetadataRecordBuilder

      public MetadataRecordBuilder()
  • Method Details

    • shouldBuildRecord

      public boolean shouldBuildRecord(CrawlURI curi)
      If you don't want metadata records, take this class out of the chain.
      Parameters:
      curi - a captured url
      Returns:
      true if it is appropriate for this WARCRecordBuilder to build a record for this capture, false otherwise
    • buildRecord

      public org.archive.io.warc.WARCRecordInfo buildRecord(CrawlURI curi, URI concurrentTo) throws IOException
      Description copied from interface: WARCRecordBuilder
      Builds a warc record for this capture.
      Parameters:
      curi - a captured url
      concurrentTo - implementations should do this:
          if (concurrentTo != null) {
              recordInfo.addExtraHeader(HEADER_KEY_CONCURRENT_TO,
                      "<" + concurrentTo + ">");
          }
      Returns:
      the freshly built warc record
      Throws:
      IOException