Interface InputFormat

    • Method Detail

      • getWeightedSize

        default long getWeightedSize(String path,
                                     long size)
        Computes the weighted size of a given input object of the underlying input format type, weighted for its cost during ingestion. The weight depends on the format type and on the compression type (CompressionUtils.Format), if any. Uncompressed newline-delimited JSON is the baseline, with a scale factor of 1: the byte weight of such an input object towards ingestion is its file size taken as is, 1:1.
        Parameters:
        path - The path of the input object. Used to determine whether the object is compressed.
        size - The size of the input object in bytes.
        Returns:
        The weighted size of the input object.
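
        The weighting described above can be illustrated with a minimal, standalone sketch. It is not the library's implementation: the class name WeightedSizeSketch, the extension check in looksCompressed, and the scale factor of 4 for compressed input are all assumptions chosen for illustration. Only the baseline comes from the description, namely that an uncompressed input object is weighted 1:1.

```java
import java.util.Locale;

public class WeightedSizeSketch {
    // Hypothetical scale factor: assume a compressed object costs 4x its
    // on-disk byte size during ingestion. The real factor is format-specific.
    static final long ASSUMED_COMPRESSED_SCALE = 4L;

    // Hypothetical helper: guess compression from the file extension.
    static boolean looksCompressed(String path) {
        String p = path.toLowerCase(Locale.ROOT);
        return p.endsWith(".gz") || p.endsWith(".bz2")
            || p.endsWith(".xz") || p.endsWith(".zst");
    }

    // Baseline: an uncompressed object is weighted 1:1 with its size in bytes.
    static long weightedSize(String path, long size) {
        if (path != null && looksCompressed(path)) {
            return size * ASSUMED_COMPRESSED_SCALE;
        }
        return size;
    }

    public static void main(String[] args) {
        System.out.println(weightedSize("events.json", 1_000L));    // 1000
        System.out.println(weightedSize("events.json.gz", 1_000L)); // 4000
    }
}
```

        In this sketch the path is used only to detect compression, matching the role of the path parameter, and an input format would typically override the default to apply its own factor.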