Class TableDiskUsage


  • public class TableDiskUsage
    extends Object
    This utility class scans the Accumulo metadata table to compute the disk usage of one or more tables, using the size value stored in columns whose column family is MetadataSchema.TabletsSection.DataFileColumnFamily. It also tracks shared files in order to compute shared usage across all tables in the Set of tables supplied when getting disk usage. Because usage is computed from the metadata table rather than from the actual files in HDFS, the results are an estimate. Older entries may exist with no file metadata (resulting in a size of 0), and other activity in the cluster, such as flushes, tablet splits, and compactions, can affect the estimated size. For more accurate results, first run a compaction on all files for the set of tables being measured.
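
    The shared-usage accounting described above can be illustrated with a minimal, self-contained sketch. It does not use the Accumulo API: the class SharedUsageSketch, the FileEntry type, and the method usageByTableSet are hypothetical names introduced only for this example, and the input list stands in for file entries that would be read from the metadata table. The sketch assumes that when a file is reported more than once, the largest reported size is kept; the actual class may resolve this differently.

```java
import java.util.ArrayList;
import java.util.HashMap;
import java.util.List;
import java.util.Map;
import java.util.SortedSet;
import java.util.TreeSet;

public class SharedUsageSketch {

    /** Hypothetical stand-in for one data-file entry read from the metadata table. */
    static final class FileEntry {
        final String table;
        final String file;
        final long size;

        FileEntry(String table, String file, long size) {
            this.table = table;
            this.file = file;
            this.size = size;
        }
    }

    /**
     * Groups estimated file sizes by the exact set of tables that reference each file.
     * A file referenced by only one table counts toward that table alone; a file
     * referenced by several tables counts once, toward that shared set of tables.
     */
    static Map<SortedSet<String>, Long> usageByTableSet(List<FileEntry> entries) {
        Map<String, SortedSet<String>> tablesByFile = new HashMap<>();
        Map<String, Long> sizeByFile = new HashMap<>();

        for (FileEntry e : entries) {
            tablesByFile.computeIfAbsent(e.file, f -> new TreeSet<>()).add(e.table);
            // Assumption for this sketch: keep the largest size reported for a file.
            sizeByFile.merge(e.file, e.size, Math::max);
        }

        Map<SortedSet<String>, Long> usage = new HashMap<>();
        for (Map.Entry<String, SortedSet<String>> en : tablesByFile.entrySet()) {
            usage.merge(en.getValue(), sizeByFile.get(en.getKey()), Long::sum);
        }
        return usage;
    }

    public static void main(String[] args) {
        List<FileEntry> entries = new ArrayList<>();
        entries.add(new FileEntry("t1", "A.rf", 100));
        entries.add(new FileEntry("t1", "B.rf", 50));
        entries.add(new FileEntry("t2", "B.rf", 50)); // shared with t1, e.g. after a clone
        entries.add(new FileEntry("t2", "C.rf", 30));
        entries.add(new FileEntry("t2", "D.rf", 0));  // older entry with no size metadata

        usageByTableSet(entries).forEach(
            (tables, bytes) -> System.out.println(tables + " -> " + bytes + " bytes"));
    }
}
```

    In this sketch a file shared by t1 and t2 is counted once under the set [t1, t2] rather than twice, and the zero-size entry contributes nothing to the total, which is one reason the result is only an estimate.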