Highly compressible datasets, such as those with repeated string fields, nested structures, or uniform rows, can benefit significantly from physical storage billing. Yet most datasets remain on logical storage by default, even when physical storage would reduce costs.
This inefficiency is common for cold or infrequently updated datasets that are no longer optimized or regularly reviewed. Because storage behavior and data characteristics evolve, failing to periodically reassess the billing model may result in persistent waste.
BigQuery supports two billing models for table storage:
Logical Storage (default): Billed based on the uncompressed size of user data. This model includes time travel and fail-safe storage at no extra cost.
Physical Storage: Billed based on the actual compressed bytes on disk. Time travel and fail-safe storage are charged separately at the same rate as active storage.
While physical storage can significantly reduce costs for highly compressible datasets (e.g., compression savings exceeding \~45%), its higher per-byte cost (\~1.8x) and additional charges for retention-related features may lead to higher total costs for frequently updated or less compressible data.