'BigTable: will column qualifier or timestamp filter cause the full scan?
Say I have a table that contains billions of records.
- On a particular day there are only 7 records though. If I filter it by timestamp range (as described here), will it cause the full table scan?
- Only one row has a column
"col36847629". if I apply a column qualifier filter, will it scan the entire table?
Solution 1:[1]
Any filters on Bigtable reads (besides ones related to the rowkey) will cause a full table scan if not contained by a rowkey or range. The filters are in place to help reduce the amount of data sent over the network for lower network costs and faster throughput, but are not going to reduce the size of the scan.
If this is a common scenario you're facing, you might want to add some date or timestamp information into your rowkey as a way to filter on that and then perform the scans.
Sources
This article follows the attribution requirements of Stack Overflow and is licensed under CC BY-SA 3.0.
Source: Stack Overflow
| Solution | Source |
|---|---|
| Solution 1 | Billy Jacobson |
