Scylla Compaction Fundamentals

Compaction first selects a set of sstables to process, based on the compaction strategy.
It then reads those sstables and writes their contents back out in compacted form, eliminating overwritten, deleted and expired data along the way.
Once the output sstables are sealed and fully written to storage, the input sstables can be deleted.
Only four kinds of data can be eliminated: overwritten data, data expired by TTL, data deleted by a tombstone, and droppable tombstones themselves.
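The steps above can be sketched in a few lines of Python.
This is only an illustration, not Scylla's actual code: the Cell type, the compact function and the gc_grace parameter are names assumed for the example, and it assumes the input sstables are the only ones holding these keys.

```python
from dataclasses import dataclass
from typing import Optional


@dataclass(frozen=True)
class Cell:
    key: str
    value: Optional[str]        # None marks a tombstone (a delete)
    write_ts: int               # write timestamp; the newest write wins
    ttl: Optional[int] = None   # seconds to live; None means no expiry


def compact(sstables, now, gc_grace):
    """Merge input sstables (lists of Cell) into one sorted output list."""
    newest = {}
    for table in sstables:
        for cell in table:
            current = newest.get(cell.key)
            if current is None or cell.write_ts > current.write_ts:
                newest[cell.key] = cell  # the older version is overwritten

    output = []
    for key in sorted(newest):
        cell = newest[key]
        if cell.value is None:
            # A tombstone is droppable only once it is older than gc_grace.
            if now - cell.write_ts > gc_grace:
                continue
        elif cell.ttl is not None and now >= cell.write_ts + cell.ttl:
            continue  # expired by TTL
        output.append(cell)
    return output


old = [Cell("a", "1", 10), Cell("b", "2", 10), Cell("c", "3", 10, ttl=5), Cell("d", "4", 10)]
new = [Cell("a", "9", 20), Cell("b", None, 20), Cell("d", None, 95)]
result = compact([old, new], now=100, gc_grace=50)
print([(c.key, c.value) for c in result])  # [('a', '9'), ('d', None)]
```

Here "a" keeps only its newest value, "b" disappears together with its old tombstone, "c" has expired, and the tombstone for "d" survives because it is still too recent to drop.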

Bloom filters are a read-path optimisation that can give false positives but never false negatives: if the filter says no, the record is definitely not present, but if it says yes, the data does not necessarily exist.
The technique of keeping sorted files and merging them is called a Log-Structured Merge tree (LSM tree).
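To see why a Bloom filter can say yes by mistake but never no by mistake, here is a toy version in Python.
It is a sketch for intuition only: the class name, the sizes and the use of SHA-256 are assumptions made for the example, not how Scylla implements its filters.

```python
import hashlib


class BloomFilter:
    def __init__(self, num_bits=1024, num_hashes=3):
        self.num_bits = num_bits
        self.num_hashes = num_hashes
        self.bits = bytearray(num_bits)  # one byte per bit, for readability

    def _positions(self, key):
        # Derive num_hashes bit positions by salting the key with a seed.
        for seed in range(self.num_hashes):
            digest = hashlib.sha256(f"{seed}:{key}".encode()).digest()
            yield int.from_bytes(digest[:8], "big") % self.num_bits

    def add(self, key):
        for pos in self._positions(key):
            self.bits[pos] = 1

    def might_contain(self, key):
        # False means definitely absent; True only means possibly present.
        return all(self.bits[pos] for pos in self._positions(key))


bf = BloomFilter()
bf.add("user:42")
print(bf.might_contain("user:42"))  # True: an added key is never reported absent

tiny = BloomFilter(num_bits=1, num_hashes=1)
tiny.add("user:42")
print(tiny.might_contain("never-added"))  # True: a false positive
```

The one-bit filter is deliberately too small, so every lookup collides with the single set bit; a bigger filter makes false positives rarer but cannot remove them.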

Size-Tiered Compaction Strategy [STCS] - compaction is triggered per bucket of similarly sized sstables.
Time Window Compaction Strategy [TWCS] - targeted at time-series data; sstables are bucketed by time window, and each bucket is compacted using size-tiered compaction.
Leveled Compaction Strategy [LCS] - enforces strict upper bounds on the number of sstables a read needs to check.
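The size bucketing behind STCS can be sketched as below.
This is a simplified illustration, not Scylla's selection logic: the function names are made up for the example, and the defaults (bucket_low 0.5, bucket_high 1.5, min_threshold 4) are assumed to mirror the STCS options of the same names.

```python
def size_tiered_buckets(sizes, bucket_low=0.5, bucket_high=1.5):
    """Group sstable sizes so each bucket holds similarly sized sstables."""
    buckets = []
    for size in sorted(sizes):
        for bucket in buckets:
            avg = sum(bucket) / len(bucket)
            # An sstable joins a bucket if it is close to the bucket average.
            if avg * bucket_low <= size <= avg * bucket_high:
                bucket.append(size)
                break
        else:
            buckets.append([size])  # no bucket fits, so start a new one
    return buckets


def compaction_candidates(sizes, min_threshold=4):
    """Return the buckets holding enough sstables to be worth compacting."""
    return [b for b in size_tiered_buckets(sizes) if len(b) >= min_threshold]


sizes_mb = [10, 11, 12, 9, 100, 110, 400]
print(size_tiered_buckets(sizes_mb))    # [[9, 10, 11, 12], [100, 110], [400]]
print(compaction_candidates(sizes_mb))  # [[9, 10, 11, 12]]
```

Only the bucket of four small sstables reaches the threshold, so that is the set that would be compacted into one larger sstable.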

Day 12/100