Sampling with pt-table-checksum

A long while back, pt-table-checksum supported “–probability” and “–modulo” options that let you select a sampling of rows. This is quite useful for large tables. What options do we have to do something similar today (aside from --where )?


It was somewhat trickier and probably much slower, but using a --where clause on an indexed field had similar functionality.