pt-slave-restart -- what exactly does it do?

davidjhall · June 22, 2012, 8:45am

We have a slave server that, during time of a large report, queries coming in from the master start to time out (1205 error).

I want to use pt-slave-restart to keep attempting “START SLAVE” until the error clears, not skip. I’m not clear from the documentation how slave restart acts.

From the documentation below, what happens exactly when there is an error?

pt-slave-restart sleeps intelligently between polling the slave. The current sleep time varies.

The initial sleep time is given by --sleep.
If it checks and finds an error, it halves the previous sleep time.
If it finds no error, it doubles the previous sleep time.
The sleep time is bounded below by --min-sleep and above by --max-sleep.
Immediately after finding an error, pt-slave-restart assumes another error is very likely to happen next, so it sleeps the current sleep time or the initial sleep time, whichever is less.

[This says it keeps sleeping but doesn’t say "Upon finding an error, pt-slave-restart attempts to skip 1 error and then try again | pt-slave-restart attempts to start the slave again, and if fails X times, it will attempt a skip "]

Thanks
-Dave

xaprb · June 23, 2012, 8:13am

The behavior is rather complex depending on what kind of error has been encountered. In some cases it skips and starts; other times it just issues START SLAVE.

Topic		Replies	Views
Pt-slave-repair is a supplement to the original pt-slave-restart tool Percona Toolkit percona , closed-no-reply	0	617	November 3, 2023
Automatic query restart when certain error happens in replication Other MySQL® Questions	1	560	July 10, 2014
Will pt-slave-restart support the multi-master replication in 5.6? Percona Toolkit	1	495	January 28, 2014
What does pt-table-checksum actually DO? Percona Toolkit	2	651	November 16, 2015
pt-table-checksum can't complete Percona Toolkit	5	2242	April 30, 2015

pt-slave-restart -- what exactly does it do?

Related topics