Pt-archiver insert fails on Duplicate Key after exiting and resuming

sydneyos · May 20, 2021, 3:35am

I have a pt-archiver job that runs on a cron schedule. It is meant to copy a subset of ID columns from table A and insert into table B, then delete from table A (pretty standard, I think). However, Both table A and B have a UNIQUE Key on the primary identifier (primary key is an auto-id; not talking about that). If the job ends or fails, the next run will fail early with a Duplicate Key error as it seems to be trying to reprocess the last batch where INSERTS into table B were already done but, apparently, deletes from table A were not.

What is the best practice here?

Thanks!

matthewb · May 20, 2021, 1:07pm

I would try using --replace to convert INSERT’s to REPLACE, to help avoid the dup key issue, or use --commit-each which should make the SELECT, INSERT, DELETE procedure a single transaction.

sydneyos · May 20, 2021, 6:30pm

Ah, thank you. I think --ignore might actually be what I need here, but I missed those. Thanks for pointing me in the right direction!

Topic		Replies	Views
pt-archiver errors with unique key violation Percona Toolkit	0	548	February 13, 2013
mk(pt)-archiver and handling Duplicate entry errors Other Tools	2	1573	January 17, 2012
Could pt-archiver use a non-unique key as chunk index? Percona Toolkit	2	1293	January 12, 2024
pt-table-sync => Duplicate entry '2876' for key 'PRIMARY' Percona Toolkit	7	1001	November 11, 2015
Pt-online-schema-change: Alternatives to "REPLACE INTO" Percona Server for MySQL 5.7	3	1073	September 9, 2023

Pt-archiver insert fails on Duplicate Key after exiting and resuming

Related topics