Running pt-osc on RDS instance to alter a table with 1.9 billion records

I have an RDS instance running MySQL 5.5.46 with a table whose primary key is an INT. The table is currently at 1.9 billion records, approaching the ~2.1 billion signed INT limit, and is ~425GB in size. I'm attempting to use pt-online-schema-change to alter the column to BIGINT.
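For reference, the invocation is roughly the following (endpoint, schema, table, and user are placeholders for the real ones; I pass `--recursion-method=none` since there are no slaves to check for lag):

```shell
# Placeholder host/schema/table/user; prompts for the password.
pt-online-schema-change \
  --alter "MODIFY id BIGINT NOT NULL AUTO_INCREMENT" \
  --recursion-method=none \
  --ask-pass \
  --execute \
  h=myinstance.xxxxxxxx.us-east-1.rds.amazonaws.com,P=3306,D=mydb,t=mytable,u=admin
```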

I was able to successfully test the change on a test server (m3.2xlarge) and, while it took about 7 days to complete, it did finish successfully. This test server was under no additional load. (Side note: 7 days seemed like a LONG time).

For the production environment, there is no replication/slave present (but there is Multi-AZ) and, to help with resource contention and speed things up, I’m using an r3.8xlarge instance type.

After two attempts, the production migration would get to about 50% with roughly one day remaining, and then RDS would seemingly stop accepting connections, forcing pt-osc to roll back or fail outright both times because the instance needed to be rebooted.

I don’t see anything obvious in the RDS console or logs to help indicate why this happened, and I feel like the instance type should be able to handle a lot of connections/load.

Looking at the CloudWatch metrics during my (now third) attempt, the database server itself doesn't seem to be under much load: ~5% CPU, 59 DB connections, 45GB freeable memory, and 2200-2500 write IOPS.
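In case it's relevant, this is how I'm watching the live connection count against the configured ceiling during this attempt (endpoint and user are placeholders; on RDS, max_connections defaults to a formula based on instance memory and can be raised via the DB parameter group):

```shell
# Compare current connections to the instance's configured limit.
# Placeholder endpoint/user; prompts for the password.
mysql -h myinstance.xxxxxxxx.us-east-1.rds.amazonaws.com -u admin -p -e "
  SHOW GLOBAL STATUS    LIKE 'Threads_connected';
  SHOW GLOBAL VARIABLES LIKE 'max_connections';"
```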

Wondering if anyone has run into this situation and, if so, what helped with the connection issue?

If anyone has suggestions on how to speed up the process in general, I'd love to hear them. I was considering trying a larger chunk size and running during off-hours, but wasn't sure how that would end up affecting the application.
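For context, these are the throttling-related knobs I'd be adjusting; the values below are illustrative, not tested. pt-osc defaults to `--chunk-size 1000` and throttles the copy based on `Threads_running` via `--max-load` (default 25) and `--critical-load` (default 50):

```shell
# Illustrative values only: larger chunks mean fewer round trips,
# but each copy transaction holds locks longer and writes more binlog.
pt-online-schema-change \
  --chunk-size 5000 \
  --max-load Threads_running=50 \
  --critical-load Threads_running=100 \
  --alter "MODIFY id BIGINT NOT NULL AUTO_INCREMENT" \
  --execute \
  h=myinstance.xxxxxxxx.us-east-1.rds.amazonaws.com,P=3306,D=mydb,t=mytable,u=admin
```

With `--max-load`, pt-osc pauses copying while the threshold is exceeded and resumes when load drops; `--critical-load` aborts the tool entirely, so I'd rather trip the former than the latter.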