I cannot reproduce the issue.
I imported the data. Here are some rows, including one of the conflicting ones.
mysql> select * from test.log\G
*************************** 1. row ***************************
UID: NULL
numb1: 3533
numb2: 3533
created_at: 2009-10-17 21:58:00
updated_at: 2009-10-17 21:58:00
state: received
field1: something13
content: ......p.....!..........m^..t...J........Z..*.{......{.....K5./#.I.%......~|q......#.k..w...x...&l..w..2.._..pH1.V0......h.Iz.i..I.B......h.
id: 28
*************************** 2. row ***************************
UID: NULL
numb1: 123456
numb2: 3533
created_at: 2009-10-17 21:58:00
updated_at: 2009-10-17 21:58:00
state: received
field1: something1
content: ....../.F<.;
id: 31
*************************** 3. row ***************************
UID: NULL
numb1: 789012
numb2: 3533
created_at: 2009-10-17 23:28:00
updated_at: 2009-10-17 22:41:00
state: received
field1: something2
content: ⱳ⬘ᦢ
id: 32
*************************** 4. row ***************************
Then I ran pt-archiver as follows:
bin/pt-archiver --source h=127.0.0.1,P=12345,u=msandbox,p=msandbox,D=test,t=log --charset=utf8 --file a.txt --where "1=1" --progress 10000 --statistics --limit 1000 --txn-size 500
TIME ELAPSED COUNT
2018-08-23T08:54:43 0 0
2018-08-23T08:54:43 0 13
Started at 2018-08-23T08:54:43, ended at 2018-08-23T08:54:43
Source: A=utf8,D=test,P=12345,h=127.0.0.1,p=...,t=log,u=msandbox
SELECT 13
INSERT 0
DELETE 13
Action Count Time Pct
deleting 13 0.0020 53.79
select 2 0.0005 13.18
commit 1 0.0001 2.93
print_file 13 0.0001 2.73
other 0 0.0010 27.37
and the output file has:
cat a.txt
\N 3533 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something13 ......p.....!..........m^..t...J........Z..*.{......{.....K5./#.I.%......~|q......#.k..w...x...&l..w..2.._..pH1.V043....h.Iz.i..I.B......h.
\N 463456 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something1 ....../.F<.;
\N 479012 3533 2009-10-17 23:28:00 2009-10-17 22:41:00 received something2 ⱳ⬘ᦢ
\N 4833 3533 2009-10-17 21:58:00 2010-10-17 00:41:00 received something3 ???
\N 1111 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something4 ......p.....!.....v..$0......B.Q....p.....u>..7.l.A.l.sKIf.....-I.bQ...G.&....T.M..8j.=....c....*.....*..C..m.KL.J49{r.D'..J8L:.....L.*...F.
\N 22222 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something5 ......p.....!........).H..8....I....1.....t....[....l...4_.Bs3........-...I.B.U6.#_...k9...2..09.....B4)X?...ylw.H50....a..>.c(..:.)z.='5...
\N 5144 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something6 ...c.........g......
\N 52555 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something7 ...}..#.Q..0
\N 9990 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something8 ...c..p.....!..........N.P...v...>.....Cb...2M..w)....a...m...-.|D.M5.Y.H......{.nZN.|....k.0..u..R%3.)...q.5.W.y.53.+p...y....N.Vv+.7'..=F.
\N 5433 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something9 ...o....)..H
\N 5533 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something10 ........sH(|
\N 5633 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something11 ............
\N 3533 3533 2009-10-17 21:58:00 2009-10-17 21:58:00 received something12 "...}..p.....!.....?-......GP..H.+.""..v.a.Kc..t_.X.-~/H.....L.TU.}..#......Q....-..t...;..X+...../....I.....u.;f.57Z.mo.f.......Q.....>.V...."
Could you run a quick test for me?
Could you add this under the
use strict;
line? (~ line 51)
use utf8;
And run the program again?
Regards