processing large result sets...

Hello,

I have a large table with about half a million rows (potentially a few million) - I need to make a snapshot of the data at a specific moment and process it for export from the database (into an XML file, but that’s not that important)…

I have some ideas on how to process it, but each has its drawbacks (I’m using PHP to process the data, if that is of any importance)…

  • one could just do a ‘select *’ on this table and then process it one row at a time at the application level - the problem is that the data might grow so large that the result set no longer fits in the available RAM (see the first sketch after this list)…

  • one could ‘select id from…’ and then process it one id at a time, fetching the rows and then processing them… but that would require me to first make a copy of the table, so it doesn’t change during the whole operation… and, theoretically, it still doesn’t stop the script from taking too much memory if there are enough rows (maybe not a few million, but it’s certainly possible)…

  • I was also thinking along the lines of processing it part by part using ‘select … limit …’, which might (I’m not sure) let me process arbitrarily large result sets, but I don’t think it’s a wise idea from a performance point of view… of course, I would still need a copy of the table first in this case (see the second sketch after this list)…
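
For the first option, as far as I understand it, the memory problem comes from the PHP client buffering the whole result set by default; an unbuffered query hands rows over one at a time instead, at the cost of keeping the connection busy until the last row is read. A rough sketch of what I mean (the connection details and the table name snapshot_data are just placeholders):

[code]
<?php
// Rough sketch: stream rows without buffering the whole result set in PHP.
// Host, credentials and the table `snapshot_data` are placeholders.
$mysqli = new mysqli('localhost', 'user', 'password', 'mydb');
if ($mysqli->connect_errno) {
    die('Connect failed: ' . $mysqli->connect_error);
}

// MYSQLI_USE_RESULT asks the driver not to buffer the result set client-side,
// so memory use stays roughly constant no matter how many rows there are.
$result = $mysqli->query('SELECT * FROM snapshot_data', MYSQLI_USE_RESULT);

while ($row = $result->fetch_assoc()) {
    // process / export a single row here
}

$result->free();
$mysqli->close();
[/code]

The obvious drawback is that nothing else can run on that connection until every row has been fetched, so it would only fit a dedicated export connection.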
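
For the limit idea, my worry is that a plain ‘limit offset, count’ gets slower and slower as the offset grows, because MySQL still has to walk past all the skipped rows; paging by the primary key instead should keep each chunk cheap. Another rough sketch (again, the table and column names are made up):

[code]
<?php
// Rough sketch: fetch fixed-size chunks ordered by the primary key,
// remembering the last id seen instead of using a growing OFFSET.
// Assumes an integer primary key `id` on the placeholder table `snapshot_data`.
$mysqli = new mysqli('localhost', 'user', 'password', 'mydb');

$lastId = 0;
$chunk  = 10000;

do {
    $sql = sprintf(
        'SELECT * FROM snapshot_data WHERE id > %d ORDER BY id LIMIT %d',
        $lastId,
        $chunk
    );
    $result = $mysqli->query($sql);

    $rows = 0;
    while ($row = $result->fetch_assoc()) {
        // process / export a single row here
        $lastId = (int) $row['id'];
        $rows++;
    }
    $result->free();
} while ($rows === $chunk);

$mysqli->close();
[/code]

This still doesn’t give me a consistent snapshot by itself, so the copy-the-table-first step would remain.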

Any other ideas? Maybe some obvious way I’m somehow missing? Some standard way of processing large amounts of data from a MySQL database?

Thanks in advance…

How often do you need to do such database exports? If not very often, then if MySQL is on an LVM partition, you can take an LVM snapshot, mount the snapshot, copy the data to another server, start MySQL with the data directory from this snapshot, and export it to an XML file.

[B]debug wrote on Mon, 28 April 2008 13:47[/B]
How often do you need to do such database exports? If not very often, then if MySQL is on an LVM partition, you can take an LVM snapshot, mount the snapshot, copy the data to another server, start MySQL with the data directory from this snapshot, and export it to an XML file.

Thanks for your answer - the exports will need to be done once or twice a day…

As for the LVM option, I’d have to think about that - the database is currently on a regular filesystem, so that would require some shuffling about… But this only solves the “snapshot” part of the problem - the other part of my question still remains unanswered: what is the best way to process and export a large result set (possibly exceeding available memory) from a database to an XML file (or some other format)?
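
One direction I’m toying with for the export side is to combine an unbuffered query with XMLWriter, which writes the XML incrementally to a file instead of building the whole document in memory - a rough sketch only, with placeholder names:

[code]
<?php
// Rough sketch: stream rows straight into an XML file with XMLWriter, so
// neither the result set nor the XML document is held fully in memory.
// Connection details, the table `snapshot_data` and the output path are
// placeholders; column names are assumed to be valid XML element names.
$mysqli = new mysqli('localhost', 'user', 'password', 'mydb');

$xml = new XMLWriter();
$xml->openURI('/tmp/export.xml');      // write directly to disk
$xml->startDocument('1.0', 'UTF-8');
$xml->startElement('rows');

$result = $mysqli->query('SELECT * FROM snapshot_data', MYSQLI_USE_RESULT);
while ($row = $result->fetch_assoc()) {
    $xml->startElement('row');
    foreach ($row as $column => $value) {
        $xml->writeElement($column, (string) $value);
    }
    $xml->endElement();                // </row>
    $xml->flush();                     // push buffered output out to the file
}
$result->free();

$xml->endElement();                    // </rows>
$xml->endDocument();
$xml->flush();
$mysqli->close();
[/code]

Whether that performs acceptably on a few million rows I don’t know yet, and it still leaves the snapshot/consistency question open.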