Hi, I didn’t really find similar cases:
I have 2 tables (in 2 different databases, if that makes a difference).
These tables have an equal number of fields, named the same, eg:
db1.table1:
id, name, info1, info2
db2.table1:
id, name, info1, info2
I need to match every field of every column with the other’s table column.
I need to find out which field is equal to which and if all fields of one row are NOT equal to any another row, then i display it. id is the primary key and auto incremented. Note that, I only want to know the difference against table1 from table2, so LEFT joining is ok (instead of some OUTER join)
So, i’m doing that:
SELECT db1.table1.id as cid FROM db1.table1 LEFT JOIN db2.table1 ON (db1.table1.name = db2.table1.name) AND (db1.table1.info1 = db2.table1.info1) AND (db1.table1.info2 = db2.table1.info2) WHERE db2.table1.id IS NULL
As you can see its a bit complex already.
Over very large tables however this query takes forever.
Of course, by nature, I have to compare each field’s row to every single other field’s row to determine is this very row already exist in the other table, so its quite resource consuming.
Anyway, the performance of this is terrible, any idea how to make it faster? (ps entries are inserted once, then only selected so indexs are ok)
I made a test with 150 000 rows, it took so long i had to cancel it. It’s pretty fast over 30-50 rows