Getting a Cross-Section from Two CSV Files

Posted by Jonathan Sampson on Super User
Published on 2010-05-17T18:24:41Z Indexed on 2010/05/17 18:31 UTC

I have two CSV files that I am working with. One is massive, with about 200,000 rows; the other is much smaller, with about 12,000 rows. Both use the same format: names and email addresses (everything is legit here, no worries). Basically, I'm trying to keep only a subset of the second list by removing every entry that already exists in the larger file.

So, List A has ~200k rows and List B has ~12k. The lists overlap a bit, and I'd like to remove every entry from List B that also exists in List A, leaving only the entries that are new and unique to List B. I've got a few tools at my disposal: OpenOffice.org is loaded on this machine, along with MySQL (queries are alright).

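What's the easiest way to create a third CSV containing only the List B entries that do not appear in List A (the difference of the two lists, rather than their intersection)?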
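To make the operation concrete, here is a minimal sketch in Python of keeping only the List B rows whose email address is absent from List A. Python is not one of the tools mentioned above, so treat it as one possible route rather than the expected one. Several things here are assumptions: the function names are made up for illustration, the email address is taken to be the second column, neither file has a header row, and addresses are compared case-insensitively after trimming whitespace.

```python
import csv


def normalize(email):
    """Compare emails case-insensitively, ignoring surrounding whitespace (an assumption)."""
    return email.strip().lower()


def rows_only_in_b(rows_a, rows_b, email_col=1):
    """Return the rows of rows_b whose email does not occur anywhere in rows_a."""
    # Build a set of List A emails once, so each List B lookup is constant time.
    seen = {normalize(row[email_col]) for row in rows_a if len(row) > email_col}
    return [
        row for row in rows_b
        if len(row) > email_col and normalize(row[email_col]) not in seen
    ]


def write_difference(path_a, path_b, path_out, email_col=1):
    """Read two CSV files and write the rows unique to the second one to path_out."""
    with open(path_a, newline="") as fa, open(path_b, newline="") as fb:
        unique = rows_only_in_b(list(csv.reader(fa)), list(csv.reader(fb)), email_col)
    with open(path_out, "w", newline="") as fo:
        csv.writer(fo).writerows(unique)
    return len(unique)
```

Calling `write_difference("list_a.csv", "list_b.csv", "list_b_unique.csv")` (hypothetical file names) would write the third CSV. If both files start with the same header row, that row would be treated as a match and dropped from the output, so it would need to be skipped and rewritten separately. Rows too short to contain an email column are ignored.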

© Super User or respective owner