Getting a Cross-Section from Two CSV Files
Posted
by Jonathan Sampson
on Super User
See other posts from Super User
or by Jonathan Sampson
Published on 2010-05-17T18:24:41Z
Indexed on
2010/05/17
18:31 UTC
Read the original article
Hit count: 200
I have two CSV files that I am working with. One is massive, with about 200,000 rows. The other is much smaller, having about 12,000 rows. Both fit the same format of names, and email addresses (everything is legit here, no worries). Basically I'm trying to get only a subset of the second list by removing all values that presently exist in the larger file.
So, List A has ~200k rows, and List B has ~12k. These lists overlap a bit, and I'd like to remove all entries from List B if they also exist in List A, leaving me with new and unique values only in List B. I've got a few tooks at my disposal that I can use. Open Office is loaded on this machine, along with MySQL (queries are alright).
What's the easiest way to create a third CSV with the intersection of data?
© Super User or respective owner