Hello!
I've been tasked with cleaning up the data in our database by identifying and deleting duplicates.
My Excel file is a list of 1,032,153 rows with 10 columns. But it's not so simple as using the Remove Duplicates function.
Here's what I need to do. Refer to the format example below:
I've started by sorting Date Added (oldest to newest). The earliest record is the original record.
- I need to identify all duplicates by matching Property Address (J) and THEN by matching Last Name (G). Some records have one duplicate, some may have up to 4 or 5. Rather than delete these duplicates, I need to move them to another sheet (Sheet 2).
- So ideally, what I'm left with is Sheet 1 with all UNIQUE records (no records w/ matching property address AND name). From there, I need to run a pivot that would tell me that for each of the unique records in Sheet 1, how many matching records (same Property Address AND Last Name) on Sheet 2 are there?
I've been trying for a while now and I just can't seem to get it.
Assistance would be GREATLY appreciated!
Let me know if you have any additional questions, thanks so much!
De-Dupe Example.png
Bookmarks