Rule for removing duplicates in data

**birdie27** · 06-10-2014, 05:41 AM

I'm trying to find out the rule Excel uses when de-duplicating data. I am removing duplicates based on an identification number in a data set of about 6000 records, duplicates included (some records appear about 4 times). Due to the nature of the data I'm working with, only a handful of records are "true" duplicates: some records appear 4 times but differ in location, etc., while others are true duplicates with no differences at all.
I need to know how Excel removes duplicates. Does it only keep the first row that it finds for each identification number?
Also, is there a way to create a rule so that it keeps, for example, the record with the highest rate?
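
For reference, Microsoft's documentation for Remove Duplicates says the first occurrence of a value is kept and later identical values are deleted. A minimal Python sketch of that "keep first" rule follows. It only models the behaviour and is not Excel's implementation; the column names `id`, `location` and `rate` and the sample rows are assumptions, since the attached file is not shown.

```python
def keep_first(records, key):
    """Keep only the first record seen for each value of `key`, in original order."""
    seen = set()
    kept = []
    for rec in records:
        k = rec[key]
        if k not in seen:
            seen.add(k)
            kept.append(rec)
    return kept


# Hypothetical sample rows (column names are assumptions, not from the thread's file).
first_rows = [
    {"id": 101, "location": "North", "rate": 12.5},
    {"id": 101, "location": "South", "rate": 15.0},
    {"id": 102, "location": "East", "rate": 9.0},
    {"id": 101, "location": "North", "rate": 12.5},
]

first_only = keep_first(first_rows, "id")
for rec in first_only:
    print(rec)
```

With this rule, which row survives depends only on row order, not on the rate or any other column.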

**oeldere** · 06-10-2014, 05:47 AM

Post a (small) Excel file, without confidential information.

Please also post the desired result.

I am thinking of joining the columns together and then finding the duplicated rows.
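
The idea of combining the columns to find rows that are identical in every field can be sketched in Python roughly as below. This is an illustration under assumptions, not the formula used in the thread: the column names and sample rows are hypothetical, and a tuple is used as the combined key instead of a joined string so that values such as "ab" + "c" and "a" + "bc" cannot collide.

```python
def flag_true_duplicates(records, columns):
    """Return one flag per record: True if an identical row (over `columns`) came earlier."""
    seen = set()
    flags = []
    for rec in records:
        combined = tuple(rec[c] for c in columns)  # composite key over all compared columns
        flags.append(combined in seen)
        seen.add(combined)
    return flags


# Hypothetical sample rows (column names are assumptions).
dup_rows = [
    {"id": 101, "location": "North", "rate": 12.5},
    {"id": 101, "location": "South", "rate": 15.0},
    {"id": 102, "location": "East", "rate": 9.0},
    {"id": 101, "location": "North", "rate": 12.5},
]

dup_flags = flag_true_duplicates(dup_rows, ["id", "location", "rate"])
print(dup_flags)
```

Only the last row is flagged here, because it matches the first row in every column; the second row shares the ID but differs in location and rate.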

**birdie27** · 06-10-2014, 05:59 AM

I have attached an example file. You will see that one record is a duplicate with all the info the same, and the others are duplicates with variances. I have not attached an example of the desired result, because what I want is to be able to stipulate a rule for which duplicates to remove: for example, remove duplicates based on identification number but keep the row with the highest rate.

I hope that makes sense.
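
A rule like "one row per identification number, keeping the highest rate" can be sketched in Python as follows. This is a minimal sketch under assumptions: the column names `id` and `rate` and the sample rows are hypothetical, and ties are resolved in favour of the row that appears first.

```python
def keep_highest(records, key, value):
    """Keep one record per `key`: the one with the largest `value` (first one wins ties)."""
    best = {}
    for rec in records:
        k = rec[key]
        # Replace the stored record only when the new one is strictly higher.
        if k not in best or rec[value] > best[k][value]:
            best[k] = rec
    return list(best.values())


# Hypothetical sample rows (column names are assumptions).
rate_rows = [
    {"id": 101, "location": "North", "rate": 12.5},
    {"id": 101, "location": "South", "rate": 15.0},
    {"id": 102, "location": "East", "rate": 9.0},
    {"id": 101, "location": "North", "rate": 12.5},
]

highest = keep_highest(rate_rows, "id", "rate")
for rec in highest:
    print(rec)
```

In Excel itself, a commonly suggested equivalent is to sort the data by rate in descending order first and then run Remove Duplicates on the identification column, which relies on the first occurrence being the one that is kept.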

**oeldere** · 06-10-2014, 06:09 AM

The yellow cell is the duplicated one.

f2= [formula not shown]

g2= [formula not shown]
