Remove duplicate rows by matching values across columns in Excel 2007

**guest2013** · 08-14-2013, 04:20 AM

Hi,
I have a large text dataset and I'm trying to find the co-occurrence of words. Alternative combinations of words are in two different columns. For instance, column A has the co-word "apple_orange" and column B of the same row has its flipped co-word "orange_apple"; the two are equivalent. This also means that all the values in column A are present somewhere in column B and vice versa, but in different rows. For instance, consider the following:

Col A Col B
apple_fruit fruit_apple
apple_mango mango_apple
apple_orange orange_apple
fruit_apple apple_fruit
juice_mango mango_juice
juice_orange orange_juice
mango_apple apple_mango
mango_juice juice_mango
orange_apple apple_orange
orange_juice juice_orange

I need to accurately identify and remove all the duplicate rows, where the duplicate of a value in column A resides in column B and vice versa. This means that half of the rows in the matrix have to be removed, but the challenge is how to identify which rows to remove. For instance, in the above example, rows 1 and 4 are duplicates of each other, as are rows 2 and 7, and so forth; one row from each such pair needs to be removed.

I have tried different formulae and techniques but failed. Any help would be highly appreciated.

Best regards,
guest2013

**ramananhrm** · 08-14-2013, 04:55 AM

Please try this file

**Ursul** · 08-14-2013, 05:07 AM

Hi,

Have a look at the attachment with various Conditional Formatting options for finding duplicate data.

Cheers

Duplicates.xlsx

**guest2013** · 08-14-2013, 05:33 AM

Originally Posted by ramananhrm

Please try this file

Thanks for your response. To clarify the problem further, I need to identify and remove the duplicate rows. The duplicate of a value in column A is in column B. This means that half of the matrix has to be removed.

The colour-coded file is attached. In this example, I need to find an efficient and accurate way to remove rows 9-15 because their values are present across the columns.

Best regards,
guest2013

**guest2013** · 08-15-2013, 12:00 AM

This problem has been solved here.
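
The linked solution is not reproduced in this thread. For anyone landing here, the following is only a minimal sketch of one possible approach, written in Python rather than Excel, and is not necessarily what the linked answer does. It assumes the two columns have already been read into a list of (column A, column B) pairs; the function name `drop_mirrored_rows` is made up for illustration. The idea is that a row and its mirrored row contain the same two values, so an order-independent key is identical for both, and only the first row seen for each key is kept.

```python
def drop_mirrored_rows(rows):
    """Keep the first row of each mirrored pair.

    A row (a, b) and its mirror (b, a) share the same unordered pair
    of values, so a frozenset of the two cells works as a common key.
    """
    seen = set()
    kept = []
    for col_a, col_b in rows:
        key = frozenset((col_a, col_b))  # same key for (a, b) and (b, a)
        if key not in seen:
            seen.add(key)
            kept.append((col_a, col_b))
    return kept


# Sample data from the first post (columns A and B).
rows = [
    ("apple_fruit", "fruit_apple"),
    ("apple_mango", "mango_apple"),
    ("apple_orange", "orange_apple"),
    ("fruit_apple", "apple_fruit"),
    ("juice_mango", "mango_juice"),
    ("juice_orange", "orange_juice"),
    ("mango_apple", "apple_mango"),
    ("mango_juice", "juice_mango"),
    ("orange_apple", "apple_orange"),
    ("orange_juice", "juice_orange"),
]

unique_rows = drop_mirrored_rows(rows)
for col_a, col_b in unique_rows:
    print(col_a, col_b)
```

On the sample data this keeps five of the ten rows (original rows 1, 2, 3, 5 and 6). The same idea could probably be tried inside Excel 2007 with a helper column that joins the two cells in a fixed order (for example, alphabetical), followed by Data > Remove Duplicates on that helper column, but that is untested here.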
