+ Reply to Thread
Results 1 to 5 of 5

Remove duplicate rows by matching values across columns in Excel 2007

  1. #1
    Registered User
    Join Date
    08-14-2013
    Location
    Pakistan
    MS-Off Ver
    Excel 2016
    Posts
    32

    Remove duplicate rows by matching values across columns in Excel 2007

    Hi,
    I have a large dataset of text and I'm trying to find the co-occurance of words. Alterantive combinations of words are in two different columns. For instance, the column A has a co-word "apple_orange" and the column B of the same row has its flip co-word "organge_apple" and both are equivalent. This also means that all the values in column A are present somewhere in Column B and vice versa, but in different rows. For instance, consider the following:

    Col A Col B
    apple_fruit fruit_apple
    apple_mango mango_apple
    apple_orange orange_apple
    fruit_apple apple_fruit
    juice_mango mango_juice
    juice_orange orange_juice
    mango_apple apple_mango
    mango_juice juice_mango
    orange_apple apple_orange
    orange_juice juice_orange

    I need to accurately identify and remove all the duplicate rows, whereby the duplicates of Column A reside in Column B and vice versa. This means that half of the rows in the matrix have to be removed but the challenge is how to identify which rows to be removed. For instance, in the above example, rows 1 and 4 are duplicates, rows 2 and 7 are duplicates, and so forth, and need to be removed.

    I have tried different formulae and techniques but failed. Any help would be highly appreciated.

    Best regards,
    guest2013
    Last edited by guest2013; 08-14-2013 at 07:59 AM.

  2. #2
    Valued Forum Contributor
    Join Date
    09-15-2011
    Location
    Chennai, India
    MS-Off Ver
    Excel 2010
    Posts
    436

    Re: Remove duplicate rows by matching values across columns in Excel 2007

    Please try this file
    Attached Files Attached Files
    Please click 'Add reputation', if my answer helped you.

  3. #3
    Forum Contributor
    Join Date
    08-14-2013
    Location
    Here and there
    MS-Off Ver
    Excel 2010
    Posts
    376

    Re: Remove duplicate rows by matching values across columns in Excel 2007

    Hi,

    Have a look at the attachment with various Conditional Formatting options for finding duplicate data.

    CheersDuplicates.xlsx

  4. #4
    Registered User
    Join Date
    08-14-2013
    Location
    Pakistan
    MS-Off Ver
    Excel 2016
    Posts
    32

    Re: Remove duplicate rows by matching values across columns in Excel 2007

    Quote Originally Posted by ramananhrm View Post
    Please try this file
    Thanks for your response. To clarify the problem further, I need to identify and remove the duplicate rows. The duplicate of a value in column A is in column B. This means that half of the matrix has to be removed.

    The colour-coded file is attached. In this example, I need to find an efficient and accurate way to remove rows 9-15 because their values are present across the columns.

    Best regards,
    guest2013
    Attached Files Attached Files

  5. #5
    Registered User
    Join Date
    08-14-2013
    Location
    Pakistan
    MS-Off Ver
    Excel 2016
    Posts
    32

    Re: Remove duplicate rows by matching values across columns in Excel 2007

    This problem has been solved here.

+ Reply to Thread

Thread Information

Users Browsing this Thread

There are currently 1 users browsing this thread. (0 members and 1 guests)

Similar Threads

  1. excel macro to remove specific columns and rows + remove duplicate
    By garrywelson in forum Excel Programming / VBA / Macros
    Replies: 12
    Last Post: 01-17-2013, 12:03 PM
  2. [SOLVED] How to remove duplicate rows only if columns A and B are BOTH duplicates in Excel 2003
    By Benisato in forum Excel Programming / VBA / Macros
    Replies: 4
    Last Post: 06-22-2012, 03:53 AM
  3. [SOLVED] Finding Duplicate and Matching Values in Rows
    By fitkhan in forum Excel General
    Replies: 2
    Last Post: 04-05-2012, 09:07 PM
  4. Replies: 3
    Last Post: 11-24-2011, 09:55 AM
  5. Remove Duplicate Rows in Excel 2007
    By ExcelTip in forum Tips and Tutorials
    Replies: 0
    Last Post: 11-19-2007, 12:10 PM

Bookmarks

Posting Permissions

  • You may not post new threads
  • You may not post replies
  • You may not post attachments
  • You may not edit your posts

Search Engine Friendly URLs by vBSEO 3.6.0 RC 1