Deleting "non identical" duplicates

  1. #1
    Registered User
    Join Date
    03-29-2017
    Location
    Sydney
    MS-Off Ver
    Excel 2016 for Mac
    Posts
    7

    Deleting "non identical" duplicates

    I have an Excel worksheet with several hundred thousand "wisdom" quotations in it. The quotation itself and the name of the author are both contained in the same cell.

    I've removed all duplicate cells. My challenge now is that there are many occurrences of the same quote that differ only slightly: an extra space somewhere, say, or a hyphen in a different place.

    To save the time-consuming process of sorting and manually deleting, is there some way I can automate a process that finds all records (cells) where there is, say, a 95% match, and then automatically deletes the "near duplicate" cell with the fewest characters? Maybe I'm being hopeful.

    Thanks for any help with this query.

  2. #2
    Forum Moderator AliGW's Avatar
    Join Date
    08-10-2013
    Location
    Retired in Ipswich, Suffolk, but grew up in Sawley, Derbyshire (England)
    MS-Off Ver
    MS 365 Subscription Insider Beta Channel v. 2404 (Windows 11 22H2 64-bit)
    Posts
    80,432

    Re: Deleting "non identical" duplicates

    Welcome to the forums!

    I shall be flabbergasted, but also full of respect, if anyone can provide a macro to do this for you. I'm not a VBA expert, but I can't help thinking that what you are describing will require some fairly complex programming. Hopefully I'm wrong. Good luck with this; I'll be watching the thread with interest.
    Ali

  3. #3
    Administrator FDibbins's Avatar
    Join Date
    12-29-2011
    Location
    Duncansville, PA USA
    MS-Off Ver
    Excel 7/10/13/16/365 (PC ver 2310)
    Posts
    52,926

    Re: Deleting "non identical" duplicates

    Perhaps a start would be to use Find/Replace to remove all hyphens and double spaces (or TRIM in a helper column), then run Remove Duplicates again?
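
    If working outside Excel is an option, the same idea can be sketched in Python. This is only a rough illustration, assuming the cells have been exported to a plain list of strings; the function names and the sample quotes are made up for the example. It builds a comparison key by stripping hyphens and collapsing runs of whitespace (lowercasing as well, which is an extra assumption), then keeps the longest original cell for each key, so the variant with the fewest characters is the one dropped. It only catches differences in hyphens, spacing and case; it is not a true "95% match".

```python
import re


def normalize(text: str) -> str:
    """Build a comparison key: drop hyphens, collapse whitespace, trim, lowercase."""
    no_hyphens = text.replace("-", "")
    collapsed = re.sub(r"\s+", " ", no_hyphens).strip()
    return collapsed.lower()  # lowercasing is an assumption, not part of the suggestion


def dedupe_keep_longest(cells):
    """Keep one cell per normalized key, preferring the longest original text."""
    best = {}
    for cell in cells:
        key = normalize(cell)
        # On a tie in length, the first cell seen is kept.
        if key not in best or len(cell) > len(best[key]):
            best[key] = cell
    return list(best.values())


# Made-up sample cells (quote and author in the same cell, as in the worksheet).
quotes = [
    "A stitch in time saves nine. -Anon",
    "A stitch in time saves nine. - Anon",
    "Look before you leap. - Anon",
]

result = dedupe_keep_longest(quotes)
for cell in result:
    print(cell)
```

    The original cells are never altered; only the key used for comparison is normalized, so whichever variant survives keeps its own punctuation and spacing.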

    Regards
    Ford