I have an excel worksheet with several hundreds of thousands of "wisdom" quotations in it. The quotation itself and the name of the author are both contained in the same cell.
I've removed all duplicate cells. My challenge now is that there are many occurrences of the same quote; except there's maybe an extra space somewhere or a hyphen in a different place.
To save the time consuming process of sorting and manually deleting, is there some way I can automate a process that finds all records (cells) where there is, say, a 95% match, and then automatically deletes the "near duplicate" cell with the least characters. Maybe I'm being hopeful
Thanks for any help with this query.
Bookmarks