Hi everyone,
I have a problem with a data set of mine:
I've a table in which I identified duplicates in a column (column E). However, I would like to delete all duplicates except the one that has the highest value in column F (the similarity index). In short, I wanna keep the best of the duplicates and get rid of the others. Is there a way/formula how to approach this for a big data set with various duplicates?
I'd really appreciate any help!
Thank you!
Bookmarks