Hello,
Please see the attached file.
As you can see, it's a single column with many sentences (simple text).
Many of the rows share similar words.
I need a macro solution (only a macro, not a formula) that will scan all sentences in column A and delete every row that has at least 4 words in common with another sentence, EXCEPT FOR THE FIRST ROW THAT HAS THESE WORDS.
I am not talking about 4 words in the exact same order; I am talking about 4 words ANYWHERE IN THE SENTENCES.
For example, if column A has these 6 rows:
1. when and why lice lay eggs?
2. when can lice start laying eggs?
3. when lice lay eggs?
4. how lice lay eggs?
5. where can lice eggs live?
6. when can a lice lay eggs?
As you can see, rows 1, 3, and 6 have the words "when", "lice", "lay" and "eggs" in common.
After running the macro, rows 3 and 6 will be deleted and the output will look like this:
1. when and why lice lay eggs?
2. when can lice start laying eggs?
3. how lice lay eggs?
4. where can lice eggs live?
P.S.: The solution should not be case-sensitive; "Lice" and "lice" are the same word.
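To make the rule concrete, here is a rough sketch of the logic in Python. It is only an illustration of what I mean, not the macro I am asking for. The function names (`words`, `drop_similar`) and the `threshold` parameter are made up for this example. It assumes that punctuation such as "?" is ignored, that a repeated word in one sentence counts once, and that each row is compared only against the earlier rows that were kept.

```python
import re

def words(sentence):
    # Lowercase and keep only runs of letters/digits,
    # so "Lice" matches "lice" and "eggs?" matches "eggs".
    return set(re.findall(r"[a-z0-9]+", sentence.lower()))

def drop_similar(rows, threshold=4):
    kept = []        # sentences that survive, in original order
    kept_words = []  # word set of each surviving sentence
    for row in rows:
        w = words(row)
        # Assumption: compare only against earlier rows that were kept.
        if any(len(w & earlier) >= threshold for earlier in kept_words):
            continue  # shares 4+ words with an earlier kept row: delete
        kept.append(row)
        kept_words.append(w)
    return kept

rows = [
    "when and why lice lay eggs?",
    "when can lice start laying eggs?",
    "when lice lay eggs?",
    "how lice lay eggs?",
    "where can lice eggs live?",
    "when can a lice lay eggs?",
]
for row in drop_similar(rows):
    print(row)
```

On the six example rows this keeps rows 1, 2, 4 and 5, which is the output shown above.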
Thanks,
Sami