Return value based on most common words between two columns

**Frankjager** · 07-30-2020, 07:32 AM

I receive 200 stories per day, from which I shortlist X stories. I also have a database of 12,000 shortlisted stories. I want to develop an algorithm that compares each entry in column A (the 200 new stories) against column B (all the stories from the database) and returns the closest-matching story based on the words the two have in common, ignoring all stop words. I also want it to return the percentage of common words between the new story and the matched story.

Example:

Column A has 200 stories, in A1 to A200. Column B has 12,000, in B1 to B12,000. I want to compare A1 against B1 to B12,000 and return the story with which it shares the largest number of common words.
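The matching described above can be sketched in Python as follows. This is only a minimal illustration under stated assumptions, not the approach used in this thread: the stop-word list is a tiny placeholder, the names `tokenize` and `best_match` are hypothetical, and the percentage is assumed to mean the share of the new story's distinct non-stop words that also appear in the matched story.

```python
import re

# Hypothetical, deliberately tiny stop-word list; a real one would be much longer.
STOP_WORDS = {"a", "an", "and", "the", "of", "in", "on", "to", "is", "are", "for", "with"}


def tokenize(text):
    """Return the set of distinct lowercase words in text, minus stop words."""
    words = re.findall(r"[a-z0-9']+", text.lower())
    return {w for w in words if w not in STOP_WORDS}


def best_match(new_story, database):
    """Return (index, story, percent) for the database story sharing the most words.

    percent is the share of the new story's distinct non-stop words that also
    appear in the matched story (an assumed definition, see the text above).
    On a tie, the earliest database story wins.
    """
    new_words = tokenize(new_story)
    best_index, best_common = -1, -1
    for i, story in enumerate(database):
        common = len(new_words & tokenize(story))
        if common > best_common:
            best_index, best_common = i, common
    if best_index < 0:  # empty database
        return None, None, 0.0
    percent = 100.0 * best_common / len(new_words) if new_words else 0.0
    return best_index, database[best_index], percent


# Made-up sample data standing in for columns A and B.
column_b = [
    "Flood warnings issued for the coastal towns",
    "The central bank raises interest rates again",
    "Local team wins the national football final",
]
column_a = ["Central bank keeps interest rates on hold"]

for story in column_a:
    index, match, percent = best_match(story, column_b)
    print(f"{story!r} -> B{index + 1}: {match!r} ({percent:.0f}% common words)")
```

With 200 new stories against 12,000 stored ones, it would probably be worth tokenizing column B once up front rather than on every comparison; the sketch skips that to stay short.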

I have also attached an Excel file with some sample statements for both Column A and Column B.

Thank you

**XLent** · 07-30-2020, 10:31 AM

the below is not something I pulled together for your query but, rather, something I did a long time ago for someone else (under a different username) - but it might help?
(there will be other alternatives, obviously)

if it doesn't work for you we can leave it there as, tbh, it would be time-consuming to modify ;-)

with the UDF stored in a module, you could use this along the lines of:

[Formula and UDF code not included: hidden behind the forum's login prompt.]

**Frankjager** · 08-03-2020, 02:44 AM

Thanks, XLent. I will try your code and post an update with the results!
