I have downloaded and been using an extension for excel called Fuzzy look up which helps me find partial duplicate strings. It can be found here. http://www.microsoft.com/en-us/downl....aspx?id=15011
The tool works pretty well, but I know I'm not fully understanding all of its configurations.
I have two large databases of names, addresses and other information that I am trying to compare. I'm trying to find the match from database one to database two. Sometimes there is an exact match, sometimes the match is spelled differently or incorrectly, and sometimes there is no match at all. When I run the fuzzy lookup tool it does its best to find a match with at least an X% similarity (default 50%). However it's all over the place. Sometimes it's spot on, other times it's not even close.
One way I know that I can improve the results is to have it search and filter first by state and then by name, but I'm unable to figure out how to tell it to look at the states first, and then the names. In other words have fuzzy look at the states in table A and match them up with the states in table B. Of the states that match up then match the names from table A with table B, then repeat for every state. Every result should have the same state. If there is no match within 50% similarity in the same state, then leave it blank, don't try another state. If I wanted to I supposed I could filter Tables A and B by state then make new "state tables" and do fuzzy lookups by each state table, but I don't relish doing that fifty times.
Once I get comfortable with this solution I'd like to extend it to other fuzzy analyses like matching one e-mail up against another, but comparing the ending of the e-mail (everything after the @ sign), and then looking at the first part of the e-mail.
Anyone have any experience with this?
Attachment shows Tables A and B from two different databases. Names and states. The third table to the right was a fuzzy match up. I selected State by state and pushed the button in the middle, then matched name against name and pushed the button. Then I pushed go. As you can see there are some pretty good match ups, but several results are not in the same state.
PS I'm all ears for a different solution for this. In the past I've used the fuzzy lookup macro found here - http://www.mrexcel.com/forum/excel-q...planation.html - but it was PAINFULLY slow and seemed to have the same functionality as this add in, but was just used for pre office 2010 versions.
Thanks!
Bookmarks