Formula To "Score" Similar Data Entries

  1. #1
    Forum Contributor
    Join Date
    09-02-2009
    Location
    Texas, USA
    MS-Off Ver
    Excel 2007
    Posts
    110

    Formula To "Score" Similar Data Entries

    I am working with a database of over 3,000 homes. I am looking specifically for homes that have a mismatch between the owner address and the property address. The problem is that whoever entered the data at the city was not consistent with their naming conventions, and I get a lot of false positives as a result.

    For instance, 123 Green Trail and 123 Green Trl will be reported as a mismatch, even though these two addresses are the same. If you were to score this example, the two entries would perhaps be 90%+ similar.

    Would it be possible to make a new column and "score" the two previous columns using some formula or function to see how similar they are to one another? That way I can easily see which entries are too similar and filter by that instead of having to manually scan the whole database.
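
    To make the idea of a "score" column concrete, here is a minimal sketch in Python rather than an Excel formula. It assumes the two address columns have been exported as pairs of strings. The function name `similarity`, the sample rows, and the 90 cutoff are all hypothetical choices for illustration. The standard-library `difflib.SequenceMatcher` is just one possible similarity measure among several.

```python
from difflib import SequenceMatcher


def similarity(a: str, b: str) -> float:
    """Return a 0-100 similarity score, ignoring case and repeated spaces."""
    norm_a = " ".join(a.lower().split())
    norm_b = " ".join(b.lower().split())
    return round(100 * SequenceMatcher(None, norm_a, norm_b).ratio(), 1)


# Hypothetical rows: (owner address, property address)
rows = [
    ("123 Green Trail", "123 Green Trl"),
    ("123 Green Trail", "908 Oak Ave"),
]

THRESHOLD = 90  # assumed cutoff; tune it against your own data

for owner, prop in rows:
    score = similarity(owner, prop)
    label = "likely same address" if score >= THRESHOLD else "review as mismatch"
    print(f"{owner!r} vs {prop!r}: {score} ({label})")
```

    Rows that score above the cutoff are probably data-entry variants of the same address, so only the low-scoring rows would need a manual look.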

    Thanks in advance. If I need to clarify anything please let me know.
    Last edited by cmf0106; 10-26-2013 at 11:21 AM.

  2. #2
    Valued Forum Contributor
    Join Date
    05-13-2010
    Location
    Belo Horizonte, Brazil
    MS-Off Ver
    Excel 2003; 2007
    Posts
    441

    Re: Formula To "Score" Similar Data Entries

    Why not use FILTER and select the proper columns?
    Through the selection box you can see all the results that are similar.

    Doesn't that work for you?

    Best regards.
    Marcílio Lobão

  3. #3
    Forum Contributor
    Join Date
    09-02-2009
    Location
    Texas, USA
    MS-Off Ver
    Excel 2007
    Posts
    110

    Re: Formula To "Score" Similar Data Entries

    Quote Originally Posted by Mazzaropi View Post
    Why not use FILTER and select the proper columns?
    Through the selection box you can see all the results that are similar.

    Doesn't that work for you?

    This will not work. Here are some examples directly from the database. I am looking for a mismatch between the owner address and situs address fields. In the attached image, notice the properties on Loraine St. They were reported as a mismatch because the owner address field reads "NE Loraine St" while the situs_address field was entered as "E Loraine ST". The only reason they are returned as a mismatch is how the data was entered: someone forgot to include the "N" in "NE".

    The same goes for the other example: owner address "1724 N Edgewood Ter" and situs address "1724 N Edgewood Terr". The only difference between the two is that one does not include the extra "r".

    If I were able to create a new column with a formula that could somehow score how similar the entries are, I could filter out the false positives much more quickly.
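
    One way to express "how similar" for the two examples above is to count the single-character edits needed to turn one address into the other (Levenshtein distance) after normalizing case and spacing. The sketch below is Python rather than an Excel formula. The helper names `edit_distance` and `normalized` are hypothetical, and the idea that a small edit count signals a data-entry variant is an assumption to check against the real data.

```python
def edit_distance(a: str, b: str) -> int:
    """Levenshtein distance: fewest single-character inserts, deletes, or substitutions."""
    prev = list(range(len(b) + 1))  # distances from the empty prefix of a
    for i, ch_a in enumerate(a, start=1):
        curr = [i]
        for j, ch_b in enumerate(b, start=1):
            cost = 0 if ch_a == ch_b else 1
            curr.append(min(
                prev[j] + 1,         # delete ch_a
                curr[j - 1] + 1,     # insert ch_b
                prev[j - 1] + cost,  # match or substitute
            ))
        prev = curr
    return prev[-1]


def normalized(addr: str) -> str:
    """Upper-case and collapse whitespace so 'St' and 'ST' compare equal."""
    return " ".join(addr.upper().split())


# The two pairs quoted in the post above
pairs = [
    ("NE Loraine St", "E Loraine ST"),
    ("1724 N Edgewood Ter", "1724 N Edgewood Terr"),
]

for owner, situs in pairs:
    edits = edit_distance(normalized(owner), normalized(situs))
    print(f"{owner!r} vs {situs!r}: {edits} edit(s)")
```

    Sorting or filtering the new column by the edit count would put the near-identical pairs together, leaving the genuinely different addresses for review.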

    See the attached image, which illustrates this issue.

    exam3ple.png

  4. #4
    Forum Moderator alansidman
    Join Date
    02-02-2010
    Location
    Steamboat Springs, CO
    MS-Off Ver
    MS Office 365 Version 2404 Win 11 Home 64 Bit
    Posts
    23,865

    Re: Formula To "Score" Similar Data Entries

    If you have Excel 2010 available, Microsoft has an add-in that may work for you.

    http://www.excelforum.com/excel-tips...for-excel.html
