Hello,
My company is currently working on cleaning a database of roughly 22'000 customers. We have exported the customer list to Excel and want to make sure all the data is formatted correctly and in the right fields before re-importing into our system.
The formatting is not an issue, a few =PROPER and =TRIM formulas and some Find & Replacing will do the trick.
The problem is moving information into the correct columns and making everything structured and uniform.
The process for inputting and importing data has changed over the years and the result is that the data is very random. Certain standards were only adopted later on (eg. using US as a country code, whereas before it could be anything from U.S., U.S, USA, U.S.A, United States, America, etc...)
Additionally customers might have inputted their addresses in different ways, eg. city and state separated by a comma or space in one field instead of having them in different fields. It has even gone so far as in customers inputting their numbers next to their names (eg. Mike 0123456789).
I would post a screenshot however to protect our customers' privacy I am unable to do so.
Currently the only way we'd know how to clean this is to manually go through each entry. We've been advised to hire an 'Excel wizard' who can do all sorts of things with Excel through formulas, automation, macros, etc... We have however been asking ourselves how helpful this might be considering the high level of randomness and lack of structure with the data.
Would you recommend hiring an expert freelancer for the job or giving the data to an admin / data center to clean manually? With the latter there is obviously the risk of handing the data to scammers who will sell the information or phish the customers. This is a bit of a tough situation and any advice is much appreciated.
Many thanks in advance,
David
Bookmarks