Kimball Forum
Would you like to react to this message? Create an account in a few clicks or log in to continue.

Merging customer data from disparate sources to create a master customer dimension

2 posters

Go down

Merging customer data from disparate sources to create a master customer dimension Empty Merging customer data from disparate sources to create a master customer dimension

Post  jotrocks Wed Feb 01, 2012 5:47 pm

Looking for guidance in regards to merging disparate sources of customer data into one master customer table - which will be modeled into a conformed customer dimension down the road.

Problem is the data sources have the typical issues - SSN exists in some sources, not in others. Names and addresses are not consistent, DOBs are available in some, but not others - etc.

Any ideas on what is typically done in this scenario? My initial thought is to use some combination of SSN, Name, and DOB to create a user facing unique ID via a hash function in SQL Server (do not want to show users the actual SSN, obviously) and leverage the fuzzy lookup tasks in my ETL tool to ID merge customers that exist in multiple source systems.

Anyone done anything like this before? Thanks in advance!!!

Great forum, BTW!
Matt

jotrocks

Posts : 1
Join date : 2012-02-01

Back to top Go down

Merging customer data from disparate sources to create a master customer dimension Empty Re: Merging customer data from disparate sources to create a master customer dimension

Post  BoxesAndLines Thu Feb 02, 2012 11:39 am

Purchase data quality software. Profile the data, identify merge attributes, develop data quality plans. Strive for 97-98% match rate. Hand coding will reduce match rates significantly.
BoxesAndLines
BoxesAndLines

Posts : 1212
Join date : 2009-02-03
Location : USA

Back to top Go down

Back to top

- Similar topics

 
Permissions in this forum:
You cannot reply to topics in this forum