Merging customer data from disparate sources to create a master customer dimension
Looking for guidance on merging disparate sources of customer data into one master customer table, which will be modeled into a conformed customer dimension down the road.
The problem is that the data sources have the typical issues: SSN exists in some sources but not in others, names and addresses are not consistent, DOBs are available in some but not others, etc.
Any ideas on what is typically done in this scenario? My initial thought is to use some combination of SSN, name, and DOB to create a user-facing unique ID via a hash function in SQL Server (I do not want to show users the actual SSN, obviously) and to leverage the fuzzy lookup tasks in my ETL tool to identify and merge customers that exist in multiple source systems.
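To make the hashing idea concrete, here is a minimal sketch in Python rather than SQL Server. It is an illustration under assumptions, not a tested design: the `normalize` and `customer_key` helpers and the `SECRET_KEY` value are hypothetical names made up for this example. It uses a keyed hash (HMAC-SHA256) instead of a plain hash because an SSN has only nine digits, so an unkeyed hash of it could be reversed by brute force.

```python
import hashlib
import hmac
import re

# Hypothetical secret; in practice it would come from a secured store, not source code.
SECRET_KEY = b"replace-with-a-real-secret"


def normalize(value):
    """Uppercase the value and keep only letters and digits; None becomes ''."""
    if value is None:
        return ""
    return re.sub(r"[^A-Z0-9]", "", str(value).upper())


def customer_key(ssn, name, dob):
    """Return a hex ID derived from normalized SSN, name, and DOB via HMAC-SHA256."""
    parts = [normalize(ssn), normalize(name), normalize(dob)]
    message = "|".join(parts).encode("utf-8")
    return hmac.new(SECRET_KEY, message, hashlib.sha256).hexdigest()


# Same person, formatted differently in two sources: the IDs agree.
a = customer_key("123-45-6789", "Smith, John", "1970-01-02")
b = customer_key("123456789", "smith john", "19700102")

# Same person, but this source has no SSN: the ID is different.
c = customer_key(None, "Smith, John", "1970-01-02")

print(a == b, a == c)
```

The last case shows the limitation: the hash only lines up when all three inputs agree after normalization, so a source missing the SSN gets a different ID. That suggests the hash is better suited to masking the ID shown to users after records have been matched than to doing the matching itself.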
Anyone done anything like this before? Thanks in advance!!!
Great forum, BTW!
Matt
jotrocks- Posts : 1
Join date : 2012-02-01
Re: Merging customer data from disparate sources to create a master customer dimension
Purchase data quality software. Profile the data, identify merge attributes, develop data quality plans. Strive for 97-98% match rate. Hand coding will reduce match rates significantly.
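For a rough picture of what "profile the data" and "identify merge attributes" mean, here is a small hand-coded sketch in Python. It is not a substitute for the data quality software recommended above, and every name in it is an assumption made for illustration: the sample rows, the `fill_rates` and `match` helpers, and the 0.85 threshold. It uses the standard library's `difflib.SequenceMatcher` as a stand-in for a real fuzzy matching engine.

```python
from difflib import SequenceMatcher

# Hypothetical sample rows from two source systems.
crm = [
    {"ssn": "123456789", "name": "John Smith", "dob": "1970-01-02"},
    {"ssn": None, "name": "Mary Jones", "dob": None},
]
billing = [
    {"ssn": None, "name": "Jon Smith", "dob": "1970-01-02"},
    {"ssn": None, "name": "Alice Brown", "dob": "1985-06-30"},
]


def fill_rates(rows):
    """Profile a source: fraction of rows with a non-empty value per column."""
    columns = rows[0].keys()
    return {c: sum(1 for r in rows if r[c]) / len(rows) for c in columns}


def name_similarity(a, b):
    """Case-insensitive similarity ratio between 0.0 and 1.0."""
    return SequenceMatcher(None, a.lower(), b.lower()).ratio()


def match(left, right, threshold=0.85):
    """Pair rows by exact SSN when both have one, else by fuzzy name with no DOB conflict."""
    pairs = []
    for i, l in enumerate(left):
        for j, r in enumerate(right):
            if l["ssn"] and r["ssn"]:
                if l["ssn"] == r["ssn"]:
                    pairs.append((i, j))
                continue
            dob_conflict = bool(l["dob"] and r["dob"] and l["dob"] != r["dob"])
            if not dob_conflict and name_similarity(l["name"], r["name"]) >= threshold:
                pairs.append((i, j))
    return pairs


print("CRM fill rates:", fill_rates(crm))
print("Billing fill rates:", fill_rates(billing))

pairs = match(crm, billing)
match_rate = len({i for i, _ in pairs}) / len(crm)
print("Matched pairs:", pairs, "match rate:", match_rate)
```

The fill rates show which attributes are populated well enough in each source to be used for merging, and the match rate is the kind of number you would track against a target. A dedicated tool adds things this sketch lacks, such as address standardization, nickname tables, and blocking to avoid comparing every pair.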
BoxesAndLines- Posts : 1212
Join date : 2009-02-03
Location : USA