Table column analysis/profiling tool?
4 posters
Page 1 of 1
Table column analysis/profiling tool?
Hello,
Is there a tool (preferably freeware) that can analyse tables and present results like below?
I am in the process of doing a data analysis and profiling prior to developing a data mart. It helps me immensely if I can generate results like this for source database tables.
Could some experienced data analysts shed some light?
Thanks
Is there a tool (preferably freeware) that can analyse tables and present results like below?
Table name | Column name | data type and length | Minimum value | Maximum Value | Minimum Length* | Maximum Length* | Distinct Count | Null count | Null percentage | Top 3 values | Bottom 3 values |
dbo.Customer | JRBTGRP | varchar(50) | 'Home' | 'Joint Partnership' | 4 | 17 | 27 | 340 | 2.3% | 'Home', 'Public', 'Government' | 'Private', 'Charity', 'No rebate' |
dbo.Customer | JRBTLDN | numeric(10,2) | 0.0 | 24.67 | - | - | 675 | 2 | 0.04% | 2.45, 2.23, 1.75 | 0.45,0.0, 4.10 |
I am in the process of doing a data analysis and profiling prior to developing a data mart. It helps me immensely if I can generate results like this for source database tables.
Could some experienced data analysts shed some light?
Thanks
sman- Posts : 22
Join date : 2011-01-30
Check out Talend Data Quality Tool
I think this is what you are looking for. It is easy to get up and running and is open source.
Search google for "Talend Data Quality". I am new to the forum so I can't send you the direct link.
- dave
Search google for "Talend Data Quality". I am new to the forum so I can't send you the direct link.
- dave
For Dave...
Until Dave's noob time passes...
http://www.talend.com/products-data-quality/matrix-data-quality.php
http://www.talend.com/products-data-quality/matrix-data-quality.php
elmorejr- Posts : 25
Join date : 2011-10-20
Location : Columbus, Ohio, United States of America, Earth, Sol
Re: Table column analysis/profiling tool?
Thanks Dave and Elmorejr.
I've used Talend open profiler. I could not get it to do last two columns ("Top 3 values", "Bottom 3 values").
Also, is there a way to re-organise the Talend's output as I listed above?
I've noticed that the above format shows just enough information without taking too much space, which is handy when working with 50-100 tables.
The above info can be presented in MS Excel spreadsheets and further adhoc filtering can be done with Excel's drop down filters.
I've used Talend open profiler. I could not get it to do last two columns ("Top 3 values", "Bottom 3 values").
Also, is there a way to re-organise the Talend's output as I listed above?
I've noticed that the above format shows just enough information without taking too much space, which is handy when working with 50-100 tables.
The above info can be presented in MS Excel spreadsheets and further adhoc filtering can be done with Excel's drop down filters.
sman- Posts : 22
Join date : 2011-01-30
Re: Table column analysis/profiling tool?
Hi Sman,
There's a set of SSRS reports that came with the Kimball Microsoft Data Warehouse Toolkit book - you can download them from here:
http://www.kimballgroup.com/html/booksMDWTtools.html
I'm referring to the Data profiling reports. Being SSRS, you could extend or modify the reports to suit your requirements.
There's also a Data Profiler tool that comes with SSIS.
http://technet.microsoft.com/en-us/library/bb895310.aspx
http://consultingblogs.emc.com/jamiethomson/archive/2008/03/02/ssis-data-profiling-task-part-1-introduction.aspx
Both are limited to SQL Server tables as the data source.
Good luck!
Mike
There's a set of SSRS reports that came with the Kimball Microsoft Data Warehouse Toolkit book - you can download them from here:
http://www.kimballgroup.com/html/booksMDWTtools.html
I'm referring to the Data profiling reports. Being SSRS, you could extend or modify the reports to suit your requirements.
There's also a Data Profiler tool that comes with SSIS.
http://technet.microsoft.com/en-us/library/bb895310.aspx
http://consultingblogs.emc.com/jamiethomson/archive/2008/03/02/ssis-data-profiling-task-part-1-introduction.aspx
Both are limited to SQL Server tables as the data source.
Good luck!
Mike
Similar topics
» Dimensional table design dilemma, Aditional column or Xref table
» Column count on dimension table
» Table and Column Naming Conventions
» Column Property Enforcement
» best way for dimension table lookups, if natural key is multi-column (string)
» Column count on dimension table
» Table and Column Naming Conventions
» Column Property Enforcement
» best way for dimension table lookups, if natural key is multi-column (string)
Page 1 of 1
Permissions in this forum:
You cannot reply to topics in this forum