Develops an understanding of the content, quality, and rules of a specified set of data under management.
Category: 02 Data Profiling
Introductory Notes
Data profiling supports an e ective data quality program and is an important first step for many information technology initiatives. Data profiling should be conducted both periodically for critical data stores, and event-driven as an important activity in evalu- ating data quality for a specific purpose. For example, organizations may profile data prior to finalizing […]
Goals
1. A standard set of methods, tools, and processes for data profiling is established and followed. 2. Produce recommendations for improving the data quality improvements to data assets. 3. Physical data representation is factual, understandable, and enhances business understanding of the set of data under management.
Related Process Area
Business Glossary covers the standardization and approval of business terms that should be referenced in data profiling efforts. Metadata Management addresses classifications and detailed information about attributes and data elements referenced in data profiling efforts. Architectural Standards contains best practices that support the design and data representation standards referenced in data profiling e orts.
Core Questions
1. Does the organization have a standard method for profiling data? 2. Has the organization trained or acquired sta resources with expertise in data profiling tools and techniques? 3. Does the organization apply statistical models to analyze data profiling reports? 4. Do policies and processes specify the criteria for a data store to undergo profiling? […]
Functional Practice
[tabby title=»Performed»] Level 1: Performed 1.1 Basic profiling is performed for a data store(s). Basic profiling includes such things as analyzing the types or number of distinct values in a column, number or percent of zero, blank or null values, string length, date ranges, patterns, as opposed to the more advanced analysis such as cardinality, […]