Definition of Big DataIf someone says Big Data, it means huge volume, complex, and diversified datasets that are generated at high rate from distinct sources. These datasets are too big …
Data Science – Process and Iterative Phases
The Data Science process follows the scientific method of refining knowledge by making observations, formulating and testing hypotheses, observing results, and formulating general theories that explain results. Within Data Science, …
Data Quality Program – Readiness Assessment / Risk Assessment
Findings from a readiness assessment will help determine where to start and how quickly to proceed. Findings can also provide the basis for roadmapping program goals. If there is strong …
Data Quality – Tools
Tools should be selected and tool architectures should be set in the.planning phase of the enterprise Data Quality program. Tools provide a.partial rule set starter kit but organizations need to …
Data Quality – SLA – Service Level Agreements
A data quality Service Level Agreement (SLA) specifies an organization’s expectations for response and remediation for data quality issues in each system. Data quality inspections as scheduled in the SLA …
RDM – Reference Data Management
Reference Data is any data used to characterize or classify other data, or to relate data to information.Reference Data Management entails control and maintenance of defined domain values, definitions, and …
Data Management – What is Master Data?
Different types of Data play different roles within an organization. They also have different management requirements. Malcolm Chisholm has proposed a six-layer Taxonomy of Data that includes Metadata, Reference Data, …
Reference and Master Data Management – Guiding Principles
Shared Data: Reference and Master Data must be managed so that they are shareable across the organization. Ownership: Reference and Master Data belong to the organization, not to a particular …
Data Quality – Causes of Data Quality Issues – Part-02
Issues Caused by Data Entry Processes Data Entry Interface Issues: Poorly designed data entry interfaces can contribute to data quality issues. If a data entry interface does not have edits …
Data Quality – Business Rules
Business Rules are commonly implemented in software, or by using Document Templates for Data Entry. Some common simple Business Rule types are: Definitional Conformance: Confirm that the same understanding of …