Most organizations start with a Decentralized Model before they move toward a formal Data Management Organization (DMO). As an organization sees the impact of improvements in Data Quality, it may …
Big Data – MPP Shared-nothing Technologies and Architecture
MPP has evolved because traditional computing paradigms (indexes, distributed data sets, etc.) did not provide acceptable response times on massive tables. Massively Parallel Processing (MPP) Shared-nothing Database technologies have become …
What is Big Data: A Comprehensive Overview
Definition of Big DataIf someone says Big Data, it means huge volume, complex, and diversified datasets that are generated at high rate from distinct sources. These datasets are too big …
Data Science – Process and Iterative Phases
The Data Science process follows the scientific method of refining knowledge by making observations, formulating and testing hypotheses, observing results, and formulating general theories that explain results. Within Data Science, …
RDM – Reference Data Management
Reference Data is any data used to characterize or classify other data, or to relate data to information.Reference Data Management entails control and maintenance of defined domain values, definitions, and …
Data Management – What is Master Data?
Different types of Data play different roles within an organization. They also have different management requirements. Malcolm Chisholm has proposed a six-layer Taxonomy of Data that includes Metadata, Reference Data, …
Data Warehouse, Data Lake & Data Vault
Data Lakes & Data Warehouses Data Lakes and Data Warehouses both act as repositories, but they are designed for very different purposes. Data Warehouses work best for specific projects with …
Data Management – DII (A Momentary Look)
Plan and Analyze Define Data Integration and Lifecycle Requirements Perform Data Discovery Document Data Lineage Profile Data Collect Business Rules Design Data Integration Solutions Design Data Integration Architecture Select Interaction …
Data Management – Manage Versioning and Control
ANSI Standard 859 has three levels of control of data, based on the criticality of the data and the perceived harm that would occur if data were corrupted or otherwise …