image

Data Handling Ethics – Unethical Data Practices – Scenarios

Data Governance Ethics – Examples of Unethical Data Handling

ScenarioDescription
TimingIt is possible to lie through omission or inclusion of certain data points in a report or activity based on timing.
Misleading VisualizationsBias refers to an inclination of outlook. On the personal level, the term is associated with unreasoned judgments or prejudices. In statistics, bias refers to deviations from expected values. These are often introduced through systematic errors in sampling or data selection. Bias can be introduced at different points in the data lifecycle: when data is collected or created, when it is selected for inclusion in analysis, through the methods by which it is analyzed, and in how the results of analysis are presented. There are several types of bias, such as Data Collection for pre-defined results, Biased use of data collected, Hunch and search, Biased sampling methodology, and Context and Culture.
Unclear Definitions or Invalid ComparisonsThe ethical thing to do, in presenting information, is to provide context that informs its meaning, such as a clear, unambiguous definition of the population being measured and what it means to be “on welfare.” When the required context is left out, the surface of the presentation may imply that the data is not supported. Whether this effect is gained through the intent to deceive or through simply clumsiness, it is an unethical use of data. It is also simply necessary, from an ethical perspective, not to misuse statistics.
BiasData integration presents ethical challenges because data is changed as it moves from system to system. If data is not integrated with care, it presents risk for unethical or even illegal data handling. These ethical risks intersect with fundamental problems in data management, including Limited knowledge of data’s origin and lineage, Data of poor quality, Unreliable Metadata, and no documentation of data remediation history.
Transforming and Integrating DataBias refers to an inclination of outlook. On the personal level, the term is associated with unreasoned judgments or prejudices. In statistics, bias refers to deviations from expected values. These are often introduced through systematic errors in sampling or data selection. Bias can be introduced at different points in the data lifecycle: when data is collected or created, when it is selected for inclusion in analysis, through the methods by which it is analyzed, and in how the results of analysis are presented. There are several types of bias, such as Data Collection for pre-defined result, Biased use of data collected, Hunch and search, Biased sampling methodology, and Context and Culture.
Obfuscation / Redaction of DataObfuscating or redacting data is the practice of making information anonymous or removing sensitive information. But obfuscation alone may not be sufficient to protect data if a downstream activity (analysis or combination with other datasets) can expose the data. This risk is present in the following instances: Data aggregation, Data marking, and Data masking.

Leave a Reply

Your email address will not be published. Required fields are marked *

4 + 10 =