image

Data Handling Ethics – Unethical Data Practices – Scenarios

ScenarioDescription
TimingIt is possible to lie through omission or inclusion of certain data points in a report or activity based on timing.
Misleading VisualizationsCharts and graphs can be used to present data in a misleading manner. For instance, changing scale can make a trend line look better or worse.
Unclear Definitions or Invalid ComparisonsThe ethical thing to do, in presenting information, is to provide context that informs its meaning, such as a clear, unambiguous definition of the population being measured and what it means to be “on welfare.” When required context is left out, the surface of the presentation may imply meaning that the data does not support. Whether this effect is gained through the intent to deceive or through simply clumsiness, it is an unethical use of data. It is also simply necessary, from an ethical perspective, not to misuse statistics.
BiasBias refers to an inclination of outlook. On the personal level, the term is associated with unreasoned judgments or prejudices. In statistics, bias refers to deviations from expected values. These are often introduced through systematic errors in sampling or data selection. Bias can be introduced at different points in the data lifecycle: when data is collected or created, when it is selected for inclusion in analysis, through the methods by which it is analyzed, and in how the results of analysis are presented. There are several types of bias such as Data Collection for per-defined result, Biased use of data collected, Hunch and search, Biased sampling methodology, and Context and Culture.
Transforming and Integrating DataData integration presents ethical challenges because data is changed as it moves from system to system. If data is not integrated with care, it presents risk for unethical or even illegal data handling. These ethical risks intersect with fundamental problems in data management, including Limited knowledge of data’s origin and lineage, Data of poor quality, Unreliable Metadata, and No documentation of data remediation history.
Obfuscation / Redaction of DataObfuscating or redacting data is the practice of making information anonymous or removing sensitive information. But obfuscation alone may not be sufficient to protect data if a downstream activity (analysis or combination with other datasets) can expose the data. This risk is present in the following instances: Data aggregation, Data marking, and Data masking.

Leave a Reply

Your email address will not be published. Required fields are marked *

15 − three =