LDAPWiki: Data anonymization

Overview#

Data anonymization or Pseudonymization is a type of privacy Enhancing Technologies

Data anonymization is a procedure by which the most attributes within a data record are replaced by one or more artificial identifiers, or pseudonyms.

There can be a single pseudonym for a collection of replaced fields or a pseudonym per replaced field. The purpose is to render the data record less identifying and therefore lower End-User or patient objections to its use.

Data anonymization is the process of either encrypting or removing personally Identifiable Information from data sets, so that the Personal Entity whom can remain anonymous.

The Privacy Technology Focus Group defines Data anonymization as "technology that converts clear text data into a non-person entity readable and irreversible form, hashing and encryption techniques in which the decryption key has been discarded."

Data anonymization enables the Data In Transit across a boundary, such as between two departments within an agency or between two agencies, while reducing the risk of unintended Disclosure, and in certain environments in a manner that enables evaluation and analytics post-anonymization.

Health Data Set #

Data anonymization in the context of medical data, anonymized data refers to Patient Data from which the patient cannot be identified by the recipient of the information. HIPAA is very specific on what data MUST be removed together with any other information which, in conjunction with other data held by or disclosed to the recipient, could provide Identification the patient.

Following Data anonymization on Protected Health Information, the data is no longer Protected Health Information and is referred to a Health Dataset

De-anonymization is the reverse process in which anonymous data is cross-referenced with other data sources to re-identify the anonymous data source.

Generalization and perturbation are the two popular anonymization approaches for relational data.

vulnerability of Data anonymization[2][3]#

Data anonymization ability to maintain anonymity within Data anonymization data sets is questionable. Several reports indicate that a small amount of externally associated data may provide De-anonymization

More Information#

There might be more information for this subject on one of the following:

[#1] - Data_anonymization - based on information obtained 2016-07-06
[#2] - No silver bullet: De-identification still doesn’t work - based on information obtained 2016-08-14
[#3] - SCIENTISTS EXPOSE NEW VULNERABILITIES IN THE SECURITY OF PERSONAL GENETIC INFORMATION - based on information obtained 2016-08-14

Overview#

Health Data Set#

vulnerability of Data anonymization[2][3]#

More Information#

Health Data Set #