Modifying information
Section 9 of 10
Completely removing information from a dataset ensures the information cannot be used to identify participants or subjects. Sometimes, however, information can be modified enough that it no longer poses risk of identification and can thus remain in the dataset. This involves more effort but is a good option if complete removal of the information significantly de-values the dataset.
Methods of modifying data to limit identification include:
Methods used to modify a dataset must be documented and documentation must be published alongside the modified dataset if it is to retain its value to other researchers.
Question: which of the following is a direct identifier?
1. Year of birth
- NO – many people can be born in the same year
2. Names of relatives
- YES – names of relatives could directly identify an individual.
3. Occupation
- NO – again this information in itself is not enough to identify an individual, unless combined with other identifiers.