2 research outputs found
Priority driven K-anonymisation for privacy protection
[Abstract]: Given the threat of re-identi¯cation in our growing digital society, guaranteeing privacy while providing worthwhile data for knowledge discovery has become a diffcult problem. K-anonymity is a major technique
used to ensure privacy by generalizing and suppressing attributes and has been the focus of intense research in the last few years. However, data modification techniques like generalization may produce anonymous data unusable for medical studies because some attributes become too coarse-grained. In this paper, we propose a priority driven k-anonymisation that allows to specify the degree of acceptable distortion for each attribute separately. We also defined some appropriate metrics to measure the distance and information loss, which are suitable for both numerical and categorical attributes. Further, we formulate
the priority driven k-anonymisation as the k-nearest
neighbor (KNN) clustering problem by adding a con-
straint that each cluster contains at least k tuples.
We develop an efficient algorithm for priority driven
k-anonymisation. Experimental results show that the
proposed technique causes significantly less distor-
tions