[INFO REQUEST] Distinction and Separation #454

Open
umma08 opened this issue Jan 10, 2024 · 4 comments

Comments

umma08 commented Jan 10, 2024

Can you please point me to documentation that explains what the Distinction and Separation scores represent when calculated by the ARX risk assessment tool? I have tried to find this, but it is not clear from the documentation.

Collaborator

prasser commented Jan 10, 2024

Please take a look at this paper: https://redirect.cs.umbc.edu/~kunliu1/p3dm08/proceedings/2.pdf
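
For readers who want a concrete picture before opening the paper, here is a minimal sketch of how the two scores are commonly defined. It assumes that distinction is the fraction of distinct value combinations among all records, and that separation is the fraction of record pairs that differ in at least one of the selected attributes. The function names and the sample records are hypothetical, and this is not ARX's own implementation.

```python
from collections import Counter


def _keys(records, attrs):
    # Project each record onto the selected attributes.
    return [tuple(r[a] for a in attrs) for r in records]


def distinction(records, attrs):
    """Assumed definition: distinct value combinations / number of records."""
    keys = _keys(records, attrs)
    return len(set(keys)) / len(keys)


def separation(records, attrs):
    """Assumed definition: share of record pairs that differ on attrs."""
    keys = _keys(records, attrs)
    n = len(keys)
    if n < 2:
        raise ValueError("separation needs at least two records")
    total_pairs = n * (n - 1) // 2
    # Pairs that agree on every selected attribute are not separated.
    same_pairs = sum(c * (c - 1) // 2 for c in Counter(keys).values())
    return (total_pairs - same_pairs) / total_pairs


# Hypothetical sample data, for illustration only.
records = [
    {"age": 30, "zip": "12345", "sex": "F"},
    {"age": 30, "zip": "12345", "sex": "M"},
    {"age": 41, "zip": "12345", "sex": "F"},
    {"age": 41, "zip": "54321", "sex": "F"},
]

for attrs in (["age"], ["age", "zip"], ["age", "zip", "sex"]):
    print(attrs, distinction(records, attrs), separation(records, attrs))
```

Under these assumed definitions, both scores rise toward 1.0 as the selected attributes come closer to identifying each record uniquely.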

Author

umma08 commented Feb 7, 2024

> Please take a look at this paper: https://redirect.cs.umbc.edu/~kunliu1/p3dm08/proceedings/2.pdf

Thank you, this is quite helpful.

If I may ask a follow-up question:

In the Analyze Risk window, I am able to select or deselect entries as 'quasi-identifiers' (QIs), and a window then populates with relative Distinction and Separation scores. If I select a large number of QIs, I receive an error message saying that too many QIs were found, followed by a number stating how many. What does this error actually mean in the context of the risk analysis?

Collaborator

prasser commented Feb 7, 2024

This is not an error message. ARX calculates the metrics for all combinations of the selected attributes and this message just indicates that the number of combinations becomes too large. It may be possible to adjust this threshold in the settings.
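
To illustrate why the number of combinations grows so quickly, here is a rough sketch that counts the non-empty subsets of the selected attributes, optionally capped at a maximum subset size. The function name and the `max_size` parameter are hypothetical, and whether ARX enumerates combinations in exactly this way is an assumption.

```python
from math import comb


def count_combinations(n_attrs, max_size=None):
    """Count non-empty attribute subsets, optionally capped at max_size."""
    if max_size is None or max_size > n_attrs:
        max_size = n_attrs
    return sum(comb(n_attrs, k) for k in range(1, max_size + 1))


for n in (5, 10, 20):
    print(n, count_combinations(n), count_combinations(n, max_size=10))
```

Without a cap, the count is 2^n - 1, so each additional selected attribute roughly doubles the work.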

Author

umma08 commented Feb 8, 2024

> It may be possible to adjust this threshold in the settings.

Yes, I had looked at this, but the 'max number of attributes per quasi-identifier' setting seems to be capped at 10.

The settings panel appears to block any larger value.
