All we require is that 0 <= max(data.df[col]) < data.domain[col], which you'll see is true in your example above.
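As a rough illustration of that requirement, here is a minimal sketch that checks it on a plain pandas DataFrame. The helper name `columns_outside_domain` and the use of a plain dict for the domain sizes are assumptions made for this example, not part of the library; the check on the minimum also assumes the values are non-negative integer codes.

```python
import pandas as pd


def columns_outside_domain(df, domain):
    """Return the columns whose values fall outside [0, domain[col]).

    `domain` is a plain dict mapping column name -> number of allowed
    values (a stand-in for this example, not the library's own object).
    """
    bad = []
    for col, size in domain.items():
        values = df[col]
        # Every value must be a code in 0, 1, ..., size - 1.
        if values.min() < 0 or values.max() >= size:
            bad.append(col)
    return bad


# Hypothetical toy data: "age" already discretized into 8 bins.
df = pd.DataFrame({"age": [0, 3, 7], "sex": [0, 1, 1]})
domain = {"age": 8, "sex": 2}
print(columns_outside_domain(df, domain))
```

An empty list means every column satisfies the bound; any column listed has at least one value that is negative or not smaller than its declared domain size.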
It's better to have smaller domains in general. If you have one attribute that can take 10000 possible values, the code may work, but its scalability and/or accuracy may suffer. It's best to keep domains small; even 100 is probably larger than it needs to be for the discretized numeric attributes in the adult dataset.
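One way to keep a numeric attribute's domain small is to bin it before use. The sketch below uses equal-width bins with NumPy; the function name `discretize`, the choice of 8 bins, and the sample ages are all assumptions for illustration, and other binning schemes (for example quantile-based) may suit a given dataset better.

```python
import numpy as np


def discretize(values, n_bins):
    """Map numeric values to integer codes in 0..n_bins-1 (equal-width bins)."""
    values = np.asarray(values, dtype=float)
    edges = np.linspace(values.min(), values.max(), n_bins + 1)
    # Use only the interior edges so the minimum maps to 0 and the
    # maximum maps to n_bins - 1, keeping every code inside the domain.
    codes = np.digitize(values, edges[1:-1])
    return codes, edges


# Hypothetical raw ages, reduced to a domain of size 8.
ages = [17, 25, 38, 52, 90]
codes, edges = discretize(ages, n_bins=8)
print(codes)
```

The resulting codes can replace the raw column, with the domain size for that attribute set to `n_bins`.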
Yes, all mechanisms that use Private-PGM expect discrete data, but it's an interesting open problem to develop approaches that can handle numeric data as well!