The data generator fails to generate the correct number of training samples #13

Open
ChenBaiyang opened this issue Mar 2, 2023 · 1 comment

Comments

@ChenBaiyang

Hi,

I found that DataGenerator.generator() in the code does not generate data properly. Two parameters are affected:

  • at_least_one_labeled does not work.
  • la and test_size are not consistent.
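
To make the first point checkable, here is a minimal sketch of the count I would expect. It is not the repository's code. The helper names are hypothetical, and the semantics are my assumptions: a float la is a fraction of the training anomalies (rounded down), an int la is an absolute count, label 1 marks a labeled anomaly, and at_least_one_labeled=True forces at least one labeled anomaly whenever one is available.

```python
def expected_labeled_anomalies(n_train_anomalies, la, at_least_one_labeled):
    """Expected number of labeled anomalies in the training split (assumed semantics)."""
    if isinstance(la, float):
        # Assumption: a float la is a fraction of the training anomalies, rounded down.
        k = int(la * n_train_anomalies)
    else:
        # Assumption: an int la is an absolute count, capped at what is available.
        k = min(la, n_train_anomalies)
    # Assumption: the flag guarantees one labeled anomaly when the count would be zero.
    if at_least_one_labeled and k == 0 and n_train_anomalies > 0:
        k = 1
    return k


def check_labeled_count(y_train_semi, n_train_anomalies, la, at_least_one_labeled):
    """Compare the observed labeled-anomaly count (label 1) with the expected one."""
    observed = sum(1 for label in y_train_semi if label == 1)
    expected = expected_labeled_anomalies(n_train_anomalies, la, at_least_one_labeled)
    return observed, expected, observed == expected
```

Calling check_labeled_count on the training labels returned by the generator would show whether the flag takes effect: with few training anomalies and a small la, the expected count is 1 when the flag is set and 0 otherwise.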

Thank you for your assistance.
Bryan

@ChenBaiyang

Moreover, after manually correcting the number of training seeds, I got results that differ substantially from those reported in the paper. For example:

  • Parameter settings:
    la=0.05, test_size=0.95, at_least_one_labeled=True, model_dict = {'DevNet': DevNet, 'RF': supervised, 'CatB': supervised}

  • Results in terms of AUC:

    Dataset        DevNet     RF         CatB
    6_cardio       0.572856   0.7334     0.847772
    25_musk        1.0        1.0        0.999278
    26_optdigits   0.974521   0.933136   0.914885
    36_speech      0.614187   0.493655   0.497312
    40_vowels      0.883842   0.805545   0.885713

  • Log: see the attached file.

Thank you very much.
Java Printing.pdf

Bryan
