Quality of adversaries and authenticity of results #24

Open
SachJbp opened this issue Jun 20, 2020 · 4 comments

Comments

SachJbp commented Jun 20, 2020

There seems to be an issue with a few of the adversarial examples.

For example, one claimed adversary from mr_bert.txt is:
orig sent (0): to portray modern women the way director davis has done is just unthinkable
adv sent (1): to portray modern women the way director davis has done is just imaginable

"unthinkable" and "imaginable" are antonyms, yet their embeddings have a high cosine similarity, which wrongly suggests that they are synonyms. I suggest that such examples should not be counted when evaluating the attack's success rate, since a human evaluator would clearly label the adversarial sentence as positive (1), not negative.
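To make the suggestion concrete, here is a minimal sketch of the kind of filter I have in mind. It is not the repository's code: the vectors in `TOY_VECTORS`, the `ANTONYMS` lexicon, the function names, and the 0.5 threshold are all made up for illustration. A real version would load the actual embeddings and take antonym pairs from some lexical resource.

```python
import math

# Hypothetical toy vectors for illustration only; the real embeddings
# would be loaded from disk.
TOY_VECTORS = {
    "unthinkable": [0.9, 0.1, 0.3],
    "imaginable": [0.8, 0.2, 0.35],
    "inconceivable": [0.88, 0.12, 0.28],
}

# Hypothetical antonym lexicon, stored as unordered pairs.
ANTONYMS = {frozenset(("unthinkable", "imaginable"))}


def cosine(u, v):
    """Cosine similarity between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm = math.sqrt(sum(a * a for a in u)) * math.sqrt(sum(b * b for b in v))
    return dot / norm


def is_valid_substitute(orig, cand, threshold=0.5):
    """Accept a candidate only if it is similar enough AND not a known antonym."""
    if frozenset((orig, cand)) in ANTONYMS:
        return False
    return cosine(TOY_VECTORS[orig], TOY_VECTORS[cand]) >= threshold


# The antonym pair passes the similarity check but is rejected by the filter.
print(cosine(TOY_VECTORS["unthinkable"], TOY_VECTORS["imaginable"]) >= 0.5)
print(is_valid_substitute("unthinkable", "imaginable"))
print(is_valid_substitute("unthinkable", "inconceivable"))
```

With these toy values, similarity alone would accept "imaginable" as a replacement for "unthinkable"; the antonym check is what rejects it.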

jind11 (Owner) commented Jun 20, 2020

Yes, the human evaluation score on polarity is not 100% because of such errors.

SachJbp (Author) commented Jun 20, 2020

The reported ~13% after-attack accuracy counts such examples as successful attacks, which they actually are not. I think the human-evaluation filter should ultimately govern the after-attack accuracy. Please correct me if I am wrong. Thanks.
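As a rough illustration of what I mean, the sketch below recomputes after-attack accuracy with and without a human-validity filter. Everything in it is hypothetical: the record fields `attack_flipped` and `human_valid`, the function name, and the ten toy records are invented for the example and are not taken from the paper or the repository. It also assumes every example was classified correctly before the attack, and it treats a human-rejected adversary as a failed attack, which is only one possible convention.

```python
def after_attack_accuracy(records, use_human_filter=False):
    """Fraction of examples the model still gets right after the attack.

    Each record is a dict with two hypothetical boolean fields:
      attack_flipped: the attack changed the model's prediction
      human_valid:    a human judged that the adversary keeps the original label
    """
    correct = 0
    for r in records:
        # Without the filter, any flipped prediction counts as a success.
        # With it, only flips that humans consider label-preserving count.
        fooled = r["attack_flipped"] and (r["human_valid"] or not use_human_filter)
        if not fooled:
            correct += 1
    return correct / len(records)


# Toy data: 8 of 10 predictions flipped, 2 of those flips rejected by humans.
records = (
    [{"attack_flipped": True, "human_valid": True}] * 6
    + [{"attack_flipped": True, "human_valid": False}] * 2
    + [{"attack_flipped": False, "human_valid": True}] * 2
)

print(after_attack_accuracy(records))
print(after_attack_accuracy(records, use_human_filter=True))
```

On this toy data the unfiltered figure is 2 out of 10 and the filtered figure is 4 out of 10, so the reported number would rise once invalid adversaries stop counting as successes.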

jind11 (Owner) commented Jun 20, 2020 via email

Youoo1 commented Oct 20, 2021

Where is the emdding.npz file, please? Or how is it generated?