The implementation of coverage mechanism seems to be wrong. #11

Open
NileZhou opened this issue Apr 25, 2019 · 4 comments

Comments

@NileZhou

The authors of the pointer-generator paper propose a method called the coverage mechanism.
The coverage vector is the sum of the attention distributions over all previous decoder timesteps, but the coverage vector in this repository seems to sum the encoder attention!
Please help me find the correct way to implement the mechanism, or tell me where I am mistaken.
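To make the question concrete, here is a minimal sketch of the definition as described above. It is plain Python rather than this repository's code, and `coverage_vectors` and the toy attention values are hypothetical names and data used only for illustration. Each attention distribution is taken over source (encoder) positions, and the sum runs over previous decoder steps.

```python
from typing import List


def coverage_vectors(attn_per_step: List[List[float]]) -> List[List[float]]:
    """Return the coverage vector available at each decoder step.

    attn_per_step[t][i] is the attention weight that decoder step t places
    on source (encoder) position i. The coverage at step t is the
    element-wise sum of the attention distributions from steps 0..t-1,
    so the first step sees an all-zero coverage vector.
    """
    if not attn_per_step:
        return []
    src_len = len(attn_per_step[0])
    running = [0.0] * src_len
    coverages = []
    for attn in attn_per_step:
        # Record coverage *before* adding the current step's attention.
        coverages.append(list(running))
        running = [c + a for c, a in zip(running, attn)]
    return coverages


# Toy example: 3 decoder steps attending over 3 source positions.
attn = [
    [0.5, 0.5, 0.0],
    [0.25, 0.25, 0.5],
    [0.0, 0.5, 0.5],
]
cov = coverage_vectors(attn)
print(cov[2])  # [0.75, 0.75, 0.5]
```

Read this way, "sum over previous decoder timesteps" and "sum of encoder attention" can describe the same quantity: the vector has one entry per encoder position, and the accumulation happens across decoder steps.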

@xcfcode

xcfcode commented Apr 25, 2019

According to Eq. 10 of the paper "Get To The Point", the coverage vector is the sum of the attention distributions over the encoder states from all previous decoder steps.
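For reference, the definition written out, in notation recalled from the paper and worth checking against it: $a^{t}$ is the attention distribution over source positions at decoder step $t$, $c^{t}$ is the coverage vector, and $i$ indexes source positions. The second line is the per-step coverage loss that the paper adds to the training objective.

```latex
c^{t} = \sum_{t'=0}^{t-1} a^{t'}
\qquad
\text{covloss}_{t} = \sum_{i} \min\left(a_{i}^{t},\, c_{i}^{t}\right)
```

So $c^{t}$ has one entry per encoder position, while the sum itself runs over earlier decoder steps.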

@NileZhou
Author

> According to the paper "Get-to-the-Point", eq10, coverage vector is the sum of previous encoder attentions.

Thanks, you're right.
I want to ask more about the mechanism. Even when the amount of data is very small (I have 5,000 content/headline pairs), the model in this repository still tends to repeat itself.
My parameter settings for the coverage mechanism (in params.py):
enc_attn_cover = True
cover_func = 'max'
cover_loss: float = 1
show_cover_loss = False

I would appreciate it if you could help me!

@xcfcode

xcfcode commented Apr 25, 2019

Sorry, I ran some experiments with this repo but could not reproduce the results on CNNDM, so there may still be some errors in the repo. Additionally, the pointer-generator uses "sum" as cover_func.
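As a rough illustration of why the choice of cover_func can matter, the sketch below compares "sum" and "max" aggregation on toy numbers. It is plain Python and an assumption about what the two options mean, not a copy of this repository's implementation; `aggregate_coverage`, `coverage_loss` and the sample values are hypothetical.

```python
from typing import List


def aggregate_coverage(prev_attns: List[List[float]], cover_func: str = "sum") -> List[float]:
    """Combine previous attention distributions into one coverage vector.

    Hypothetical helper: 'sum' follows the paper's definition, while 'max'
    keeps, per source position, the largest weight any earlier step gave it.
    Returns an empty list when there are no previous steps.
    """
    if cover_func not in ("sum", "max"):
        raise ValueError(f"unknown cover_func: {cover_func!r}")
    if not prev_attns:
        return []
    reduce_fn = sum if cover_func == "sum" else max
    # zip(*prev_attns) groups the weights by source position.
    return [reduce_fn(col) for col in zip(*prev_attns)]


def coverage_loss(attn: List[float], coverage: List[float]) -> float:
    """Per-step coverage loss: sum over positions of min(attention, coverage)."""
    return sum(min(a, c) for a, c in zip(attn, coverage))


# Two earlier decoder steps both attended heavily to source position 0.
prev = [[0.5, 0.5, 0.0], [0.5, 0.25, 0.25]]
# The current step attends to position 0 yet again.
current = [0.75, 0.25, 0.0]

sum_cov = aggregate_coverage(prev, "sum")
max_cov = aggregate_coverage(prev, "max")
loss_sum = coverage_loss(current, sum_cov)
loss_max = coverage_loss(current, max_cov)
print(sum_cov, loss_sum)  # [1.0, 0.75, 0.25] 1.0
print(max_cov, loss_max)  # [0.5, 0.5, 0.25] 0.75
```

In this toy case "max" caps the coverage at the largest single-step weight, so re-attending to an already covered position is penalised less than with "sum". Whether that explains the repetition reported above is not something this sketch can show.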

@NileZhou
Author

> Sorry, I have tried some experiments using this repo but I can not reproduce the result on CNNDM, So, maybe there are still some errors in the repo. Additionally, Pointer-generator uses "sum" as cover_func.

Thanks for your help.
