Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Need help for retraining and cross validation #18

Open
atulkum opened this issue Nov 30, 2018 · 12 comments
Open

Need help for retraining and cross validation #18

atulkum opened this issue Nov 30, 2018 · 12 comments
Labels
good first issue Good for newcomers help wanted Extra attention is needed

Comments

@atulkum
Copy link
Owner

atulkum commented Nov 30, 2018

Need help for retraining and cross validation and see if the ROUGE score matches exactly (or better) with the numbers reported in the paper.
I just train for 500k iteration (with batch size 8) with pointer generation enabled + coverage loss disabled and next 100k iteration (with batch size 8) with pointer generation enabled + coverage loss enabled.

It would be great if someone can help re-running these experiments and try to see if we can improve the result and match it with the paper.

You might need a better GPU though. (my current one is gtx 1070 8 gb)

@atulkum atulkum added help wanted Extra attention is needed good first issue Good for newcomers labels Nov 30, 2018
@pengzhi123
Copy link

i very want to help you, but i only have a 1080ti 12g and don't know how change code to get BLEU score.
sorry.

@atulkum
Copy link
Owner Author

atulkum commented Dec 7, 2018

You can compare the rouge score too. I used 1070 with 8 gb and it took 3 days to train for 500k iteration. On 1080 ti it must be faster.

@pengzhi123
Copy link

You can compare the rouge score too. I used 1070 with 8 gb and it took 3 days to train for 500k iteration. On 1080 ti it must be faster.

I have finished the test of 100K and am now doing another test of 500K.

@atulkum
Copy link
Owner Author

atulkum commented Dec 8, 2018

Thats great. One more option would be to train for 700k make checkpoint every 50k and verify which checkpoint give best result.

@pengzhi123
Copy link

Thats great. One more option would be to train for 700k make checkpoint every 50k and verify which checkpoint give best result.

I've done 288K/500K, and I will start the 700k test in 2 days. So I'm going to upload it to DropBox, or you can select one to me.

@atulkum
Copy link
Owner Author

atulkum commented Dec 8, 2018

You don't need to upload the model. You can just report the rouge score.

@pengzhi123
Copy link

You don't need to upload the model. You can just report the rouge score.

ok.

@shivam13juna
Copy link

shivam13juna commented Jan 2, 2019

@atulkum did you try this model on some external data? like how do you convert just a csv file of text data to bin format. And could you upload pretrianed weight as well?? @pengzhi123

@pengzhi123
Copy link

I'm sorry for uploading data now. Our machine is broken, and I only trained to 660K. The following is the experimental result:
100k (batch size 8):
ROUGE-1:
rouge_1_f_score: 0.3420 with confidence interval (0.3397, 0.3443)
rouge_1_recall: 0.3830 with confidence interval (0.3803, 0.3856)
rouge_1_precision: 0.3288 with confidence interval (0.3263, 0.3312)

ROUGE-2:
rouge_2_f_score: 0.1401 with confidence interval (0.1382, 0.1420)
rouge_2_recall: 0.1568 with confidence interval (0.1545, 0.1590)
rouge_2_precision: 0.1350 with confidence interval (0.1331, 0.1369)

ROUGE-l:
rouge_l_f_score: 0.3105 with confidence interval (0.3083, 0.3126)
rouge_l_recall: 0.3475 with confidence interval (0.3448, 0.3500)
rouge_l_precision: 0.2987 with confidence interval (0.2964, 0.3010)

500k (batch size 8):
rouge_1_f_score: 0.3603 with confidence interval (0.3580, 0.3624)
rouge_1_recall: 0.4006 with confidence interval (0.3980, 0.4032)
rouge_1_precision: 0.3475 with confidence interval (0.3449, 0.3500)

ROUGE-2:
rouge_2_f_score: 0.1538 with confidence interval (0.1515, 0.1560)
rouge_2_recall: 0.1703 with confidence interval (0.1679, 0.1727)
rouge_2_precision: 0.1492 with confidence interval (0.1469, 0.1514)

ROUGE-l:
rouge_l_f_score: 0.3292 with confidence interval (0.3270, 0.3313)
rouge_l_recall: 0.3659 with confidence interval (0.3633, 0.3684)
rouge_l_precision: 0.3177 with confidence interval (0.3153, 0.3202)

@atulkum
Copy link
Owner Author

atulkum commented Jan 7, 2019

Thanks for doing this. Did you enabled coverage loss for this result?

@pengzhi123
Copy link

@liruowei0919
Copy link

@pengzhi123 Hi there, Can I ask what machine u r running it on? Seems really fast

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
good first issue Good for newcomers help wanted Extra attention is needed
Projects
None yet
Development

No branches or pull requests

4 participants