Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

sentences lost when runs force_align #23

Open
duterscmy opened this issue Oct 19, 2020 · 2 comments
Open

sentences lost when runs force_align #23

duterscmy opened this issue Oct 19, 2020 · 2 comments

Comments

@duterscmy
Copy link

duterscmy commented Oct 19, 2020

I have train a model on a big corpus, and then I want to obtain align result on some new data.
Like ./scripts/force-align-moses.sh, the .vcb .cooc files are new generated and the .classes use existed files, then use mgiza to obtain results.
However,there are nearly a half of sentences are lost in en2cn.A3.final.part000-047, hence I can't use ./scripts/merge_result.py to merge results.
Where could be my problem???

@duterscmy
Copy link
Author

"lost" means that the align results of these sentence are not appeared in en2cn.A3.final.part000-047

@duterscmy
Copy link
Author

the error maybe WARNING: Hill Climbing yielded a zero score viterbi alignment for the following pair: AL(l:7,m:3)(a: 5 6 7 )(fert: 0 0 0 0

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

1 participant