Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Bug in averaging #1

Open
nschneid opened this issue Aug 28, 2013 · 0 comments
Open

Bug in averaging #1

nschneid opened this issue Aug 28, 2013 · 0 comments
Labels

Comments

@nschneid
Copy link
Owner

Unless the --no-averaging flag is specified, the learner will attempt to use the averaged perceptron for training. However, the averaging formula is incorrect when averages are only computed for updated weights (and averaging the entire weight vector would be too slow).

A correct approach can be found in p. 19 of Hal Daumé's thesis. This keeps two vectors: the current weights, and the weights' deviation from the sum over all learning timesteps. Both vectors are sparsely updated, and allow the averaged vector to be computed at the end of learning.

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

No branches or pull requests

1 participant