Skip to content
This repository has been archived by the owner on Oct 20, 2018. It is now read-only.

Adds log-linear weighting of features for disambiguation #390

Open
wants to merge 62 commits into
base: development
Choose a base branch
from

Conversation

tgalery
Copy link
Member

@tgalery tgalery commented Jan 9, 2016

This is a version of #373 that tries to remove some duplicated commits. The original PR adds Log Linear Model based weighting to the disambiguation procedure. It also adds some new features, such as a bias term and a lexical similarity measure of the surface form and candidate entity name.

echo "Downloading ranklib..."
mkdir -p $BASE_WDIR/ranklib/
cd $BASE_WDIR/ranklib/
curl -L -o RankLib-2.1-patched.jar http://downloads.sourceforge.net/project/lemur/lemur/RankLib-2.1/RankLib-2.1-patched.jar?r=http%3A%2F%2Fsourceforge.net%2Fprojects%2Flemur%2Ffiles%2Flemur%2FRankLib-2.1%2F&ts=1439317425&use_mirror=skylink
Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

I was wondering if we could add RankLib as a maven dependency ? There is a suggestion on how to do it here http://sourceforge.net/p/lemur/discussion/ranklib/thread/a45e2a7c/?limit=25.

Copy link
Member

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

Yeah, might be nice to do that! On the other hand it's only a training-time dependency, right?

Copy link
Member Author

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

True, but I find it a bit ugly downloading the jar on training time.

Choose a reason for hiding this comment

The reason will be displayed to describe this comment to others. Learn more.

We do the same type of thing for pignlproc and other training dependencies, I think I kind of imitated that

Philipp Dowling and others added 24 commits March 20, 2016 17:07
…working and we can run disambiguations. Also using camel case now ;)
…nted vector store building in createspotlightmodel
…t to be worked out here. Also added a standalone w2v model file converter.
Sign up for free to subscribe to this conversation on GitHub. Already have an account? Sign in.
Labels
None yet
Projects
None yet
3 participants