Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

TODO (2 week deadline) #2

Open
4 of 9 tasks
bakerwho opened this issue May 24, 2020 · 1 comment
Open
4 of 9 tasks

TODO (2 week deadline) #2

bakerwho opened this issue May 24, 2020 · 1 comment
Assignees

Comments

@bakerwho
Copy link
Owner

bakerwho commented May 24, 2020

Build Python modules to:

  • negatively sample facts (each fact is 3 numbers) from SVO dataset (n_neg_facts, sampling_params?)
  • compile datasets using specified numbers/ratios of true & false facts
  • adapt torch-kge or other code to run on our datasets
  • build 
visualizations (PCA) of embeddings

  • build Word2Vec/glove embeddings with BOW for comparison
  • build linear classifier + MLP to classify true/false based on embeddings


For June 1 Monday:

  • Create a reduced dataset (50-50 T/F) and get it pre-processed enough to run TransE, TransH, HolE
  • MAYBE start building visualization code?

Spacy-NOW & WikiHumans:

  • List of named entities, or (better) (head, , tail) triples.
@adarshmathew adarshmathew self-assigned this May 27, 2020
@bakerwho
Copy link
Owner Author

bakerwho commented May 27, 2020

For June 1 Monday:

  • Create a reduced dataset (50-50 T/F) and get it pre-processed enough to run TransE, TransH, HolE
  • MAYBE start building visualization code?

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

No branches or pull requests

2 participants