Block Filtering and Block Purging after Vector Based Blocking #12

I suggest you start by experimenting with the blocking parameters:

  • top_k=5 (try values from 5 to 20)
  • similarity_distance='euclidean' (or 'cosine')

and then with the clustering parameter:

  • similarity_threshold=0.4 (any value from 0 to 1)

Alternatively, you can tune these automatically by following the Optuna tutorial: https://pyjedai.readthedocs.io/en/latest/tutorials/Optuna.html
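
If you want to try the combinations systematically before moving to Optuna, a plain grid sweep is probably enough. The sketch below is only an illustration: `sweep`, `run_pipeline` and `toy_pipeline` are hypothetical names, not part of pyJedAI, and the value grids are just one way to cover the ranges mentioned above. You would write `run_pipeline` yourself so that it runs blocking and clustering with the given settings and returns a single score to maximize.

```python
from itertools import product


def sweep(run_pipeline,
          top_ks=(5, 10, 15, 20),
          distances=("euclidean", "cosine"),
          thresholds=(0.2, 0.4, 0.6, 0.8)):
    """Score every parameter combination and return the best one.

    run_pipeline is a hypothetical callable that you write yourself: it should
    run blocking + clustering with the given settings and return one number
    to maximize (for example F1 against your ground truth).
    """
    results = []
    for top_k, distance, threshold in product(top_ks, distances, thresholds):
        params = {
            "top_k": top_k,
            "similarity_distance": distance,
            "similarity_threshold": threshold,
        }
        results.append((run_pipeline(**params), params))
    best_score, best_params = max(results, key=lambda item: item[0])
    return best_score, best_params, results


# Placeholder objective so the sketch runs on its own; replace it with a
# function that actually executes your pipeline and evaluates the result.
def toy_pipeline(top_k, similarity_distance, similarity_threshold):
    bonus = 1.0 if similarity_distance == "cosine" else 0.0
    return bonus - abs(top_k - 10) - abs(similarity_threshold - 0.4)


best_score, best_params, results = sweep(toy_pipeline)
print(best_params, best_score)
```

With the default grids this evaluates 32 combinations, so it only makes sense while a single run is cheap; for finer grids, the Optuna approach is likely the better fit.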

Replies: 3 comments

Answer selected by Nikoletos-K
Category: Q&A
Labels: help wanted
2 participants
This discussion was converted from issue #10 on July 19, 2023 08:59.