Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

[WIP] Task reorganise deduplicates #735

Open
wants to merge 14 commits into
base: main
Choose a base branch
from
Open

Conversation

jum-s
Copy link
Contributor

@jum-s jum-s commented Feb 28, 2024

This PR is a preparation for tasks related issues (namely #724 and #491) which will introduce another type (in task document), namely delete. This addition will involve several API changes, which this PR tries to address:

  • rename endpoint by-type -> by-entities-type and move parameter and type to entities-type see commit named "add a task type parameter"
  • add a minimumScoreToAutogenerate config parameter to not auto generate too many entities (see 97a0e89)

namely:
  check_entities -> check_human_duplicates
  check_entity.js -> check_human_duplicate
  collect_entities -> collect_human_duplicates
laying ground for "delete" type tasks

this also renames the current "type" parameter to "entitiesType", this naming was here to avoid taking a decision about multi-words request parameters

this commit chose camel case, but next commit is the alternative kebab possibility
since actions are already using kebab
since checkEntity has more complex results
threshold value is based on elastic score to create a defined amount of
tasks.

Its important to reject those suggestions after automerge, as a
conservative feature to not automerge too many tasks only based on
suggestions with a high lexical score, to not wrongly merge homonyms
to be able to later create deduplicate human tasks
@jum-s jum-s added the tasks label Feb 28, 2024
@jum-s jum-s requested a review from maxlath February 28, 2024 14:24
@jum-s jum-s changed the title Task reorganise duplicates [WIP] Task reorganise deduplicates Feb 28, 2024
makes a reposability separation between hooks and endpoint. It keeps couch view keys simple (not having byUrisAndbyStateAndByType) while restricts the endpoint access to stateless (aka not arcchived) tasks. This is possible since the client does not need to access archived tasks for the moment, so the new couch view only returns non-archived tasks.
which is currently not in use by the client, but leaving it since entity layout will actually reuse it to display its own related tasks one day
Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

1 participant