New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
Display remaining tasks overview #724
Comments
What do you think of unifying user generated and robot generated merge tasks? The current human deduplication process would just be one provider among others of merge tasks, specialized on humans, but the rest would be entity-type agnostic and reporter (user or bot) agnostic(?). In that direction, and working from memories, I think it might make sense to make that author deduplication process create less tasks: it automerges what it can and creates tasks when it's not quite sure, but doesn't create a task for every homonym returned by Elasticsearch(?) as that information is of lower quality than if a user reports that A and B should be merged. |
I had in mind a hard split (different category) to give priority to Reducing the amount of autogenerated tasks seems like a good idea. The easiest would be to introduce a hardcoded threshold on score (dont create task if score is lower than 100) |
Here is a query to find the 10000th task sorted by descending score:
Rough idea of the results: This could allow us to set a threshold of |
In today's codebase:
deduplicate
) stored in dbsuspectUri
andsuggestionUri
pair is uniquededuplicate
tasks, based on their entity type (human
andwork
)human
entities are generated automatically, which currently creates a lot of taskswork
entities are based on user feedback ; those have areporter: userId
The objective is to have users access a dashboard of main tasks to do, aka grouping tasks by user interests (categories)
Proposed dashboard categories: [edited to integrate max comment below]
merge
: entities could be of any type.works
: already developedhumans
: should first return all user feedback tasks (aka "reporter tasks"), then autogenerated tasksdelete
: entities could be of any typeworks
humans
Proposed implementation:
delete
task type, asidededuplicate
(which could be renamed tomerge
(?))by-entities-type
->by-type
: endpoint would have two arguments:type
andentities-type
. To be able to queryaction=by-type?type=deduplicate&entities-type=humans
which would return only tasks with a reporterby-score
endpoint)merge
and adelete
task, endpointby-suspect-uri
would necessarily need atype
argument.suspectUri
->uri
(assuspect
term does not qualify anything useful)The text was updated successfully, but these errors were encountered: