Skip to content
New issue

Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.

By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.

Already on GitHub? Sign in to your account

Auto metadata chunk and filter #85

Merged
merged 2 commits into from
May 17, 2024

Conversation

ElioErrico
Copy link
Contributor

This plugin:

  1. classify your documents with metadata
  2. filter your documents filters by metadata.

How it works???

    1. Concatenates a parameterizable number of chunks (set the number in settings)
  1. Reads the possible metadata in list_of_tags.json (You could edit the list depending on your documentation, Do not cancel "no classification" from the list)
  2. Evaluates what is the correct metadata for each chunk analizing the concatenated content and classifies it using the list in list_of_tags.json
  3. Upload each chunk that composes the concatenated content with the discoreved metadada
  4. If the cat cannot find the correct metadata, generates a possible metadata of the chunk (set the prompt and/or disable this function from settings)
  5. Before recalling declarative memory analyses and classifies the chat_history using same list_of_tags.json and filters the documentation

How to use it:

  1. edit the list_of_tags.json with the keywords of your document
  2. the cat will classify your documentation with the list of the keywords (if you don't edit list_of_tags.json the cat will create the metadata)
  3. upload the document
  4. chat with the cat

@pieroit pieroit merged commit 1cd2112 into cheshire-cat-ai:main May 17, 2024
1 check passed
@pieroit
Copy link
Member

pieroit commented May 17, 2024

Nice plugin, thanks @ElioErrico :D

Sign up for free to join this conversation on GitHub. Already have an account? Sign in to comment
Labels
None yet
Projects
None yet
Development

Successfully merging this pull request may close these issues.

None yet

2 participants