
Implement vector memory #13352

Merged — merged 39 commits into main from jerry/memory_modules on May 25, 2024
Conversation

@jerryjliu (Collaborator) commented May 8, 2024

UPDATE

  • Re-jigged vector memory so it only supplies a list of chat messages and does not handle anything to do with composing messages
  • Added BaseComposableMemory and SimpleComposableMemory for better separation of concerns
    • With this class, we can now pass a VectorMemory and a ChatMemoryBuffer to a SimpleComposableMemory, which performs a simple composition of the messages retrieved from all of its memory sources
  • Modified vector_memory.ipynb and added a new composable_memory.ipynb (added these to the docs as well)
  • Added these classes to the API reference (I think?)
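The composition described above can be sketched with minimal stand-in classes. The names mirror the PR (SimpleComposableMemory, VectorMemory, ChatMemoryBuffer), but the bodies here are illustrative assumptions, not the library implementation — the real VectorMemory does embedding-based retrieval, whereas this sketch fakes it with substring matching:

```python
from dataclasses import dataclass
from typing import List, Optional

@dataclass
class ChatMessage:
    role: str
    content: str

class ChatMemoryBuffer:
    """Stand-in buffer memory: returns recent messages verbatim."""
    def __init__(self) -> None:
        self.messages: List[ChatMessage] = []
    def put(self, msg: ChatMessage) -> None:
        self.messages.append(msg)
    def get(self, input: Optional[str] = None) -> List[ChatMessage]:
        return list(self.messages)

class VectorMemory:
    """Stand-in retrieval memory: substring match fakes vector search."""
    def __init__(self) -> None:
        self.messages: List[ChatMessage] = []
    def put(self, msg: ChatMessage) -> None:
        self.messages.append(msg)
    def get(self, input: Optional[str] = None) -> List[ChatMessage]:
        if input is None:
            return []
        return [m for m in self.messages if input.lower() in m.content.lower()]

class SimpleComposableMemory:
    """Compose messages from all memory sources: retrieved context from
    secondary sources is prepended to the primary memory's messages, deduped."""
    def __init__(self, primary, secondary_sources) -> None:
        self.primary = primary
        self.secondary = secondary_sources
    def get(self, input: Optional[str] = None) -> List[ChatMessage]:
        msgs = self.primary.get(input=input)
        seen = {(m.role, m.content) for m in msgs}
        extra: List[ChatMessage] = []
        for src in self.secondary:
            for m in src.get(input=input):
                key = (m.role, m.content)
                if key not in seen:
                    seen.add(key)
                    extra.append(m)
        return extra + msgs
```

The ordering choice (retrieved messages before the live buffer) is one reasonable composition; the actual class may merge differently.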

OLDER

This was much harder than I originally anticipated. Summary of the main changes:

general interface stuff:

  • added an input argument to the get function in memory
  • added put_messages to memory
  • replaced all memory.set(...) calls in the agent with memory.put_messages for updating state
  • added BaseChatStoreMemory for all memory modules that depend on a chat store
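A minimal sketch of what this interface could look like — the class and method names follow the bullets above, but the shapes are assumptions, not the actual llama_index signatures:

```python
from abc import ABC, abstractmethod
from typing import Any, List, Optional

class BaseMemory(ABC):
    """Hypothetical sketch of the interface changes described above."""

    @abstractmethod
    def get(self, input: Optional[str] = None) -> List[Any]:
        """Return chat messages; the `input` argument lets retrieval-based
        memories condition on the latest user message."""

    @abstractmethod
    def put(self, message: Any) -> None:
        """Store a single chat message."""

    def put_messages(self, messages: List[Any]) -> None:
        # Batch insert: this is the call that replaces agent-side
        # memory.set(...) for updating state.
        for message in messages:
            self.put(message)

class ListMemory(BaseMemory):
    """Trivial concrete memory for demonstration only."""
    def __init__(self) -> None:
        self._messages: List[Any] = []
    def get(self, input: Optional[str] = None) -> List[Any]:
        return list(self._messages)
    def put(self, message: Any) -> None:
        self._messages.append(message)
```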

implemented vector memory:

  • track each "node" in a vector index
  • each "node" is by default a group of messages rather than a single message. If we indexed individual messages, we would retrieve just the user message or just the assistant message, but not both; you might end up retrieving a user message and an unrelated assistant message, leading to hallucinations
  • we track the current "message batch" in the chat store, accumulating the user message, then the assistant/tool messages, etc.
  • we also track all message ids in a separate collection in the chat store. This lets us perform deletes, which currently consist of deleting messages one by one.
  • NOTE: deletes from vector stores are hard; we use some hacks to make delete_ref_doc work
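The batching rule in the second bullet — each node is a user message plus the assistant/tool messages that follow it — can be sketched as a small helper (the function name is hypothetical; the real implementation accumulates the batch incrementally via the chat store rather than post-hoc):

```python
from typing import List, Tuple

Message = Tuple[str, str]  # (role, content)

def batch_messages(messages: List[Message]) -> List[List[Message]]:
    """Group each user message with the assistant/tool messages that follow it,
    so retrieval returns the whole exchange rather than one side of it."""
    batches: List[List[Message]] = []
    current: List[Message] = []
    for role, content in messages:
        # A new user message closes the previous batch and starts a new one.
        if role == "user" and current:
            batches.append(current)
            current = []
        current.append((role, content))
    if current:
        batches.append(current)
    return batches
```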

@dosubot dosubot bot added the size:XL This PR changes 500-999 lines, ignoring generated files. label May 8, 2024

@jerryjliu jerryjliu marked this pull request as draft May 8, 2024 04:43
self.vector_index.delete_nodes([last_node_id])

self.vector_index.insert_nodes([super_node])
self.chat_store.add_message(self.chat_store_key, ChatMessage(content=node_id))
Collaborator

A little confused about the flow here.

The chat store is keeping track of the inserted node ids (effectively maintaining order)? Not 100% sure what override_last is for.

Contributor

I think override_last is needed since we're updating the "batch" as we go. If it's not the end of the batch, we have to delete the last inserted node and replace it with the newly updated one, i.e. the one that has the latest message.

Contributor

The chat_store is acting as a temp buffer (I think you mentioned this before, and I'm just getting caught up now) to track the current user batch, i.e., group.

Collaborator Author

Yeah, tbh it's a bit hacky, and we're not using the chat store for its intended purpose. But if we want to batch messages together, we need some way of making sure the latest node (1) is always updated, but (2) can still be returned as part of vector search.
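The delete-then-reinsert flow discussed here can be sketched as follows. TinyIndex and put_batch_node are illustrative stand-ins (the real code calls vector_index.delete_nodes / insert_nodes, as in the snippet above), assuming override_last means "the current batch is still open, so replace its last node":

```python
from typing import Dict, List, Tuple

class TinyIndex:
    """Dict-backed stand-in for a vector index, keyed by node id."""
    def __init__(self) -> None:
        self.nodes: Dict[str, str] = {}
    def insert_nodes(self, nodes: List[Tuple[str, str]]) -> None:
        for node_id, text in nodes:
            self.nodes[node_id] = text
    def delete_nodes(self, node_ids: List[str]) -> None:
        for node_id in node_ids:
            self.nodes.pop(node_id, None)

def put_batch_node(index: TinyIndex, tracked_ids: List[str],
                   node_id: str, batch_text: str, override_last: bool) -> None:
    """If the batch is still accumulating (override_last=True), delete the
    previously inserted node and replace it with the updated super-node, so the
    latest node is always current yet still reachable via vector search."""
    if override_last and tracked_ids:
        index.delete_nodes([tracked_ids.pop()])
    index.insert_nodes([(node_id, batch_text)])
    tracked_ids.append(node_id)
```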

Contributor

See the working PR for removing chat_store as a buffer here: #13729

@nerdai nerdai marked this pull request as ready for review May 21, 2024 15:35
@nerdai nerdai changed the title [WIP] implement vector memory Implement vector memory May 21, 2024
chat_store: BaseChatStore = Field(default_factory=SimpleChatStore)
# NOTE/TODO: we need this to store ids for the messages
# This is not needed once vector stores implement delete_all capabilities
chat_store_key: str = Field(default=DEFAULT_CHAT_STORE_KEY)
Collaborator Author

Hm, I guess we don't have delete_all in the vector store yet, so we still need to maintain a list of all the ids.

Maybe fine for now, but we may want to change this later to make vector memory less brittle (right now the chat store is in memory, so if we reinitialize the vector memory module there's no easy way to delete old data in vector memory).
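The workaround being discussed — deleting tracked ids one by one because the vector store has no delete_all — could look roughly like this. TinyIndex and clear_memory are hypothetical stand-ins for illustration only:

```python
from typing import Dict, List, Tuple

class TinyIndex:
    """Dict-backed stand-in for a vector index, keyed by node id."""
    def __init__(self) -> None:
        self.nodes: Dict[str, str] = {}
    def insert_nodes(self, nodes: List[Tuple[str, str]]) -> None:
        for node_id, text in nodes:
            self.nodes[node_id] = text
    def delete_nodes(self, node_ids: List[str]) -> None:
        for node_id in node_ids:
            self.nodes.pop(node_id, None)

def clear_memory(index: TinyIndex, tracked_ids: List[str]) -> None:
    """Without a delete_all on the vector store, fall back to deleting the
    tracked node ids one by one, then reset the id list."""
    for node_id in list(tracked_ids):
        index.delete_nodes([node_id])
    tracked_ids.clear()
```

This is exactly why the id list must survive reinitialization: if it lives only in an in-memory chat store, the nodes it pointed at become undeletable.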

Contributor

I think we have a clear() method now that should delete all. Tbh, I wasn't 100% clear on what this note meant. Are you saying that we only need chat_store to maintain the ids, and since we have clear() we can implement this without the chat_store?

Collaborator Author

Yeah, basically. I think we still need the cur_user_msg_key for batching, but we no longer need to have a chat store track all the ids.

Contributor

Sorry, I'm still hung up on this: if we do a delete_all on the vector store, wouldn't that remove all past batches in addition to the current running batch? I think we would want to delete only the current running batch, so that we can start a new one with the incoming user message.

Contributor

Left this for a future PR, but I've started a working branch/PR that removes chat_store. I've effectively got it working, but it needs further polishing.

#13729

@dosubot dosubot bot added the lgtm This PR has been approved by a maintainer label May 24, 2024
@nerdai nerdai enabled auto-merge (squash) May 25, 2024 10:26
@nerdai nerdai merged commit 7a7cc05 into main May 25, 2024
8 checks passed
@nerdai nerdai deleted the jerry/memory_modules branch May 25, 2024 10:34
DarkLight1337 added a commit to DarkLight1337/llama_index that referenced this pull request May 27, 2024
Labels: lgtm (This PR has been approved by a maintainer), size:XXL (This PR changes 1000+ lines, ignoring generated files)

3 participants