[Bug]: FirestoreKVStore's aget_all raises AttributeError when collection is not empty #13385

gblazq · 2024-05-09T08:26:26Z

Bug Description

The aget_all method in FirestoreKVStore is assuming the type signature of google.cloud.firestore_v1.async_collection.AsyncCollectionReference.list_documents to be AsyncGenerator[DocumentReference, None]. However, since this is an async collection, the actual return type is AsyncGenerator[AsyncDocumentReference, None] (googleapis/python-firestore#918)

Assuming the wrong return type raises AttributeError: 'coroutine' object has no attribute 'to_dict' on this line

Instead, since doc is an AsyncDocumentReference, it should be awaited before accessing the to_dict method.

Version

0.10.35.post1

Steps to Reproduce

Add documents to a FirestoreKVStore
Get all documents asynchronously with aget_all

Relevant Logs/Tracbacks

Traceback (most recent call last):
  File "/Users/gubl/repos/mlt-content-generator-backend/contentgenerator/ingestion_pipeline.py", line 196, in <module>
    asyncio.run(main())
  File "/Users/gubl/.pyenv/versions/3.10.12/lib/python3.10/asyncio/runners.py", line 44, in run
    return loop.run_until_complete(main)
  File "/Users/gubl/.pyenv/versions/3.10.12/lib/python3.10/asyncio/base_events.py", line 649, in run_until_complete
    return future.result()
  File "/Users/gubl/repos/mlt-content-generator-backend/contentgenerator/ingestion_pipeline.py", line 192, in main
    await asyncio.gather(documents_pipeline.arun(), keywords_pipeline.arun())
  File "/Users/gubl/Library/Caches/pypoetry/virtualenvs/mlt-content-generator-backend-mezdK3Cq-py3.10/lib/python3.10/site-packages/llama_index/core/ingestion/pipeline.py", line 858, in arun
    nodes_to_run = await self._ahandle_upserts(
  File "/Users/gubl/Library/Caches/pypoetry/virtualenvs/mlt-content-generator-backend-mezdK3Cq-py3.10/lib/python3.10/site-packages/llama_index/core/ingestion/pipeline.py", line 781, in _ahandle_upserts
    (await self.docstore.aget_all_document_hashes()).values()
  File "/Users/gubl/Library/Caches/pypoetry/virtualenvs/mlt-content-generator-backend-mezdK3Cq-py3.10/lib/python3.10/site-packages/llama_index/core/storage/docstore/keyval_docstore.py", line 577, in aget_all_document_hashes
    for doc_id in await self._kvstore.aget_all(
  File "/Users/gubl/Library/Caches/pypoetry/virtualenvs/mlt-content-generator-backend-mezdK3Cq-py3.10/lib/python3.10/site-packages/llama_index/storage/kvstore/firestore/base.py", line 209, in aget_all
    data = doc.get().to_dict()
AttributeError: 'coroutine' object has no attribute 'to_dict'

The text was updated successfully, but these errors were encountered:

dosubot · 2024-05-09T08:27:19Z

The issue you're experiencing with FirestoreKVStore's aget_all method is due to not awaiting the doc.get() coroutine before attempting to call to_dict() on its result. This leads to the AttributeError: 'coroutine' object has no attribute 'to_dict' because doc.get() returns a coroutine that needs to be awaited to obtain the document snapshot, which then allows you to call .to_dict() on the snapshot.

Here's how you can fix the method:

async def aget_all(self, collection: str = DEFAULT_COLLECTION) -> Dict[str, dict]:
    collection_id = self.firestore_collection(collection)
    docs = self._adb.collection(collection_id).list_documents()
    output = {}
    async for doc in docs:
        key = doc.id
        doc_ref = await doc.get()  # Await the coroutine to get the document snapshot
        if doc_ref.exists:
            data = doc_ref.to_dict()  # Now you can safely call to_dict on the document snapshot
            if data is None:
                continue
            val = self.replace_field_name_get(data)
            output[key] = val
    return output

By awaiting doc.get(), you ensure that you're working with a document snapshot from which you can safely call .to_dict(), thus resolving the AttributeError.

Details

Sources

About Dosu
This response is meant to be useful and save you time. It is not meant to be a precise solution, but rather a starting point for your own research.

Help me be more useful! Please leave a 👍 if this is helpful and 👎 if it is irrelevant.

To continue the conversation, mention @dosu.

gblazq added bug Something isn't working triage Issue needs to be triaged/prioritized labels May 9, 2024

gblazq mentioned this issue May 9, 2024

Await Firestores's AsyncDocumentReference before converting to dict #13386

Merged

19 tasks

logan-markewich closed this as completed in #13386 May 9, 2024

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

[Bug]: FirestoreKVStore's aget_all raises AttributeError when collection is not empty #13385

[Bug]: FirestoreKVStore's aget_all raises AttributeError when collection is not empty #13385

gblazq commented May 9, 2024

dosubot bot commented May 9, 2024 •

edited

Details

[Bug]: FirestoreKVStore's aget_all raises AttributeError when collection is not empty #13385

[Bug]: FirestoreKVStore's aget_all raises AttributeError when collection is not empty #13385

Comments

gblazq commented May 9, 2024

Bug Description

Version

Steps to Reproduce

Relevant Logs/Tracbacks

dosubot bot commented May 9, 2024 • edited

Details

dosubot bot commented May 9, 2024 •

edited