Expose embeddings API #490
Conversation
`edgedb/ai/core.py` (outdated diff):

```python
                stream=True,
            ).to_httpx_request(),
        ) as event_source:
            event_source.response.raise_for_status()
            for sse in event_source.iter_sse():
                yield sse.data

    def generate_custom_embeddings(self, *inputs: str, model: str):
```
Working on the JS version of this, my instinct was to name this `generateEmbeddings`. Is there a significance to the "custom" here? Does it help to denote that these are not the automatically indexed ones?
Yeah, that was my intention, but I'm not sure "custom" makes sense here (as a word in English).
I wonder if `custom` communicates enough to pay for the cost. `search` would be even more specific and point at the intended use case, but maybe that's too restrictive since you might use it outside of `ext::ai::search`? Does `generate_embeddings` make it seem too much like you're triggering something rather than doing this one-off embedding generation?

The only reason I ask is that having "custom" here would make me as a developer want to know more about what "custom" means and what other non-custom methods there might be. Maybe that's a good thing and worth adding here, but it feels a little like an unnecessary mental speed bump.
> Does `generate_embeddings` make it seem too much like you're triggering something rather than doing this one-off embedding generation?

Yeah, I had the same struggle. I thought about `retrieve_embeddings()`, which is just worse. (Maybe `generate_oneoff_embeddings()`?)
> Maybe that's a good thing and worth adding here, but it feels a little like an unnecessary mental speed bump.

Right, I'll also change it to `generate_embeddings()` and add an explanation in the docs.
```diff
-        ):
+        ) -> typing.Iterator[str]:
             if context is None:
                 context = self.context
```
This returned `str` is a JSON string like:

```json
{"type": "content_block_delta", "index": 0, "delta": {"type": "text_delta", "text": " blocking"}}
```
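A minimal sketch of how a consumer might unpack each yielded JSON string, assuming the event shape shown above; the function name `extract_text_delta` and the skip-unknown-events behavior are illustrative assumptions, not part of this PR:

```python
import json

def extract_text_delta(raw: str) -> str:
    """Parse one streamed JSON chunk (a str as yielded by the iterator)
    and return its text fragment, or "" for any other event type.
    Assumes the content_block_delta/text_delta shape from the example."""
    event = json.loads(raw)
    if event.get("type") == "content_block_delta":
        delta = event.get("delta", {})
        if delta.get("type") == "text_delta":
            return delta.get("text", "")
    return ""

chunks = [
    '{"type": "content_block_delta","index":0,"delta":{"type": "text_delta", "text": " blocking"}}',
]
print("".join(extract_text_delta(c) for c in chunks))  # prints " blocking" (with the leading space)
```

Accumulating the `text` fields in order reconstructs the streamed completion; unrecognized event types are silently skipped here, which may or may not match the server's full event vocabulary.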
This PR adds an `EdgeDBAI.generate_embeddings()` function that lets users generate embedding vectors from custom input text without using the AI index in EdgeDB. I'll create another PR to add docs.