Add ingestion harness for vectorDB experiments #49

Open
steventkrawczyk opened this issue Aug 1, 2023 · 0 comments
Labels
enhancement New feature or request vector_databases

Comments

@steventkrawczyk
Contributor

🚀 The feature

We need a way to experiment with different chunking and ingestion strategies. Suppose we have some "raw" documents that we want to ingest into a vector database. There are several ways to transform those raw documents into the documents we end up vectorizing: we can ingest them as is, "chunk" them into 10-line chunks, or do other pre-processing to extract keywords and relevant phrases.
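To make the idea concrete, here is a minimal sketch of what pluggable strategies could look like. It is not an existing API in this project: the names `ChunkingStrategy`, `as_is`, `line_chunks`, `top_keywords`, `STRATEGIES`, and `prepare` are all hypothetical, and the keyword extraction is a naive word-frequency stand-in for whatever pre-processing would really be used.

```python
import re
from collections import Counter
from typing import Callable, Dict, List

# Hypothetical type: a strategy maps one raw document to the
# documents that would actually be vectorized.
ChunkingStrategy = Callable[[str], List[str]]


def as_is(raw: str) -> List[str]:
    """Ingest the raw document unchanged."""
    return [raw]


def line_chunks(raw: str, lines_per_chunk: int = 10) -> List[str]:
    """Split the raw document into chunks of at most `lines_per_chunk` lines."""
    lines = raw.splitlines()
    return [
        "\n".join(lines[i:i + lines_per_chunk])
        for i in range(0, len(lines), lines_per_chunk)
    ]


def top_keywords(raw: str, k: int = 5) -> List[str]:
    """Naive placeholder: keep the k most frequent words of 4+ letters."""
    words = re.findall(r"[a-z]{4,}", raw.lower())
    keywords = [word for word, _ in Counter(words).most_common(k)]
    return [" ".join(keywords)]


# Hypothetical registry so an experiment can iterate over strategies by name.
STRATEGIES: Dict[str, ChunkingStrategy] = {
    "as_is": as_is,
    "10_line_chunks": line_chunks,
    "keywords": top_keywords,
}


def prepare(raw_docs: List[str], strategy: ChunkingStrategy) -> List[str]:
    """Apply one strategy to every raw document and flatten the results."""
    return [doc for raw in raw_docs for doc in strategy(raw)]
```

An experiment could then loop over `STRATEGIES`, call `prepare` on the same raw documents for each entry, ingest the results into the vector database under test, and compare retrieval quality across strategies.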

Motivation, pitch

This came out of talking to some customers about their needs regarding vector DB evaluation at scale.

Alternatives

No response

Additional context

No response
