AI App Template

A template project to run ingestion and querying with AWS services.

Features

common: Common functions, e.g jwt decode, get env var
composer: Compose LLMs input prompt
database: Database module to interact with database (dynamodb)
document: Document module to parse documents and build document nodes, to chunk document with overlapped chunking
helpers: Helper functions for aws services
indexer: Indexer module to build vector graph with embedding models
lambdas: AWS lambdas functions to do ingestion with SQS, querying, slack API
resources: PDFium resource which need to mount in AWS Index lambda to parse PDF
slack: Slack module to handle slack integration

When an user uploads document to the system, system saves the document in S3
A indexing task is created
Document analyser analyzes the document layout and build a document graph
A document vector graph is created respect to the document graph with embedding model and store in S3
A overlapped chunking method is applied to reduce chance for incomplete context
User can associate the document to a collection for multiple documents querying

When received an user query
Query is embedded with embedding model
System scans all documents in the target collection and filter with cosine similarity
System picks top K document graph nodes
System constructs the GPT prompt with selected nodes as context
System send the enriched query to external GPT service
When system got response from external GPT service, a callback request will be triggered

Setup DynamoDB with stream filter which can in found in readme file.
Mount PDFium resources to lambda need to run PDF parsing. e.g. document-indexer lambda
Mount embedding model resources to lambda need to run embedding. e.g. document-indexer lambda and seach-api lambda
Map API lambdas with API gateway and set up auth

Every lambda function in lambdas has two deployment command.

Name		Name	Last commit message	Last commit date
Latest commit History 3 Commits
.devcontainer		.devcontainer
common		common
composer		composer
database		database
docs		docs
document		document
helpers		helpers
indexer		indexer
lambdas		lambdas
resources/lib		resources/lib
slack		slack
.env-template		.env-template
.gitignore		.gitignore
Cargo.toml		Cargo.toml
LICENSE		LICENSE
Makefile.toml		Makefile.toml
README.md		README.md
rustfmt.toml		rustfmt.toml