-
Notifications
You must be signed in to change notification settings - Fork 34
New issue
Have a question about this project? Sign up for a free GitHub account to open an issue and contact its maintainers and the community.
By clicking “Sign up for GitHub”, you agree to our terms of service and privacy statement. We’ll occasionally send you account related emails.
Already on GitHub? Sign in to your account
(New Construct): LangServ Support #249
Comments
Thank you for the suggestion @tech4life87 ! Do you mind describing a few things for us:
|
@laithalsaadoon I just updated the issue with these details. |
Hi @tech4life87 , thank your for this suggestion, looks very interesting !
|
@tech4life87 any update on this one ? |
|
great @tech4life87 , thanks ! I see you mentioned you may be able to implement this feature, would you be able to create a draft of this construct in a branch ? |
Yes I will start working on it |
Thank you @tech4life87 ! I will assign the ticket to you for now |
Hi @tech4life87 , any update on this ? Thank you ! |
Describe the feature
This construct deploys Langserv a FastAPI based framework that helps developers deploy LangChain's Runnables and Chains as REST API in production.
Use Case
Langserv simplifies the deployment of Large language Models from a prototype to production with industry best practices such as
Proposed Solution
At a high level this construct will deploy LangServ and LangFuse on AWS using Services such as Fargate, ECS and RDS but will depend on existing constructs such as aws_ecs_patterns. Developers will be able to make custom REST api calls to FastAPI endpoints with support for streaming, batch and invoke. The construct will be LLM provider agnostic such as Bedrock, SageMaker or VLLM on EC2 Instances giving developers flexibility to experiment with different model providers.
The construct will also provide monitoring and observability capability using LangFuse giving developers the flexibility to trace and debug LLM applications, monitor latency and model evaluation capabilities.
No response
Other Information
No response
Acknowledgements
Tasks
The text was updated successfully, but these errors were encountered: