Bedrock AI models add usage information. #605

maxjiang153 · 2024-04-19T11:02:13Z

This PR closes: #600

When the Bedrock invoke model API is called, Bedrock returns the input token count, output token count, and latency in the headers of the SdkHttpResponse.

This PR extracts that information as ChatResponseMetadata and attaches it to the ChatResponse.
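For readers following along, here is a minimal sketch of the header extraction described above. It is not the code from this PR: the class `BedrockUsageHeaders`, the record `Usage`, and the method `fromHeaders` are hypothetical names, and the header map is a plain `Map<String, List<String>>` standing in for what the SDK response exposes. The header names are the ones the Bedrock InvokeModel response is documented to carry; the latency value is assumed to be in milliseconds.

```java
import java.time.Duration;
import java.util.List;
import java.util.Map;
import java.util.Optional;

public class BedrockUsageHeaders {

    // Response header names as documented for Bedrock InvokeModel.
    static final String INPUT_TOKENS = "X-Amzn-Bedrock-Input-Token-Count";
    static final String OUTPUT_TOKENS = "X-Amzn-Bedrock-Output-Token-Count";
    static final String LATENCY = "X-Amzn-Bedrock-Invocation-Latency";

    /** Hypothetical value type; a field is null when its header is missing or not numeric. */
    record Usage(Long inputTokens, Long outputTokens, Duration latency) {
    }

    /** Builds a Usage from a header map, matching header names case-insensitively. */
    static Usage fromHeaders(Map<String, List<String>> headers) {
        Long input = firstLong(headers, INPUT_TOKENS).orElse(null);
        Long output = firstLong(headers, OUTPUT_TOKENS).orElse(null);
        // Assumption: the latency header holds a millisecond count.
        Duration latency = firstLong(headers, LATENCY).map(Duration::ofMillis).orElse(null);
        return new Usage(input, output, latency);
    }

    // Returns the first value of the named header parsed as a long, if present and numeric.
    private static Optional<Long> firstLong(Map<String, List<String>> headers, String name) {
        return headers.entrySet().stream()
                .filter(e -> e.getKey().equalsIgnoreCase(name))
                .flatMap(e -> e.getValue().stream())
                .findFirst()
                .map(String::trim)
                .flatMap(BedrockUsageHeaders::parseLong);
    }

    private static Optional<Long> parseLong(String value) {
        try {
            return Optional.of(Long.parseLong(value));
        } catch (NumberFormatException ex) {
            return Optional.empty();
        }
    }

    public static void main(String[] args) {
        Usage usage = fromHeaders(Map.of(
                INPUT_TOKENS, List.of("12"),
                OUTPUT_TOKENS, List.of("34"),
                LATENCY, List.of("567")));
        System.out.println(usage);
    }
}
```

Missing or malformed headers yield null fields rather than an exception, so a caller can still build a ChatResponse when a model omits usage data.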

tzolov · 2024-04-21T09:52:28Z

@wmz7year thank you for looking at this. It would be a valuable contribution to add more, structured usage metadata.

From the PR code, I have the impression that we can implement (most of) this inside AnthropicChatBedrockApi.java without having to expose it via the AmazonBedrockInvocationContext, which would also reduce some unnecessary code repetition.

For example, it looks like BedrockAnthropicChatResponseMetadata, BedrockAnthropic3ChatResponseMetadata, CohereChatResponseMetadata, BedrockAi21Jurassic2ChatResponseMetadata, BedrockLlama2ChatResponseMetadata and BedrockTitanChatResponseMetadata are all the same. If so, why not create a single one and use it inside AnthropicChatBedrockApi to generate the metadata there?

What do you think about this approach?

maxjiang153 · 2024-04-21T10:07:23Z

The idea behind AmazonBedrockInvocationContext is that the Amazon Bedrock invoke model API will return input/output tokens and latency information from HTTP headers, not only from the response body. So I wrap it up as an AmazonBedrockInvocationContext to include more information besides the API response.

Check this out:

spring-ai/models/spring-ai-bedrock/src/main/java/org/springframework/ai/bedrock/api/AbstractBedrockApi.java

Line 220 in 65e6880

protected O internalInvocation(I request, Class<O> clazz) {
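To illustrate the idea of returning the headers alongside the deserialized body, here is a rough sketch. It is not the AmazonBedrockInvocationContext from this PR, whose exact shape is not shown in this conversation: `InvocationContextSketch`, `InvocationContext` and `firstHeader` are hypothetical names, and the headers are modeled as a plain map.

```java
import java.util.Collections;
import java.util.List;
import java.util.Map;
import java.util.Optional;
import java.util.TreeMap;

public class InvocationContextSketch {

    /** Hypothetical wrapper pairing a deserialized response body with the HTTP response headers. */
    record InvocationContext<O>(O response, Map<String, List<String>> headers) {

        InvocationContext {
            // Copy into a case-insensitive map, since HTTP header names are case-insensitive.
            Map<String, List<String>> copy = new TreeMap<>(String.CASE_INSENSITIVE_ORDER);
            copy.putAll(headers);
            headers = Collections.unmodifiableMap(copy);
        }

        /** Returns the first value of the named header, if any. */
        Optional<String> firstHeader(String name) {
            List<String> values = headers.get(name);
            return (values == null || values.isEmpty()) ? Optional.empty() : Optional.of(values.get(0));
        }
    }

    public static void main(String[] args) {
        InvocationContext<String> ctx = new InvocationContext<>(
                "{\"completion\":\"hi\"}",
                Map.of("X-Amzn-Bedrock-Input-Token-Count", List.of("7")));
        System.out.println(ctx.firstHeader("x-amzn-bedrock-input-token-count").orElse("n/a"));
    }
}
```

With a wrapper like this, a method such as internalInvocation could keep deserializing the body exactly as before while still giving callers access to the usage headers.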

Also, I created multiple metadata classes such as BedrockAnthropicChatResponseMetadata because each model has a different response structure. They look similar at the moment, but the response message ID, for example, differs between models. So I decided to keep them separate to allow for future changes in each model.

What do you think? By the way, I can reduce the metadata classes to a single one. Maybe we can add a dynamic field, such as a Map<String, Object> of attributes, to cover the metadata fields that differ between models?
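A rough sketch of what that single class could look like, purely for discussion. `BedrockChatResponseMetadataSketch` and all of its members are hypothetical names, and the `stopReason` attribute key is only an illustration of a model-specific field.

```java
import java.util.Collections;
import java.util.LinkedHashMap;
import java.util.Map;
import java.util.Optional;

public final class BedrockChatResponseMetadataSketch {

    private final String id;
    private final long inputTokens;
    private final long outputTokens;
    // Model-specific fields that do not fit the common shape.
    private final Map<String, Object> attributes;

    public BedrockChatResponseMetadataSketch(String id, long inputTokens, long outputTokens,
            Map<String, Object> attributes) {
        this.id = id;
        this.inputTokens = inputTokens;
        this.outputTokens = outputTokens;
        // Defensive copy so later changes to the caller's map do not leak in.
        this.attributes = Collections.unmodifiableMap(new LinkedHashMap<>(attributes));
    }

    public String id() {
        return id;
    }

    public long inputTokens() {
        return inputTokens;
    }

    public long outputTokens() {
        return outputTokens;
    }

    public long totalTokens() {
        return inputTokens + outputTokens;
    }

    /** Typed lookup of a model-specific attribute; empty if absent or of another type. */
    public <T> Optional<T> attribute(String key, Class<T> type) {
        Object value = attributes.get(key);
        return type.isInstance(value) ? Optional.of(type.cast(value)) : Optional.empty();
    }

    public static void main(String[] args) {
        BedrockChatResponseMetadataSketch metadata = new BedrockChatResponseMetadataSketch(
                "msg-1", 10, 5, Map.of("stopReason", "end_turn"));
        System.out.println(metadata.totalTokens());
        System.out.println(metadata.attribute("stopReason", String.class).orElse("unknown"));
    }
}
```

The trade-off is that the common fields stay strongly typed while anything model-specific is looked up by string key, so a typo in a key only shows up at runtime.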

tzolov · 2024-04-26T13:46:22Z

Thanks, @wmz7year. I need to look into this further, but won't be able to do it in the coming 2 weeks.

maxjiang153 · 2024-04-28T07:00:12Z

> Thanks, @wmz7year I need to look into this further, but wont be able to do it in coming 2 weeks.

Sure, if you have any thoughts, just let me know.

Bedrock chat client response add usage support.

36e197d

tzolov added the model client label Apr 20, 2024

tzolov added this to the 1.0.0-M1 milestone Apr 20, 2024

tzolov self-assigned this Apr 20, 2024

tzolov modified the milestones: 1.0.0-M1, 1.0.0-M2 Apr 26, 2024

tzolov mentioned this pull request Apr 26, 2024

BedrockAnthropic3ChatClient returns ChatResponse without AnthropicUsage #600

Open
