Livestreaming typing animation #5141 | impact on operational costs chatbots #5188

Closed
HesselWellema opened this issue May 16, 2024 · 2 comments
Labels
Bot Services, customer-replied-to, customer-reported, question, Stack Overflow candidate

Comments

@HesselWellema

I have a question

Hi Team,

I noticed the new streaming feature is available and I'm looking forward to implementing it for my customers. However, I have a short question about its feasibility.

We are charged for the use of Direct Line on a per-message basis. According to the pricing page, messages are Activity objects sent to the bot's endpoint via HTTP POST, or sent from the bot to the user.

When a long answer is generated by an LLM, I could wait until it's finished and send it as one HTTP call. Alternatively, I could start streaming right away, which might result in multiple HTTP calls (start, content, finish), or even more.

If every chunk is a billed HTTP call, I am not sure this solution will be feasible because of the additional cost (approximately 50 cents per 1,000 chunks).

I did notice the line "Livestream is more-or-less 'multiple revisions of the same activity'," but I am not sure how to interpret it.

Thank you.

@HesselWellema HesselWellema added Bot Services Required for internal Azure reporting. Do not delete. Do not change color. customer-reported Required for internal Azure reporting. Do not delete. question Further information is requested. Stack Overflow candidate labels May 16, 2024
@HesselWellema HesselWellema changed the title Livestreaming typing animation #5141 | impact on operational cost chatbots Livestreaming typing animation #5141 | impact on operational costs chatbots May 16, 2024
@compulim
Contributor

Rest assured: we designed this together with the Azure Bot Services team.

A streaming response is counted only once, no matter how many intermediate chunks are sent.
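
As a rough illustration of what that means for the bill, here is a back-of-the-envelope sketch comparing the two interpretations raised in the question. The rate is the approximate figure quoted in the question (about 50 cents per 1,000 messages), not official pricing, and the workload numbers and the `costUsd` helper are hypothetical.

```typescript
// Assumed rate, taken from the question above (about $0.50 per 1,000 messages).
// This is an illustration only, not official pricing.
const ASSUMED_USD_PER_1000_MESSAGES = 0.5;

// Hypothetical helper: cost in USD for a given number of billed messages.
function costUsd(billedMessages: number): number {
  return (billedMessages / 1000) * ASSUMED_USD_PER_1000_MESSAGES;
}

// Hypothetical workload: 10,000 answers, each streamed in 40 chunks.
const answers = 10000;
const chunksPerAnswer = 40;

// Interpretation feared in the question: every chunk is billed as a message.
const ifEveryChunkBilled = costUsd(answers * chunksPerAnswer);

// Interpretation confirmed in this reply: each streamed answer counts once.
const ifCountedOnce = costUsd(answers);

console.log({ ifEveryChunkBilled, ifCountedOnce });
```

Under these assumptions, the cost scales with the number of answers rather than with the number of chunks per answer, so finer-grained streaming does not change the total.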

@compulim compulim added the customer-replied-to Required for internal reporting. Do not delete. label May 25, 2024
@HesselWellema
Author

Great. In that case I will use your implementation. Thank you.
