1s Latency Definition #14

RuchirB · 2024-02-09T15:04:26Z

Tried out the project, very impressed. Thanks for open sourcing. Quick question on latency.

I noticed a latency of at least 3-4s, measured as the delay between when the human speaks and when the AI responds. This was with everything deployed on Fly.io in Ashburn, following the demo instructions exactly.

It looks like the biggest bottleneck is the round trip between Twilio and Fly.io (Twilio -> Fly.io and Fly.io -> Twilio). The second biggest looks like transcription via Deepgram.
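For anyone who wants to reproduce this kind of breakdown, here is a minimal sketch of per-stage timing. It is not the project's code: the `LatencyBreakdown` class and the stage names are hypothetical, and the `time.sleep` calls are placeholders for the real network, transcription, GPT, and TTS calls.

```python
import time
from contextlib import contextmanager


class LatencyBreakdown:
    """Accumulates wall-clock time spent in named pipeline stages."""

    def __init__(self):
        self.stages = {}

    @contextmanager
    def stage(self, name):
        # perf_counter is a monotonic, high-resolution clock meant for intervals.
        start = time.perf_counter()
        try:
            yield
        finally:
            elapsed = time.perf_counter() - start
            self.stages[name] = self.stages.get(name, 0.0) + elapsed

    def total(self):
        return sum(self.stages.values())

    def slowest(self):
        # Name of the stage with the largest accumulated time.
        return max(self.stages, key=self.stages.get)

    def report(self):
        total = self.total()
        rows = sorted(self.stages.items(), key=lambda kv: kv[1], reverse=True)
        lines = []
        for name, seconds in rows:
            share = seconds / total if total else 0.0
            lines.append(f"{name:<16}{seconds * 1000:9.1f} ms {share:7.1%}")
        return "\n".join(lines)


if __name__ == "__main__":
    breakdown = LatencyBreakdown()
    # Hypothetical stage names; replace each sleep with the real call.
    with breakdown.stage("inbound_audio"):
        time.sleep(0.03)
    with breakdown.stage("transcription"):
        time.sleep(0.02)
    with breakdown.stage("gpt_response"):
        time.sleep(0.01)
    with breakdown.stage("tts"):
        time.sleep(0.01)
    print(breakdown.report())
    print("slowest stage:", breakdown.slowest())
```

Timing the stages one by one also makes it clear which of them a quoted latency figure covers.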

The README suggests a latency of 1s. Can you clarify the definition of latency here? Is that just the GPT response plus TTS?

Any ideas on how to reduce latency? Is there a roadmap for this project we can follow somewhere?

ansario · 2024-03-08T17:13:56Z

You could switch to the gpt-4-turbo-preview (or a 3.5) GPT model for a small speedup.
