Got the repo kinda working #26
I thought the issue might have been me trying to add TypeScript to the entire repo, but I've just cloned it fresh, started the server, and then run the outbound script from another console. These are the logs:
On the phone I hear nothing, and there doesn't appear to be any transcription going on at all. I'm not sure what the issue might be; the only thing I can see in the logs is "Deepgram connection closed". Any ideas?
OK, it turns out I forgot to add the ElevenLabs API key to the .env. I still have an issue with Deepgram, though: it always closes the connection, and I can't figure out why yet. At the very least I now get the first message as audio in the phone call (I could hear "Rachel's" voice on the phone), but after I respond, the next log line is always "Deepgram connection closed".
If anyone sees this: the issue was that the Deepgram SDK changed. I managed to get everything to work, or kind of. Every service is now working after I modified the transcription service to comply with the new SDK.
The issue I'm facing now is that the conversation seems to be out of sync. I answer, then GPT generates a reply, and that seems to happen a couple of times in a row. While I wait on the call for the agent to speak (it then speaks three messages in a row without me talking), the Deepgram timeout hits and the connection closes. Logs if anyone is interested.
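For anyone hitting the same idle timeout: one thing that may be worth trying is sending a periodic keep-alive on the transcription socket while the agent is speaking and no caller audio is flowing. Below is a minimal sketch of that idea, not the repo's code. As far as I know, Deepgram's streaming API accepts a JSON text message of type `KeepAlive`, but the exact message, the interval, and how (or whether) your SDK version exposes the underlying socket are assumptions to verify against Deepgram's docs. `startKeepAlive`, `fakeSocket`, and `fakeTimers` are hypothetical names used only for illustration.

```javascript
// Hypothetical helper, not part of the repo or of the Deepgram SDK.
// `socket` is anything with a send(string) method, for example a raw
// WebSocket opened against a live transcription endpoint.
// `timers` is injectable so the behaviour can be exercised without waiting.
function startKeepAlive(
  socket,
  intervalMs = 5000, // assumed interval; check the provider's idle timeout
  timers = {
    setInterval: (fn, ms) => setInterval(fn, ms),
    clearInterval: (handle) => clearInterval(handle),
  }
) {
  // Assumed message shape: a JSON text frame with type "KeepAlive".
  const payload = JSON.stringify({ type: "KeepAlive" });
  const handle = timers.setInterval(() => socket.send(payload), intervalMs);
  // Return a function that stops the keep-alive loop.
  return () => timers.clearInterval(handle);
}

// Demo with a fake socket and fake timers so nothing real is contacted.
const sent = [];
const fakeSocket = { send: (msg) => sent.push(msg) };
let tick = null;
const fakeTimers = {
  setInterval: (fn) => { tick = fn; return 1; },
  clearInterval: () => { tick = null; },
};

const stop = startKeepAlive(fakeSocket, 5000, fakeTimers);
tick(); // simulate the interval firing twice
tick();
stop();
console.log(sent.length); // 2
```

In a real call you would start the loop when the socket opens and call the returned stop function when it closes. This only addresses the idle timeout; it does not fix the replies arriving out of order.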
Given how the interactions are numbered, I think there might be an issue in the interaction handling. I'll have to keep debugging to see, but I hope this helps someone else who might want to give this fantastic repo a try!
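If the interaction numbering really is the culprit, one way to reason about it is to only let replies through in numeric order and to drop anything stale. The sketch below illustrates that idea under assumptions: it is not how the repo handles interactions, and `OrderedPlayback` and its `push` method are hypothetical names. It assumes each GPT reply carries a zero-based interaction index.

```javascript
// Hypothetical helper, not part of the repo: releases items strictly in
// interaction order, buffering any that arrive early.
class OrderedPlayback {
  constructor(onReady, firstIndex = 0) {
    this.onReady = onReady;     // called with each item once it is next in line
    this.expected = firstIndex; // next interaction index allowed out
    this.pending = new Map();   // early arrivals, keyed by index
  }

  push(index, item) {
    if (index < this.expected) return; // stale or duplicate: drop it
    this.pending.set(index, item);
    // Flush every consecutive item that is now ready.
    while (this.pending.has(this.expected)) {
      const next = this.pending.get(this.expected);
      this.pending.delete(this.expected);
      this.expected += 1;
      this.onReady(next);
    }
  }
}

// Demo: replies arrive out of order and one is duplicated.
const spoken = [];
const playback = new OrderedPlayback((text) => spoken.push(text));
playback.push(1, "second reply");
playback.push(0, "first reply");
playback.push(0, "duplicate of the first reply");
playback.push(2, "third reply");
console.log(spoken); // [ 'first reply', 'second reply', 'third reply' ]
```

In a setup like this, `onReady` would hand the text to the TTS step, so speech can never be queued ahead of an earlier interaction. Whether dropping duplicates is the right call depends on how the indices are actually assigned, which I have not verified.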
@SDCalvo Hey, sorry for the late reply here! Do you know which version of the Deepgram SDK caused the change? My guess would be 3.x.x, but what I'm wondering is how you got that version of the SDK, since this project specifies ^2.4.0. Did you intentionally upgrade to the latest version? I'll take a look at supporting the new DG SDK.
Honestly I don't remember. I think I might've upgraded by accident? Not entirely sure. Also, thanks for the reply! Let me know if I can help you upgrade and/or add TypeScript support. The work you've done here is fantastic!
My package.json right now
I probably updated the SDK version without noticing at some point.
Any update on this? It would be great to be able to use Deepgram for the TTS; it is much better than ElevenLabs.
Not really. I ended up using only OpenAI to make a POC, with TTS and STT from OpenAI directly and the new GPT-4o model. It's pretty fast, I got it to use tools, and it works great overall, to be honest.
I also updated the SDK because I was trying to write a class for Deepgram to work with TTS. Now I see what you meant about it messing up the STT :( @cweems any chance of supporting the new SDK?
So I found a workaround: I installed both versions of the SDK, 2.4 and 3.3, using an alias, and I use 3.3 for the TTS. It works great, but it would probably be best to update the transcription (STT) service to work with the new SDK.
Oh, that's smart!!
@SDCalvo, you need to get a subscription at https://elevenlabs.io/ and use that API key; it might work.
Right now these are my logs from a recent call
I get no audio on the call, none whatsoever, but I do see some transcripts in the console. I honestly can't work out what the issue might be, as there are no errors. For one thing, Deepgram seems to close the connection out of the blue, and the TTS also never sends the audio to the actual call. Any ideas?