Skip to content

.Net 6.0 - WPF Application for Azure Speech and OpenAI GPT

License

Notifications You must be signed in to change notification settings

der3318/audio-powered-gpt

Repository files navigation

💬 Audio Powered GPT

version dotnetf openai speech portable .NET WPF App Release Builder

A tiny WPF interface that integrates Azure cognitive service with GPT endpoint. This requires Azure subscription resources of both speech service and OpenAI.

Demo.png

Interactive Mode

Simply type or speak (via microphone) to ask GTP questions in this mode. Press the "start button" to trigger a speech QA session, and click the "start/stop button" again to pause.

InteractiveMode.png

Translation Mode

This is the real time translation (into Chinese) functionality. Result texts will also be displayed as a 3-second toast in the bottom corner, so the app can be run completely in the background.

TranslationMode.png

An audio redirection (from speacker to input) interface is a prerequisite to use the feature. Windows stereo mix or VoiceMeeter is probably a good choice.

References