Skip to content

Real time caption generator using Microsoft Azure speech services

License

Notifications You must be signed in to change notification settings

JadenChun/real-time-caption-generator

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

13 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

Real time caption generator using Microsoft Azure speech services

Overview

This project is aiming to provide real time captioning for any content with the help of speech transcription and translation technology. SubGen is capable to perform real time captioning for every application that runs on windows.

How it works?

  1. The audio output of the system is being streamed to the Microsoft Azure speech service.
  2. The transcription result from the speech services will then be displayed in the subtitle window.

Supported Platform

Windows: Tested with Windows 10 and Windows 11

Technical Choice

  1. Language: C++
  2. UI Framework: Qt
  3. Dependency: rtaudio, Azure Speech SDK
  4. External Driver: VB-Cable

Setup Guide

  1. Install VB-Cable.
  2. Download SubGen.zip from here and extract it.
  3. Run the subgen.exe from the file.
  4. Add your own key and region for Azure speech service.

About Microsoft Azure Speech Service

You will require a Microsoft Azure account to use the speech service. You can register a free account at Microsoft and create the speech resource to get the key and region information.

-Noted that the captioning will not work without the Azure Speech Service

Screenshots

homepage

setting

subtitle window

License

MIT