Thematic Transcription

A tool to transcribe video to audio using Whisper API for thematic analysis.

Attention weights to capture dependencies and interactions between words in the audio.

The following is split up into a few sections: (1) Focused on transcription and performing speaker diarization, meaning separating speakers in the audio (2) Thematic analysis of the transcription (in progress)

Installation

Set up Whisper API GitHub instructions

If you do not have admin access, make sure to run as administrator when installing

Install dependencies in your command prompt

pip install whisper pip install python-docx pip install fpdf

Add audio file to the same directory or add the correct path in code
Modify code as necessary such as changing the name of the audio file, language, etc.
Run the script. Replace your_script.py with the name of your script

python your_script.py

See saved transcription as a word (.docx) and pdf (.pdf) file in the same directory as the audio file

Acceptable file types

The acceptable file types¹ are:

m4a: MPEG-4 Audio File
mp3: MPEG-1 Audio Layer 3 File
webm: WebM Audio/Video File
mp4: MPEG-4 Video File
mpga: MPEG Audio File
wav: Waveform Audio File
mpeg: MPEG Movie File

Links to audio files are not supported at the moment.

Requests per minute

50 requests per minute¹

File size

Up to 25MB¹

Command prompts

Transcribe audio

cd into the directory where the audio file is located
whisper transcribe --filename <filename> --language <language> --output <output_filename>
- filename is the name of the file you want to transcribe
- language is the language of the audio file
- output is the name of the file you want to save the transcription as

Transcribe audio with speaker separation

cd into the directory where the audio file is located
whisper transcribe --filename <filename> --language <language> --output <output_filename> --speaker-separation
- filename is the name of the file you want to transcribe
- language is the language of the audio file
- output is the name of the file you want to save the transcription as

Transcribe audio with speaker separation and speaker labels

cd into the directory where the audio file is located
whisper transcribe --filename <filename> --language <language> --output <output_filename> --speaker-separation --speaker-labels
- filename is the name of the file you want to transcribe
- language is the language of the audio file
- output is the name of the file you want to save the transcription as

Transcribe audio with speaker separation and speaker labels and punctuation

cd into the directory where the audio file is located
whisper transcribe --filename <filename> --language <language> --output <output_filename> --speaker-separation --speaker-labels --punctuation
- filename is the name of the file you want to transcribe
- language is the language of the audio file
- output is the name of the file you want to save the transcription as

Transcribe audio with speaker separation and speaker labels and punctuation and profanity filter

cd into the directory where the audio file is located
whisper transcribe --filename <filename> --language <language> --output <output_filename> --speaker-separation --speaker-labels --punctuation --profanity-filter
- filename is the name of the file you want to transcribe

License

MIT

Whisper API FAQ ↩ ↩² ↩³

Name		Name	Last commit message	Last commit date
Latest commit History 6 Commits
.vscode		.vscode
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Transcribe_Audio_With_Whisper.ipynb		Transcribe_Audio_With_Whisper.ipynb
transcribe.py		transcribe.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

.vscode

.vscode

.gitignore

.gitignore

LICENSE

LICENSE

README.md

README.md

Transcribe_Audio_With_Whisper.ipynb

Transcribe_Audio_With_Whisper.ipynb

transcribe.py

transcribe.py

Repository files navigation

Thematic Transcription

Installation

Acceptable file types

Requests per minute

File size

Command prompts

Transcribe audio

Transcribe audio with speaker separation

Transcribe audio with speaker separation and speaker labels

Transcribe audio with speaker separation and speaker labels and punctuation

Transcribe audio with speaker separation and speaker labels and punctuation and profanity filter

License

About

Languages

License

taylorylee/thematic-transcription

Folders and files

Latest commit

History

Repository files navigation

Thematic Transcription

Installation

Acceptable file types

Requests per minute

File size

Command prompts

Transcribe audio

Transcribe audio with speaker separation

Transcribe audio with speaker separation and speaker labels

Transcribe audio with speaker separation and speaker labels and punctuation

Transcribe audio with speaker separation and speaker labels and punctuation and profanity filter

License

Footnotes

About

Topics

Resources

License

Stars

Watchers

Forks

Languages