GitHub - urvishp80/whisper-diarization: Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

This Repo is cloned from:- https://github.com/MahmoudAshraf97/whisper-diarization

It will download audio file from s3, process it and upload the srt file to s3.

Required python>=3.9 and install all dependencies using:

we have to install transformer separately due to versioning error with nemo lib

Set up environment variables: Create .env file in the root folder and add following keys -

AWS_ACCESS_KEY_ID = ""
AWS_SECRET_ACCESS_KEY = ""
BUCKET_NAME = ""

if you want stemming=True and model_name="medium.en" as deafult use:-

python main.py

else give the args for example:-

python main.py --no-stem --whisper-model "large.en"

Name		Name	Last commit message	Last commit date
Latest commit History 31 Commits
.gitignore		.gitignore
LICENSE		LICENSE
README.md		README.md
Whisper_Transcription_+_NeMo_Diarization.ipynb		Whisper_Transcription_+_NeMo_Diarization.ipynb
diarize.py		diarize.py
helpers.py		helpers.py
main.py		main.py
requirements.txt		requirements.txt