Skip to content

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

License

Notifications You must be signed in to change notification settings

urvishp80/whisper-diarization

 
 

Repository files navigation

To run the whisper diarization on all audio files from s3

It will download audio file from s3, process it and upload the srt file to s3.

Required python>=3.9 and install all dependencies using:

  • pip install requirements.txt
  • pip install transformers==4.26.1

we have to install transformer separately due to versioning error with nemo lib

Set up environment variables: Create .env file in the root folder and add following keys -

AWS_ACCESS_KEY_ID = ""
AWS_SECRET_ACCESS_KEY = ""
BUCKET_NAME = ""

Usage:-

if you want stemming=True and model_name="medium.en" as deafult use:-

python main.py

else give the args for example:-

python main.py --no-stem --whisper-model "large.en"

About

Automatic Speech Recognition with Speaker Diarization based on OpenAI Whisper

Resources

License

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published

Languages

  • Jupyter Notebook 61.3%
  • Python 38.7%