Audio to Text

Python program that will transcribe mp3 files to text whether you're online or offline.

Getting Started

First create a folder to place cloned git project into. Inside this folder also create a folder called “vosk_lang” to place vosk language models into. Go to the link: https://alphacephei.com/vosk/models To use Vosk models. Choose your desired language to transcribe in. In this repo, it is set up to use vosk-model-en-us-0.22. This can later be changed in the project files to suit your needs. Download the model into the folder that was created earlier called “vosk_lang” to be used later during set up.

Prerequisites

Have an IDE to run python such as pycharm. Have the Vosk model folder downloaded for use. Have git installed on the computer. A mp3 audio file to transcribe.

Installing

Go to your preferred python IDE.
Then open up the terminal in that IDE.
Navigate to the folder you created earlier for the git project.
Once in said folder go to github and click on the green code button and choose the method of download (Usually it will be HTTPS)
Copy link and go back to the IDE terminal and type “git clone {place coped url here}”.
After hitting enter the repo will be cloned into that folder.
Once done downloading, open project with your IDE and download the necessary packages.
In the root of the project create 3 folders.
- audio_files
- results
- vosk_lang
Inside the “vosk_lang” folder place the downloaded and unzipped model from vosk here.

Deployment

After the setup is complete, run the main.py file. The conversion will take a few minutes depending on model type. While running the program will sound a notification sound to signify the completion of the program.

Built With

Python - Programming language used Vosk - Translation library

Authors

Andy Min - Creator

Name		Name	Last commit message	Last commit date
Latest commit History 9 Commits
.gitignore		.gitignore
README.md		README.md
main.py		main.py
notification_sound.mp3		notification_sound.mp3
transcribe.py		transcribe.py

Provide feedback

Saved searches

Use saved searches to filter your results more quickly

Repository files navigation

Audio to Text

Getting Started

Prerequisites

Installing

Deployment

Built With

Authors

About

Releases

Packages

Languages

andrewymin/audio-to-text

Folders and files

Latest commit

History

Repository files navigation

Audio to Text

Getting Started

Prerequisites

Installing

Deployment

Built With

Authors

About

Resources

Stars

Watchers

Forks

Releases

Packages 0

Languages

Packages