Skip to content

chanyk-joseph/digits_sounds_recognizer

Folders and files

NameName
Last commit message
Last commit date

Latest commit

 

History

10 Commits
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 
 

Repository files navigation

digits_sounds_recognizer

Simple NN model for recovering Cantonese digits to text (99% Accuracy after 100 epoch)

--
Training Step: 18000  | total loss: 0.20440 | time: 2.408s
| Adam | epoch: 100 | loss: 0.20440 - acc: 0.9911 | val_loss: 0.00000 - val_acc: 1.0000 -- iter: 1800/1800
--

TensorBoard

Installation on ubuntu

$ sudo apt-get install libasound2-dev
$ pip3 install -r requirements.txt

Predict

$ python predict.py ./data/4999.mp3
Prediction for ./data/4999.mp3 = 9972

Train

$ python train.py

Fetch more samples & perform labelling manually

$ node samples_collector.js
$ python 4-digits-audio-splitter-and-labeler.py
$ python generate-data-summary-csv.py

Deploy on Google Cloud

$ cd ./gcloud-function
$ gcloud functions deploy digits_recognizer --runtime=python37 --trigger-http --memory=512 --timeout=60s --region=asia-east2

# Test
$ curl -d '{"mp3":"<mp3_in_base64_string>"}' -H "Content-Type: application/json" -X POST https://asia-east2-protean-silicon-865.cloudfunctions.net/digits_recognizer

About

Simple NN model for recovering Cantonese digits to text

Resources

Stars

Watchers

Forks

Releases

No releases published

Packages

No packages published