This repository has been archived by the owner on Aug 28, 2024. It is now read-only.

ASL recognition demo #169

Merged
merged 27 commits into from
Aug 3, 2021

Conversation

@jeffxtang (Contributor) commented Jul 12, 2021

American Sign Language Recognition on Android

Introduction

American Sign Language (ASL) is a natural language used by deaf communities in many countries around the world. Its fingerspelling alphabet has 26 signs, one for each letter of the English alphabet. This repo contains Python scripts that train a deep learning model to recognize the 26 ASL alphabet signs (plus 3 additional signs for delete, space, and nothing) and that convert and optimize the trained model to the Mobile Interpreter format, along with an Android app that uses the model to recognize the 26 signs.

Prerequisites

  • PyTorch 1.9.0 and torchvision 0.10.0 (Optional)
  • Python 3.8 or above (Optional)
  • PyTorch Android libraries pytorch_android_lite:1.9.0 and pytorch_android_torchvision:1.9.0
  • Android Studio 4.0.1 or later

Quick Start

To test run the ASL recognition Android app, follow the steps below:

1. Train and Prepare the Model

If you don't have PyTorch 1.9.0 and torchvision 0.10.0 installed, or don't want to install them, you can skip this step. The trained, scripted, and optimized model is already included in the repo, located at ASLRecognition/app/src/main/assets.

Otherwise, open a terminal window and confirm that torch 1.9.0 and torchvision 0.10.0 are installed with a command like pip list | grep torch, or install them with a command like pip install torch torchvision. Then run the following commands:

git clone https://github.com/pytorch/android-demo-app
cd android-demo-app/ASLRecognition/scripts

Download the ASL alphabet dataset here and unzip it into the ASLRecognition/scripts folder. Then run the scripts below (based on this tutorial) to pre-process the training images, train the model, and convert and optimize the trained model for the mobile interpreter:

python preprocess_image.py
python create_csv.py
python train.py --epochs 5 # on a machine without GPU this can take hours
python convert_lite.py

If all goes well, the model asl.ptl will be generated and you can copy it to ASLRecognition/app/src/main/assets.
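The conversion script's contents are not shown here, but as a hedged sketch of what the convert_lite.py step involves, the following uses the standard PyTorch 1.9 TorchScript and lite-interpreter APIs. TinyASLNet is a hypothetical stand-in for the trained classifier (26 letters plus delete, space, and nothing, i.e. 29 classes), not the repo's actual model:

```python
import torch
import torch.nn as nn
from torch.utils.mobile_optimizer import optimize_for_mobile

# Hypothetical stand-in for the trained ASL classifier: 29 output classes
# (26 letters + delete, space, nothing).
class TinyASLNet(nn.Module):
    def __init__(self, num_classes=29):
        super().__init__()
        self.conv = nn.Conv2d(3, 8, kernel_size=3, padding=1)
        self.pool = nn.AdaptiveAvgPool2d(1)
        self.fc = nn.Linear(8, num_classes)

    def forward(self, x):
        x = self.pool(torch.relu(self.conv(x))).flatten(1)
        return self.fc(x)

model = TinyASLNet().eval()
scripted = torch.jit.script(model)               # compile to TorchScript
optimized = optimize_for_mobile(scripted)        # apply mobile-specific passes
optimized._save_for_lite_interpreter("asl.ptl")  # save in lite-interpreter format
```

The resulting .ptl file is what LiteModuleLoader expects on the Android side; a model saved with plain torch.jit.save cannot be loaded by pytorch_android_lite.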

You can also run python test.py to see the result of a test image located at ../app/src/main/assets/C1.jpg:

Predicted output: C
0.043 seconds

For more information on how to use a test script like the one above to find the expected model input and output and use them in an Android app, see Step 2 of the tutorial Image Segmentation DeepLabV3 on Android.
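The shape of such a test script can be sketched as below. The label order here is an assumption (the actual order is determined by the dataset's class folders during training), so treat LABELS as illustrative:

```python
import torch

# Assumed label order: 26 letters followed by the 3 extra classes.
# The real order comes from the training dataset's class folders.
LABELS = [chr(ord("A") + i) for i in range(26)] + ["del", "nothing", "space"]

def predict(model, tensor):
    """Run one normalized image batch of shape (1, 3, H, W) through the
    model and map the highest-scoring class index to its label."""
    with torch.no_grad():
        scores = model(tensor)
    return LABELS[int(scores.argmax(dim=1))]
```

For the bundled test image C1.jpg, a correctly trained model's argmax would land on index 2, producing the "Predicted output: C" shown above.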

2. Use Android Studio

Open the ASLRecognition project using Android Studio. Note the app's build.gradle file has the following lines:

implementation 'org.pytorch:pytorch_android_lite:1.9.0'
implementation 'org.pytorch:pytorch_android_torchvision:1.9.0'

and in MainActivity.java, the code below loads the model:

mModule = LiteModuleLoader.load(MainActivity.assetFilePath(getApplicationContext(), "asl.ptl"));

3. Run the App

Select an Android emulator or device and build and run the app. Some of the 26 test images of the ASL alphabet and their recognition results are as follows:



To test live ASL alphabet gesture recognition, first get familiar with the 26 ASL signs by tapping Next and Recognize, then select the LIVE button and make ASL gestures in front of the camera. A screencast of the app running is available here.

4. What's Next

With a different sign language dataset, such as the RWTH-PHOENIX-Weather 2014 MS Public Hand Shape Dataset or the Continuous Sign Language Recognition Dataset, and a state-of-the-art transformer-based sign language model, a more powerful sign language recognition Android app can be developed based on the app here.

@jeffxtang jeffxtang marked this pull request as ready for review July 26, 2021 19:04
@@ -0,0 +1,21 @@
# Add project specific ProGuard rules here.
File can be removed


public AnalysisResult(String results) {
    mResults = results;
}

needs formatting

Comment on lines 121 to 123
if (maxScoreIdx == DELETE) result = "DELETE";
else if (maxScoreIdx == NOTHING) result = "NOTHING";
else if (maxScoreIdx == SPACE) result = "SPACE";

nit: Imo using blocks for every case will be more readable:

if (maxScoreIdx == DELETE) {
    result = "DELETE";
} else if (maxScoreIdx == NOTHING) {
    result = "NOTHING";
} else if (maxScoreIdx == SPACE) {
    result = "SPACE";
}

btnNext.setOnClickListener(new View.OnClickListener() {
    public void onClick(View v) {
        mStartLetterPos = (mStartLetterPos + 1) % 26;
        if (mStartLetterPos == 0) mStartLetterPos = 26;

nit:

if (mStartLetterPos == 0) {
    mStartLetterPos = 26;
}

@IvanKobzarev IvanKobzarev merged commit f09816a into pytorch:master Aug 3, 2021