
Deep_Learning_Hand_Gesture_Recognition_Project

Goal: To recognize hand gestures using state-of-the-art neural networks.

Problem Statement

Imagine you are working as a data scientist at a home-electronics company that manufactures state-of-the-art smart televisions. You want to develop a feature for the smart TV that can recognize five different gestures performed by the user, allowing users to control the TV without a remote.

The gestures are continuously monitored by the webcam mounted on the TV. Each gesture corresponds to a specific command (a sketch of one possible label-to-command mapping follows the list):

  • Thumbs up: Increase the volume
  • Thumbs down: Decrease the volume
  • Left swipe: 'Jump' backwards 10 seconds
  • Right swipe: 'Jump' forward 10 seconds
  • Stop: Pause the movie
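As an illustration only, here is a minimal Python sketch of such a mapping. The 0-4 label order and the command strings are assumptions for demonstration; the actual label assignments come from the dataset's CSV files.

```python
# Hypothetical mapping from predicted class labels to TV commands.
# The 0-4 label order below is an assumption; the real labels are
# defined in the dataset's CSV files.
GESTURE_COMMANDS = {
    0: ("Thumbs up", "volume_up"),
    1: ("Thumbs down", "volume_down"),
    2: ("Left swipe", "seek_backward_10s"),
    3: ("Right swipe", "seek_forward_10s"),
    4: ("Stop", "pause"),
}

def dispatch(label: int) -> str:
    """Translate a predicted class label into a TV command string."""
    gesture, command = GESTURE_COMMANDS[label]
    return command
```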

Each video is a sequence of 30 frames (or images).

Understanding the Dataset

The training data consists of a few hundred videos categorized into one of five classes. Each video (typically 2-3 seconds long) is divided into a sequence of 30 frames (images). These videos were recorded by various people performing one of the five gestures in front of a webcam, similar to what the smart TV will use.

The data is in a zip file. The zip file contains a 'train' and a 'val' folder, along with a CSV file for each folder. These folders are in turn divided into subfolders, where each subfolder represents a video of a particular gesture. Each subfolder, i.e. a video, contains 30 frames (or images). Note that all images in a particular video subfolder have the same dimensions, but different videos may have different dimensions. Specifically, videos come in two sizes, either 360x360 or 120x160 (depending on the webcam used to record them). Hence, some pre-processing is needed to standardize the videos.
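Below is a minimal pre-processing sketch, assuming OpenCV is available. The 120x120 target size and the center-crop strategy are assumptions, not the project's documented choices; see the writeup for the actual pipeline.

```python
import cv2
import numpy as np

TARGET_SIZE = (120, 120)  # assumed common size; any fixed (H, W) would do

def preprocess_frame(path: str) -> np.ndarray:
    """Load one frame, crop it to a square if needed, resize, and normalize."""
    img = cv2.imread(path)                           # BGR array, shape (H, W, 3)
    h, w = img.shape[:2]
    if h != w:                                       # e.g. the 120x160 webcam frames
        side = min(h, w)
        top, left = (h - side) // 2, (w - side) // 2
        img = img[top:top + side, left:left + side]  # center crop to a square
    img = cv2.resize(img, TARGET_SIZE)               # 360x360 frames are only resized
    return img.astype(np.float32) / 255.0            # scale pixel values to [0, 1]
```

Cropping the rectangular frames before resizing avoids distorting the gesture's aspect ratio, at the cost of discarding the frame's left and right margins.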

Each row of the CSV file represents one video and contains three main pieces of information: the name of the subfolder containing the 30 images of the video, the name of the gesture, and the numeric label (between 0 and 4) of the video.
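For illustration, here is a small loader sketch using Python's standard csv module. The ';' delimiter and the column order are assumptions based on the description above; adjust them to match the actual files.

```python
import csv

def load_index(csv_path: str) -> list[tuple[str, str, int]]:
    """Read a train/val index CSV into (folder, gesture_name, label) tuples.

    The ';' delimiter is an assumption; switch to ',' if the files use commas.
    """
    rows = []
    with open(csv_path) as f:
        for record in csv.reader(f, delimiter=";"):
            folder, gesture, label = record[:3]
            rows.append((folder, gesture, int(label)))
    return rows
```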

The ultimate task is to train a model on the 'train' folder that also performs well on the 'val' folder, as is standard in ML projects.

Model Overview

Please refer to the 'DL_Hands_Gesture_Recognition_Project_Writeup.docx' file for a detailed model overview.

Jupyter Notebook Viewer

If you are unable to view or load the Jupyter notebook via GitHub, please click on this link. Thank you!