Skip to content

Latest commit

 

History

History
15 lines (8 loc) · 958 Bytes

File metadata and controls

15 lines (8 loc) · 958 Bytes

Speech to Text with OpenVINO™

Binder

output

In this tutorial speech to text recognition is presented.

Notebook Contents

This tutorial uses the quartznet 15x5 model. QuartzNet performs automatic speech recognition. Its design is based on the Jasper architecture, which is a convolutional model trained with Connectionist Temporal Classification (CTC) loss. The model is available from Open Model Zoo.

Installation Instructions

If you have not installed all required dependencies, follow the Installation Guide.