Invention Grant
- Patent Title: Voice commands recognition method and system based on visual and audio cues
-
Application No.: US16719291Application Date: 2019-12-18
-
Publication No.: US11508374B2Publication Date: 2022-11-22
- Inventor: Andrew Abou Antoun , Andrew Osaro Idehen
- Applicant: KRYSTAL TECHNOLOGIES
- Applicant Address: CA Laval
- Assignee: KRYSTAL TECHNOLOGIES
- Current Assignee: KRYSTAL TECHNOLOGIES
- Current Assignee Address: CA Laval
- Agency: Praxis
- Main IPC: G10L15/24
- IPC: G10L15/24 ; G10L15/25 ; G10L15/22 ; G10L15/16 ; G06T17/00 ; G10L25/24 ; G06N3/04 ; G06N3/08 ; G06V40/16

Abstract:
A method and system for voice commands recognition. The system comprises a video camera and a microphone producing an audio/video recording of a user issuing vocal commands and at least one processor connected to the video camera and the microphone. The at least one processor has an associated memory having stored therein processor executable code causing the processor to perform the steps of: obtain the audio/video recording from the video camera and the microphone; extract video features from the audio/video recording and store the result in a first matrix; extract audio features from the audio/video recording and store the result in a second matrix; apply a speech-to-text engine to the audio portion of the audio/video recording and store the resulting syllables in a text file; and identify via a neural network the vocal commands of the user based on the first matrix, the second matrix and the text file.
Information query