Invention Grant
- Patent Title: Raw speech speaker-recognition
-
Application No.: US16852950Application Date: 2020-04-20
-
Publication No.: US10706857B1Publication Date: 2020-07-07
- Inventor: Viswanathan Ramasubramanian , Sunderrajan Kumar
- Applicant: Viswanathan Ramasubramanian , Sunderrajan Kumar
- Applicant Address: US NJ Edison
- Assignee: KAIZEN SECURE VOIZ, INC.
- Current Assignee: KAIZEN SECURE VOIZ, INC.
- Current Assignee Address: US NJ Edison
- Agent Walter J. Tencza, Jr.
- Main IPC: G10L17/18
- IPC: G10L17/18 ; G06N3/04 ; G10L17/10

Abstract:
An apparatus including a multi time-frequency resolution convolution neural network module; a two dimensional convolution neural network layers module; and a discriminative fully-connected classifier layers module; wherein the multi time-frequency resolution convolution neural network module receives a raw speech signal from a human speaker and processes the raw speech signal to provide a first processed output in the form of multiple multi time-frequency resolution spectrographic feature maps; wherein the two dimensional convolution neural network layers module processes the first processed output to provide a second processed output; and wherein the discriminative fully-connected classifier layers module processes the second processed output to provide a third processed output, wherein the third processed output provides an indication of an identify of a human speaker or provides an indication of verification of the identify of a human speaker.
Information query