Raw speech speaker-recognition

Invention Grant

US10706857B1 Raw speech speaker-recognition 有权

Please log in to see more content

Patent Title: Raw speech speaker-recognition
Application No.: US16852950

Application Date: 2020-04-20
Publication No.: US10706857B1

Publication Date: 2020-07-07
Inventor: Viswanathan Ramasubramanian , Sunderrajan Kumar
Applicant: Viswanathan Ramasubramanian , Sunderrajan Kumar
Applicant Address: US NJ Edison
Assignee: KAIZEN SECURE VOIZ, INC.
Current Assignee: KAIZEN SECURE VOIZ, INC.
Current Assignee Address: US NJ Edison
Agent Walter J. Tencza, Jr.
Main IPC: G10L17/18
IPC: G10L17/18 ; G06N3/04 ; G10L17/10

Abstract:

An apparatus including a multi time-frequency resolution convolution neural network module; a two dimensional convolution neural network layers module; and a discriminative fully-connected classifier layers module; wherein the multi time-frequency resolution convolution neural network module receives a raw speech signal from a human speaker and processes the raw speech signal to provide a first processed output in the form of multiple multi time-frequency resolution spectrographic feature maps; wherein the two dimensional convolution neural network layers module processes the first processed output to provide a second processed output; and wherein the discriminative fully-connected classifier layers module processes the second processed output to provide a third processed output, wherein the third processed output provides an indication of an identify of a human speaker or provides an indication of verification of the identify of a human speaker.

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L17/00	讲话者辨认或验证
G10L17/18	.人工神经网络，连接方法