Speech model-based neural network-assisted signal enhancement

Invention Grant

US10381020B2 Speech model-based neural network-assisted signal enhancement 有权

Please log in to see more content

Patent Title: Speech model-based neural network-assisted signal enhancement
Application No.: US15625966

Application Date: 2017-06-16
Publication No.: US10381020B2

Publication Date: 2019-08-13
Inventor: Sean A. Ramprashad
Applicant: Apple Inc.
Applicant Address: US CA Cupertino
Assignee: Apple Inc.
Current Assignee: Apple Inc.
Current Assignee Address: US CA Cupertino
Agency: Womble Bond Dickinson (US) LLP
Main IPC: G10L15/20
IPC: G10L15/20 ; G10L19/00 ; G10L25/15 ; G10L25/30 ; G10L21/003 ; G10L21/0208 ; G10L21/0232

Speech model-based neural network-assisted signal enhancement

Abstract:

Several embodiments of a digital speech signal enhancer are described that use an artificial neural network that produces clean speech coding parameters based on noisy speech coding parameters as its input features. A vocoder parameter generator produces the noisy speech coding parameters from a noisy speech signal. A vocoder model generator processes the clean speech coding parameters into estimated clean speech spectral magnitudes. In one embodiment, a magnitude modifier modifies an original frequency spectrum of the noisy speech signal using the estimated clean speech spectral magnitudes, to produce an enhanced frequency spectrum, and a synthesis block converts the enhanced frequency spectrum into time domain, as an output speech sequence. Other embodiments are also described.

Public/Granted literature

US20180366138A1 Speech Model-Based Neural Network-Assisted Signal Enhancement Public/Granted day:2018-12-20

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L15/00	语音识别（G10L17/00优先）
G10L15/20	.专门适用于不利环境（例如，噪音环境）中保持鲁棒性或增强语音强度的语音识别技术（G10L21/02优先）