Systems and methods for audio signal processing using spectral-spatial mask estimation

Invention Grant

US11289109B2 Systems and methods for audio signal processing using spectral-spatial mask estimation 有权

Please log in to see more content

Patent Title: Systems and methods for audio signal processing using spectral-spatial mask estimation
Application No.: US16858185

Application Date: 2020-04-24
Publication No.: US11289109B2

Publication Date: 2022-03-29
Inventor: Chengyun Deng , Hui Song , Yi Zhang , Yongtao Sha
Applicant: BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO., LTD.
Applicant Address: CN Beijing
Assignee: BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO., LTD.
Current Assignee: BEIJING DIDI INFINITY TECHNOLOGY AND DEVELOPMENT CO., LTD.
Current Assignee Address: CN Beijing
Agency: Bayes PLLC
Main IPC: G10L21/0264
IPC: G10L21/0264 ; G10L15/16 ; G10L25/18

Systems and methods for audio signal processing using spectral-spatial mask estimation

Abstract:

Embodiments of the disclosure provide systems and methods for audio signal processing. An exemplary system may include a communication interface configured to receiving a first audio signal acquired from an audio source through a first channel, and a second audio signal acquired from the same audio source through a second channel. The system may also include at least one processor coupled to the communication interface. The at least one processor may be configured to determine channel features based on the first audio signal and the second audio signal individually and determine a cross-channel feature based on the first audio signal and the second audio signal collectively. The at least one processor may further be configured to concatenate the channel features and the cross-channel feature and estimate spectral-spatial masks for the first channel and the second channel using the concatenated channel features and the cross-channel feature. The at least one processor may also be configured to perform beamforming based on the spectral-spatial masks for the first channel and the second channel.

Public/Granted literature

US20200342891A1 SYSTEMS AND METHODS FOR ADUIO SIGNAL PROCESSING USING SPECTRAL-SPATIAL MASK ESTIMATION Public/Granted day:2020-10-29

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L21/00	为了改变语音或声音信号的质量或其可识度而处理语音或声音信号，以产生另一种可听的或非可听的信号，例如视觉信号或触觉信号（G10L19/00优先）
G10L21/02	.语音增强，例如降低噪声或消除回声（在直线传送系统中减轻回声效应入H04B3/20；免提电话中的回声抑制入H04M9/08）
G10L21/0208	..噪声过滤
G10L21/0264	...以参数测量的类型为特征的，如相关技术，零交叉技术或预测技术