Systems and methods for speech separation and neural decoding of attentional selection in multi-speaker environments

Invention Grant

US11961533B2 Systems and methods for speech separation and neural decoding of attentional selection in multi-speaker environments 有权

Please log in to see more content

Patent Title: Systems and methods for speech separation and neural decoding of attentional selection in multi-speaker environments
Application No.: US17826474

Application Date: 2022-05-27
Publication No.: US11961533B2

Publication Date: 2024-04-16
Inventor: Nima Mesgarani , Yi Luo , James O'Sullivan , Zhuo Chen
Applicant: The Trustees of Columbia University in the City of New York
Applicant Address: US NY New York
Assignee: The Trustees of Columbia University in the City of New York
Current Assignee: The Trustees of Columbia University in the City of New York
Current Assignee Address: US NY New York
Agency: Potomac Law Group, PLLC
Main IPC: G10L25/30
IPC: G10L25/30 ; A61B5/12 ; G10L17/26 ; G10L21/0272 ; G10L25/66

Systems and methods for speech separation and neural decoding of attentional selection in multi-speaker environments

Abstract:

Disclosed are devices, systems, apparatus, methods, products, and other implementations, including a method comprising obtaining, by a device, a combined sound signal for signals combined from multiple sound sources in an area in which a person is located, and applying, by the device, speech-separation processing (e.g., deep attractor network (DAN) processing, online DAN processing, LSTM-TasNet processing, Conv-TasNet processing), to the combined sound signal from the multiple sound sources to derive a plurality of separated signals that each contains signals corresponding to different groups of the multiple sound sources. The method further includes obtaining, by the device, neural signals for the person, the neural signals being indicative of one or more of the multiple sound sources the person is attentive to, and selecting one of the plurality of separated signals based on the obtained neural signals. The selected signal may then be processed (amplified, attenuated).

Public/Granted literature

US20220392482A1 SYSTEMS AND METHODS FOR SPEECH SEPARATION AND NEURAL DECODING OF ATTENTIONAL SELECTION IN MULTI-SPEAKER ENVIRONMENTS Public/Granted day:2022-12-08

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/27	.以分析方法为特征的
G10L25/30	..利用神经网络