Invention Grant
- Patent Title: Systems and methods for speech separation and neural decoding of attentional selection in multi-speaker environments
-
Application No.: US17826474Application Date: 2022-05-27
-
Publication No.: US11961533B2Publication Date: 2024-04-16
- Inventor: Nima Mesgarani , Yi Luo , James O'Sullivan , Zhuo Chen
- Applicant: The Trustees of Columbia University in the City of New York
- Applicant Address: US NY New York
- Assignee: The Trustees of Columbia University in the City of New York
- Current Assignee: The Trustees of Columbia University in the City of New York
- Current Assignee Address: US NY New York
- Agency: Potomac Law Group, PLLC
- Main IPC: G10L25/30
- IPC: G10L25/30 ; A61B5/12 ; G10L17/26 ; G10L21/0272 ; G10L25/66

Abstract:
Disclosed are devices, systems, apparatus, methods, products, and other implementations, including a method comprising obtaining, by a device, a combined sound signal for signals combined from multiple sound sources in an area in which a person is located, and applying, by the device, speech-separation processing (e.g., deep attractor network (DAN) processing, online DAN processing, LSTM-TasNet processing, Conv-TasNet processing), to the combined sound signal from the multiple sound sources to derive a plurality of separated signals that each contains signals corresponding to different groups of the multiple sound sources. The method further includes obtaining, by the device, neural signals for the person, the neural signals being indicative of one or more of the multiple sound sources the person is attentive to, and selecting one of the plurality of separated signals based on the obtained neural signals. The selected signal may then be processed (amplified, attenuated).
Public/Granted literature
Information query