Invention Grant
- Patent Title: Multi-stream target-speech detection and channel fusion
-
Application No.: US16706519Application Date: 2019-12-06
-
Publication No.: US11158333B2Publication Date: 2021-10-26
- Inventor: Francesco Nesta , Saeed Mosayyebpour Kaskari
- Applicant: SYNAPTICS INCORPORATED
- Applicant Address: US CA San Jose
- Assignee: SYNAPTICS INCORPORATED
- Current Assignee: SYNAPTICS INCORPORATED
- Current Assignee Address: US CA San Jose
- Agency: Paradice & Li LLP
- Main IPC: G10L21/0364
- IPC: G10L21/0364 ; G10L25/60 ; G10L15/22 ; G10L25/84 ; H04R1/40 ; H04R3/00 ; H04S3/00 ; H04L29/06

Abstract:
Audio processing systems and methods include an audio sensor array configured to receive a multichannel audio input and generate a corresponding multichannel audio signal and target-speech detection logic and an automatic speech recognition engine or VoIP application. An audio processing device includes a target speech enhancement engine configured to analyze a multichannel audio input signal and generate a plurality of enhanced target streams, a multi-stream target-speech detection generator comprising a plurality of target-speech detector engines each configured to determine a probability of detecting a specific target-speech of interest in the stream, wherein the multi-stream target-speech detection generator is configured to determine a plurality of weights associated with the enhanced target streams, and a fusion subsystem configured to apply the plurality of weights to the enhanced target streams to generate an enhancement output signal.
Public/Granted literature
- US20200184985A1 MULTI-STREAM TARGET-SPEECH DETECTION AND CHANNEL FUSION Public/Granted day:2020-06-11
Information query