Invention Grant
- Patent Title: Real-time low-complexity stereo speech enhancement with spatial cue preservation
-
Application No.: US17810303Application Date: 2022-06-30
-
Publication No.: US12167223B2Publication Date: 2024-12-10
- Inventor: Masahito Togami , Karim Helwani , Jean-Marc Valin , Michael Mark Goodwin
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Kowert, Hood, Munyon, Rankin & Goetzel, P.C.
- Agent S. Scott Foster
- Main IPC: H04S7/00
- IPC: H04S7/00 ; G10L21/0216 ; H04S1/00

Abstract:
Real-time low-complexity stereo speech enhancement with spatial cue preservation may be performed. A stereo speech enhancement system receives a stereo input signal (e.g., a left and right input signal). The stereo speech enhancement system estimates spatial cues for a target speaker and downmixes the stereo input signal into a monaural signal. A low-complexity model may then process the monaural signal to generate an enhanced monaural signal. The stereo speech enhancement system upmixes the enhanced monaural signal based on the estimated spatial cues for the target speaker, to generate an enhanced stereo output signal.
Public/Granted literature
- US20240007817A1 REAL-TIME LOW-COMPLEXITY STEREO SPEECH ENHANCEMENT WITH SPATIAL CUE PRESERVATION Public/Granted day:2024-01-04
Information query