Real-time low-complexity stereo speech enhancement with spatial cue preservation

Invention Grant

US12167223B2 Real-time low-complexity stereo speech enhancement with spatial cue preservation 有权

Please log in to see more content

Patent Title: Real-time low-complexity stereo speech enhancement with spatial cue preservation
Application No.: US17810303

Application Date: 2022-06-30
Publication No.: US12167223B2

Publication Date: 2024-12-10
Inventor: Masahito Togami , Karim Helwani , Jean-Marc Valin , Michael Mark Goodwin
Applicant: Amazon Technologies, Inc.
Applicant Address: US WA Seattle
Assignee: Amazon Technologies, Inc.
Current Assignee: Amazon Technologies, Inc.
Current Assignee Address: US WA Seattle
Agency: Kowert, Hood, Munyon, Rankin & Goetzel, P.C.
Agent S. Scott Foster
Main IPC: H04S7/00
IPC: H04S7/00 ; G10L21/0216 ; H04S1/00

Abstract:

Real-time low-complexity stereo speech enhancement with spatial cue preservation may be performed. A stereo speech enhancement system receives a stereo input signal (e.g., a left and right input signal). The stereo speech enhancement system estimates spatial cues for a target speaker and downmixes the stereo input signal into a monaural signal. A low-complexity model may then process the monaural signal to generate an enhanced monaural signal. The stereo speech enhancement system upmixes the enhanced monaural signal based on the estimated spatial cues for the target speaker, to generate an enhanced stereo output signal.

Public/Granted literature

US20240007817A1 REAL-TIME LOW-COMPLEXITY STEREO SPEECH ENHANCEMENT WITH SPATIAL CUE PRESERVATION Public/Granted day:2024-01-04

Information query

Espacenet

IPC分类:

H	电学
H04	电通信技术
H04S	立体声系统
H04S7/00	指示装置；控制装置，例如平衡控制