Invention Grant
- Patent Title: Target speaker separation system, device and storage medium
-
Application No.: US17980473Application Date: 2022-11-03
-
Publication No.: US11978470B2Publication Date: 2024-05-07
- Inventor: Jiaming Xu , Jian Cui , Bo Xu
- Applicant: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
- Applicant Address: CN Beijing
- Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
- Current Assignee: INSTITUTE OF AUTOMATION, CHINESE ACADEMY OF SCIENCES
- Current Assignee Address: CN Beijing
- Agency: Westbridge IP LLC
- Priority: CN 2210602186.2 2022.05.30
- Main IPC: G10L21/0272
- IPC: G10L21/0272 ; G10L17/02 ; G10L17/04 ; G10L17/06 ; G10L21/028 ; H04S1/00

Abstract:
Disclosed are a target speaker separation system, an electronic device and a storage medium. The system includes: first, performing, jointly unified modeling on a plurality of cues based a masked pre-training strategy, to boost the inference capability of a model for missing cues and enhance the representation accuracy of disturbed cues; and second, constructing a hierarchical cue modulation module. A spatial cue is introduced into a primary cue modulation module for directional enhancement of a speech of a speaker; in an intermediate cue modulation module, the speech of the speaker is enhanced on the basis of temporal coherence of a dynamic cue and an auditory signal component; a steady-state cue is introduced into an advanced cue modulation module for selective filtering; and finally, the supervised learning capability of simulation data and the unsupervised learning effect of real mixed data are sufficiently utilized.
Public/Granted literature
- US20240005941A1 TARGET SPEAKER SEPARATION SYSTEM, DEVICE AND STORAGE MEDIUM Public/Granted day:2024-01-04
Information query