Invention Grant
- Patent Title: Source separation for automatic speech recognition (ASR)
- Application No.: US17437748
- Application Date: 2019-03-10
- Publication No.: US12148441B2
- Publication Date: 2024-11-19
- Inventor: Alon Slapak, Dani Cherkassky
- Applicant: Kardome Technology Ltd.
- Applicant Address: IL Mazor
- Assignee: Kardome Technology Ltd.
- Current Assignee: Kardome Technology Ltd.
- Current Assignee Address: IL Mazor
- Agency: Shichrur & Co.
- International Application: PCT/IB2019/051933 WO 20190310
- International Announcement: WO2020/183219 WO 20200917
- Main IPC: G10L21/0232
- IPC: G10L21/0232 ; G10L15/22 ; G10L21/0208 ; G10L21/0216 ; G10L21/0264

Abstract:
A method for speech enhancement. The method may include: receiving or generating sound samples that represent sound signals received during a given time period by an array of microphones; frequency transforming the sound samples to provide frequency-transformed samples; clustering the frequency-transformed samples to speakers to provide speaker-related clusters, wherein the clustering is based on (i) spatial cues related to the received sound signals and (ii) acoustic cues related to the speakers; determining a relative transfer function for each of the speakers to provide speaker-related relative transfer functions; applying a multiple-input multiple-output (MIMO) beamforming operation on the speaker-related relative transfer functions to provide beamformed signals; and inverse-frequency transforming the beamformed signals to provide speech signals.
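The steps listed in the abstract can be illustrated with a minimal single-frame sketch. It is not the patented implementation, and everything in it is an assumption made for illustration: two microphones, two speakers occupying disjoint frequency bins, a real, frequency-independent level ratio between the microphones as the only spatial cue (the acoustic cues are omitted), a naive DFT in place of a short-time transform, 1-D two-means clustering, and a zero-forcing inverse standing in for the MIMO beamformer. The helper names `dft`, `idft`, `two_means`, and `separate` are hypothetical.

```python
import cmath
import math

def dft(x):
    """Naive discrete Fourier transform (stand-in for a real STFT)."""
    n = len(x)
    return [sum(x[t] * cmath.exp(-2j * math.pi * k * t / n) for t in range(n))
            for k in range(n)]

def idft(X):
    """Inverse of dft(); returns the real part of each sample."""
    n = len(X)
    return [sum(X[k] * cmath.exp(2j * math.pi * k * t / n) for k in range(n)).real / n
            for t in range(n)]

def two_means(values, iters=20):
    """1-D k-means with k=2, seeded at the smallest and largest value."""
    c = [min(values), max(values)]
    labels = [0] * len(values)
    for _ in range(iters):
        labels = [0 if abs(v - c[0]) <= abs(v - c[1]) else 1 for v in values]
        for j in (0, 1):
            members = [v for v, lab in zip(values, labels) if lab == j]
            if members:
                c[j] = sum(members) / len(members)
    return labels, c

def separate(mic1, mic2, floor=1e-6):
    """Toy two-microphone, two-speaker separation of one frame."""
    X1, X2 = dft(mic1), dft(mic2)
    # Spatial cue per active bin: ratio of mic 2 to mic 1 (the reference mic).
    # Taking the real part assumes a purely real level ratio (toy assumption).
    active = [k for k in range(len(X1)) if abs(X1[k]) > floor]
    cues = [(X2[k] / X1[k]).real for k in active]
    # Cluster the bins into two speakers; each centroid is used as that
    # speaker's relative transfer function (RTF) from mic 1 to mic 2.
    _, (a, b) = two_means(cues)
    # Zero-forcing beamformer: invert H = [[1, 1], [a, b]] in every bin.
    # This fails if both speakers share the same RTF (b == a).
    det = b - a
    S_a = [(b * x1 - x2) / det for x1, x2 in zip(X1, X2)]
    S_b = [(x2 - a * x1) / det for x1, x2 in zip(X1, X2)]
    return idft(S_a), idft(S_b), (a, b)

# Synthetic scene: speaker A uses bins 3 and 5, speaker B uses bins 9 and 12.
n = 64
s_a = [math.sin(2 * math.pi * 3 * t / n) + 0.5 * math.sin(2 * math.pi * 5 * t / n)
       for t in range(n)]
s_b = [math.cos(2 * math.pi * 9 * t / n) + 0.3 * math.sin(2 * math.pi * 12 * t / n)
       for t in range(n)]
mic1 = [x + y for x, y in zip(s_a, s_b)]              # reference microphone
mic2 = [0.5 * x + 2.0 * y for x, y in zip(s_a, s_b)]  # assumed level ratios
est_a, est_b, rtfs = separate(mic1, mic2)
print("estimated RTFs:", rtfs)
```

In this toy scene the clustering only has to tell two cue values apart, so the recovered signals match the synthetic sources closely. Real recordings would need framing, frequency-dependent complex RTFs, and a more robust beamformer.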
Public/Granted literature
- US20220148611A1 SPEECH ENHANCEMENT USING CLUSTERING OF CUES Public/Granted day: 2022-05-12