Invention Publication
- Patent Title: METHOD AND APPARATUS FOR PERFORMING SPEAKER DIARIZATION ON MIXED-BANDWIDTH SPEECH SIGNALS
-
Application No.: US17538604Application Date: 2021-11-30
-
Publication No.: US20230169981A1Publication Date: 2023-06-01
- Inventor: Myungjong KIM , Vijendra Raj APSINGEKAR , Aviral ANSHU , Taeyeon KI
- Applicant: SAMSUNG ELECTRONICS CO., LTD.
- Applicant Address: KR Suwon-si
- Assignee: SAMSUNG ELECTRONICS CO., LTD.
- Current Assignee: SAMSUNG ELECTRONICS CO., LTD.
- Current Assignee Address: KR Suwon-si
- Main IPC: G10L17/06
- IPC: G10L17/06 ; G10L21/0308 ; G10L17/02 ; G10L17/18 ; G06N3/04

Abstract:
An apparatus for processing speech data may include a processor configured to: separate an input speech into speech signals; identify a bandwidth of each of the speech signals; extract speaker embeddings from the speech signals based on the bandwidth of each of the speech signals, using at least one neural network configured to receive the speech signals and output the speaker embeddings; and cluster the speaker embeddings into one or more speaker clusters, each speaker cluster corresponding to a speaker identity.
Public/Granted literature
- US12087307B2 Method and apparatus for performing speaker diarization on mixed-bandwidth speech signals Public/Granted day:2024-09-10
Information query