Method and apparatus for performing speaker diarization on mixed-bandwidth speech signals

Invention Grant

US12087307B2 Method and apparatus for performing speaker diarization on mixed-bandwidth speech signals 有权

Please log in to see more content

Patent Title: Method and apparatus for performing speaker diarization on mixed-bandwidth speech signals
Application No.: US17538604

Application Date: 2021-11-30
Publication No.: US12087307B2

Publication Date: 2024-09-10
Inventor: Myungjong Kim , Vijendra Raj Apsingekar , Aviral Anshu , Taeyeon Ki
Applicant: SAMSUNG ELECTRONICS CO., LTD.
Applicant Address: KR Suwon-si
Assignee: SAMSUNG ELECTRONICS CO., LTD.
Current Assignee: SAMSUNG ELECTRONICS CO., LTD.
Current Assignee Address: KR Suwon-si
Agency: Sughrue Mion, PLLC
Main IPC: G10L17/06
IPC: G10L17/06 ; G10L17/02 ; G10L17/18 ; G10L21/0272 ; G10L21/0308

Method and apparatus for performing speaker diarization on mixed-bandwidth speech signals

Abstract:

An apparatus for processing speech data may include a processor configured to: separate an input speech into speech signals; identify a bandwidth of each of the speech signals; extract speaker embeddings from the speech signals based on the bandwidth of each of the speech signals, using at least one neural network configured to receive the speech signals and output the speaker embeddings; and cluster the speaker embeddings into one or more speaker clusters, each speaker cluster corresponding to a speaker identity.

Public/Granted literature

US20230169981A1 METHOD AND APPARATUS FOR PERFORMING SPEAKER DIARIZATION ON MIXED-BANDWIDTH SPEECH SIGNALS Public/Granted day:2023-06-01

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L17/00	讲话者辨认或验证
G10L17/06	.决策方法，模式适配策略