Publication Number: US12300251B2
Publication Date: 2025-05-13
Application Number: US18070499
Filing Date: 2022-11-29
Applicant: Gwangju Institute of Science and Technology
Inventor: Dong Keon Park, Hong Kook Kim, Ye Chan Yu
IPC: G10L17/18, G10L21/0272, G10L21/0308, G10L25/18
Abstract: The present invention relates to speaker diarization technology and, more specifically, to an end-to-end speaker diarization system and method that use transformer learning with an auxiliary-loss-based residual connection to separate speakers by assigning them to time intervals. Through speaker labeling based on transformer learning with an auxiliary loss, the system and method can differentiate and separate speakers even when their speech overlaps in a multi-speaker environment.
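
Illustrative sketch (not taken from the patent text): the abstract does not give the loss formula, so the following is only a minimal example of how an auxiliary loss on intermediate transformer outputs might be combined with the main diarization loss. It assumes frame-level multi-label speaker activities (so overlapping speech is simply two active labels in one frame), a permutation-invariant binary cross-entropy, and a hypothetical weight `aux_weight`. All function names are illustrative, and the residual connection itself is not modeled here.

```python
import itertools
import math


def bce(probs, labels, eps=1e-7):
    """Mean binary cross-entropy over a frames x speakers grid."""
    total, count = 0.0, 0
    for p_row, y_row in zip(probs, labels):
        for p, y in zip(p_row, y_row):
            p = min(max(p, eps), 1.0 - eps)  # avoid log(0)
            total -= y * math.log(p) + (1 - y) * math.log(1.0 - p)
            count += 1
    return total / count


def pit_bce(probs, labels):
    """Assumed permutation-invariant loss: best BCE over speaker orderings."""
    n_spk = len(labels[0])
    best = math.inf
    for perm in itertools.permutations(range(n_spk)):
        permuted = [[row[i] for i in perm] for row in probs]
        best = min(best, bce(permuted, labels))
    return best


def total_loss(final_probs, intermediate_probs, labels, aux_weight=0.5):
    """Main loss on the final output plus a weighted mean auxiliary loss
    computed on each intermediate layer's output (hypothetical weighting)."""
    main = pit_bce(final_probs, labels)
    if not intermediate_probs:
        return main
    aux = sum(pit_bce(p, labels) for p in intermediate_probs) / len(intermediate_probs)
    return main + aux_weight * aux


# Three frames, two speakers; the middle frame contains overlapping speech.
labels = [[1, 0], [1, 1], [0, 1]]
# Final-layer probabilities with the speaker columns swapped relative to labels.
final_probs = [[0.1, 0.9], [0.8, 0.9], [0.9, 0.2]]
# A less confident intermediate-layer output.
mid_probs = [[0.4, 0.6], [0.6, 0.6], [0.6, 0.4]]

loss = total_loss(final_probs, [mid_probs], labels)
```

In this toy setup the permutation search lets the swapped speaker columns be matched to the correct labels, and the auxiliary term adds a penalty for the less confident intermediate output.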