Speaker identification using spatial information

Invention Grant

US09626970B2 Speaker identification using spatial information (in force)

Patent Title: Speaker identification using spatial information
Application No.: US14971401

Application Date: 2015-12-16
Publication No.: US09626970B2

Publication Date: 2017-04-18
Inventor: Shen Huang, Xuejing Sun
Applicant: Dolby Laboratories Licensing Corporation
Applicant Address: US CA San Francisco
Assignee: Dolby Laboratories Licensing Corporation
Current Assignee: Dolby Laboratories Licensing Corporation
Current Assignee Address: US CA San Francisco
Priority: WO PCT/CN2014/094409 2014-12-19
Main IPC: G10L17/00
IPC: G10L17/00; G10L15/30; G10L25/24; G10L25/78

Abstract:

Embodiments of the present invention relate to speaker identification using spatial information. A method of speaker identification for audio content being of a format based on multiple channels is disclosed. The method comprises extracting, from a first audio clip in the format, a plurality of spatial acoustic features across the multiple channels and location information, the first audio clip containing voices from a speaker, and constructing a first model for the speaker based on the spatial acoustic features and the location information, the first model indicating a characteristic of the voices from the speaker. The method further comprises identifying whether the audio content contains voices from the speaker based on the first model. Corresponding system and computer program product are also disclosed.
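
The three steps named in the abstract (extract spatial features and location information, construct a speaker model, identify) can be illustrated with a minimal sketch. It is not the patented method: the abstract does not say which features or model are used, so everything specific here is an assumption chosen for illustration. The sketch assumes two-channel audio, uses the inter-channel level difference as the spatial acoustic feature, uses the inter-channel delay from a cross-correlation peak as a crude stand-in for location information, and models the speaker with a diagonal Gaussian. All function names and the threshold are hypothetical.

```python
import numpy as np


def spatial_features(clip, frame=512):
    """Per-frame features from a (2, n_samples) array.

    Assumed features (not taken from the patent):
      column 0: inter-channel level difference in dB
      column 1: inter-channel delay in samples (location proxy)
    """
    left, right = clip[0], clip[1]
    eps = 1e-12  # avoids log(0) on silent frames
    feats = []
    for i in range(clip.shape[1] // frame):
        l = left[i * frame:(i + 1) * frame]
        r = right[i * frame:(i + 1) * frame]
        ild = 10.0 * np.log10((np.sum(l ** 2) + eps) / (np.sum(r ** 2) + eps))
        # Index (frame - 1) of the full cross-correlation is zero lag.
        xcorr = np.correlate(l, r, mode="full")
        lag = int(np.argmax(xcorr)) - (frame - 1)
        feats.append((ild, lag))
    return np.asarray(feats, dtype=float)


def build_model(feats, var_floor=1e-3):
    """Diagonal Gaussian over the features: (mean, variance)."""
    return feats.mean(axis=0), feats.var(axis=0) + var_floor


def score(model, feats):
    """Average per-frame log-likelihood of the features under the model."""
    mean, var = model
    ll = -0.5 * (np.log(2.0 * np.pi * var) + (feats - mean) ** 2 / var)
    return float(ll.sum(axis=1).mean())


def contains_speaker(model, clip, threshold):
    """Decide whether the clip matches the enrolled speaker model."""
    return score(model, spatial_features(clip)) > threshold
```

In use, a model would be built once from an enrollment clip (the "first audio clip" of the abstract) and then scored against later audio content. The threshold has to be tuned on real data; a practical system would also combine these spatial cues with spectral voice features, which this sketch omits.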

Public/Granted literature

US20160180852A1 SPEAKER IDENTIFICATION USING SPATIAL INFORMATION Public/Granted day: 2016-06-23

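IPC classification:

G	Physics
G10	Musical instruments; acoustics
G10L	Speech analysis or synthesis; speech recognition; speech or voice processing; speech or audio coding or decoding
G10L17/00	Speaker identification or verification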