Non-verbal utterance detection apparatus, non-verbal utterance detection method, and program

Invention Grant

US11741989B2 Non-verbal utterance detection apparatus, non-verbal utterance detection method, and program 有权

Please log in to see more content

Patent Title: Non-verbal utterance detection apparatus, non-verbal utterance detection method, and program
Application No.: US17293021

Application Date: 2019-10-31
Publication No.: US11741989B2

Publication Date: 2023-08-29
Inventor: Takashi Nakamura , Takaaki Fukutomi , Kiyoaki Matsui
Applicant: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Applicant Address: JP Tokyo
Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Current Assignee: NIPPON TELEGRAPH AND TELEPHONE CORPORATION
Current Assignee Address: JP Tokyo
Priority: JP 18212666 2018.11.13
International Application: PCT/JP2019/042739 2019.10.31
International Announcement: WO2020/100606A 2020.05.22
Date entered country: 2021-05-11
Main IPC: G10L25/93
IPC: G10L25/93 ; G10L25/24 ; G10L25/30 ; G10L25/51 ; G06N3/02

Non-verbal utterance detection apparatus, non-verbal utterance detection method, and program

Abstract:

Detection precision of a non-verbal sound is improved. An acoustic model storage unit 10A stores an acoustic model that is configured by a deep neural network with a bottleneck structure, and estimates a phoneme state from a sound feature value. A non-verbal sound model storage unit 10B stores a non-verbal sound model that estimates a posterior probability of a non-verbal sound likeliness from the sound feature value and a bottleneck feature value. A sound feature value extraction unit 11 extracts a sound feature value from an input sound signal. A bottleneck feature value estimation unit 12 inputs the sound feature value to the acoustic model and obtains an output of a bottleneck layer of the acoustic model as a bottleneck feature value. A non-verbal sound detection unit 13 inputs the sound feature value and the bottleneck feature value to the non-verbal sound model and obtains the posterior probability of the non-verbal sound likeliness output by the non-verbal sound model.

Public/Granted literature

US20210272587A1 NON-VERBAL UTTERANCE DETECTION APPARATUS, NON-VERBAL UTTERANCE DETECTION METHOD, AND PROGRAM Public/Granted day:2021-09-02

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/93	.判别语音信号之间的浊音和清音部分（G10L25/90优先）