Apparatus, method, and non-transitory computer-readable storage medium for storing program for utterance section detection

Invention Grant

US10755731B2 Apparatus, method, and non-transitory computer-readable storage medium for storing program for utterance section detection 有权

Please log in to see more content

Patent Title: Apparatus, method, and non-transitory computer-readable storage medium for storing program for utterance section detection
Application No.: US15643576

Application Date: 2017-07-07
Publication No.: US10755731B2

Publication Date: 2020-08-25
Inventor: Masanao Suzuki , Chisato Shioda , Nobuyuki Washio
Applicant: FUJITSU LIMITED
Applicant Address: JP Kawasaki
Assignee: FUJITSU LIMITED
Current Assignee: FUJITSU LIMITED
Current Assignee Address: JP Kawasaki
Agency: Staas & Halsey LLP
Priority: com.zzzhc.datahub.patent.etl.us.BibliographicData$PriorityClaim@76baffa9
Main IPC: G10L25/78
IPC: G10L25/78 ; G10L25/84 ; G10L15/04 ; G10L19/08 ; G10L25/90

Apparatus, method, and non-transitory computer-readable storage medium for storing program for utterance section detection

Abstract:

A method for utterance section detection includes: executing pitch gain calculation processing that includes calculating a pitch gain indicating an intensity of periodicity of an audio signal expressing a voice of a speaker for each of frames that are obtained by dividing the audio signal and that each have a predetermined length; and executing utterance section detection processing that includes determining that an utterance section on the audio signal starts when the pitch gain becomes greater than or equal to a first threshold value after a non-utterance section on the audio signal lasts, wherein the utterance section detection processing further includes determining that the utterance section ends when the pitch gain becomes less than a second threshold value lower than the first threshold value after the utterance section lasts.

Public/Granted literature

US20180068677A1 APPARATUS, METHOD, AND NON-TRANSITORY COMPUTER-READABLE STORAGE MEDIUM FOR STORING PROGRAM FOR UTTERANCE SECTION DETECTION Public/Granted day:2018-03-08

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/78	.语音信号存在或不存在的检测（在双向扩音电话系统中通过语音频率切换传输的方向入H04M9/10）