-
1.
公开(公告)号:JP2005031632A
公开(公告)日:2005-02-03
申请号:JP2004101094
申请日:2004-03-30
Applicant: ATR ADVANCED TELECOMM RES INST
Inventor: FRANK K SOONG , NAKAMURA SATORU , ASHIKARI YUTAKA , ITO GEN
Abstract: PROBLEM TO BE SOLVED: To provide an utterance section detecting device capable of properly detecting an utterance section without reference to environmental noise. SOLUTION: The utterance section detecting device includes a speech input part 104 which generates speech data in frames, a frame buffer 110 which stores the energy value of the frame-constituted speech in an FIFO basis, an initial environmental noise calculation part 112 which processes energy values of frames in the frame buffer 110 in a specified statistical method to calculate an initial value of an estimated value of environmental noise, a dynamic threshold calculation part 116 which calculates thresholds of energy values for detecting an utterance section by frames according to the energy values stored in the frame buffer 110 to vary following up the environmental noise included in speech data, and a state decision part 118 which decides the states of the frames according to the threshold. COPYRIGHT: (C)2005,JPO&NCIPI