Abstract:
In a speech synthesis apparatus for outputting synthesized speech on the basis of a parameter sequence of a speech waveform, a parameter generation unit generates a parameter sequence for speech synthesis on the basis of a character sequence input by a character sequence input unit, and stores the generated parameter sequence in a parameter storage unit. A waveform generation unit generates pitch waveforms each for one pitch period on the basis of synthesis parameters and pitch scales included in the parameter sequence, and generates a speech waveform by connecting the generated pitch waveforms in accordance with frame lengths set by a frame length setting unit.
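The core of the abstract above is the waveform generation step: produce one waveform per pitch period from the synthesis parameters, then concatenate the pitch waveforms according to the frame lengths. A minimal sketch of that pipeline follows; the decaying-sinusoid "pitch waveform" and all function names are illustrative assumptions, standing in for the (unspecified) synthesis-parameter-driven waveform generator.

```python
import numpy as np

def pitch_waveform(pitch_period, formant_freq=800.0, decay=40.0, sr=16000):
    """Hypothetical one-pitch-period waveform: a decaying sinusoid
    standing in for the waveform derived from the synthesis parameters."""
    n = round(sr * pitch_period)          # samples in one pitch period
    t = np.arange(n) / sr
    return np.exp(-decay * t) * np.sin(2 * np.pi * formant_freq * t)

def synthesize(pitch_periods, sr=16000):
    """Concatenate one-pitch-period waveforms back to back, so the
    pitch contour is set by the sequence of pitch periods."""
    return np.concatenate([pitch_waveform(p, sr=sr) for p in pitch_periods])

# Pitch periods in seconds (8 ms then 7.5 ms, i.e. 125 Hz then ~133 Hz).
speech = synthesize([0.008, 0.008, 0.0075, 0.0075])
```

Because each generated waveform spans exactly one pitch period, the fundamental frequency of the output is controlled directly by the pitch-period sequence, which is how the pitch scales in the parameter sequence steer the synthesized speech.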
Abstract:
There is provided a speech synthesizer including first indication means for indicating the amplitude of an impulse response waveform by using a random number; second indication means for indicating the superposition period for impulse response waveforms by using a random number; impulse response waveform generating means for generating an impulse response waveform having the amplitude indicated by the first indication means; and waveform superposition means for synthesizing an unvoiced speech waveform by superposing an impulse response waveform generated by the impulse response waveform generating means onto an impulse response waveform obtained by delaying the first-mentioned impulse response waveform by the superposition period indicated by the second indication means. The speech synthesizer is thereby capable of making the frequency characteristic of the unvoiced speech section analogous to that of white noise, and of generating natural synthesized speech analogous to an actual human voice.
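The scheme above can be sketched as follows: copies of an impulse response are scaled by a random amplitude (the first indication) and superposed at random delays (the second indication), which randomizes the phase and flattens the spectrum toward that of white noise. The specific impulse response, the uniform amplitude distribution, and the delay range are all illustrative assumptions, not taken from the abstract.

```python
import numpy as np

rng = np.random.default_rng(0)

def impulse_response(length=64, sr=16000):
    """Hypothetical impulse response standing in for the synthesis
    filter's response (a damped cosine is used only for illustration)."""
    t = np.arange(length) / sr
    return np.exp(-300 * t) * np.cos(2 * np.pi * 2000 * t)

def unvoiced(n_samples=1600, ir=None):
    """Superpose randomly scaled, randomly delayed copies of the
    impulse response to build an unvoiced (noise-like) waveform."""
    ir = impulse_response() if ir is None else ir
    out = np.zeros(n_samples + len(ir))       # tail room for the last copy
    pos = 0
    while pos < n_samples:
        amp = rng.uniform(-1.0, 1.0)          # first indication: random amplitude
        out[pos:pos + len(ir)] += amp * ir    # superpose the delayed copy
        pos += int(rng.integers(1, 8))        # second indication: random period
    return out[:n_samples]

noise = unvoiced(1600)
```

With fixed amplitudes and a fixed superposition period the output would be periodic and spectrally peaky; randomizing both is what pushes the spectrum toward the flat characteristic of white noise that the abstract targets.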
Abstract:
Speech including a speech portion and a non-speech portion is input. A long-time cepstral mean of the speech portion and a long-time cepstral mean of the non-speech portion are obtained from the input speech. Each long-time cepstral mean is converted from the cepstral domain to the linear spectral domain, where the non-speech mean is subtracted from the speech mean on the linear spectrum; the difference is converted back to the cepstral domain, the long-time cepstral mean of the speech portion of a training speech database is subtracted from the converted result, and the resulting bias is added to a speech model expressed in cepstra. Thus, even when the noise is large, the estimation accuracy of the line (channel) fluctuation is raised and the recognition rate can be improved.
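The compensation described above can be sketched step by step. This is a simplified model, assuming the cepstrum is the inverse DFT of the log spectrum (so cepstral-to-linear conversion is an FFT followed by exponentiation); all function names and the flooring constant are illustrative.

```python
import numpy as np

def cep_to_linear(c):
    """Cepstrum -> linear spectrum: DFT gives the log spectrum,
    exponentiation gives the linear spectrum (simplified model)."""
    return np.exp(np.real(np.fft.fft(c)))

def linear_to_cep(s):
    """Linear spectrum -> cepstrum: log then inverse DFT."""
    return np.real(np.fft.ifft(np.log(s)))

def compensate(model_cep, mean_cep_speech, mean_cep_nonspeech, train_mean_cep):
    """Sketch of the abstract's scheme: subtract the non-speech (noise)
    long-time mean from the speech long-time mean on the linear spectrum,
    convert back to cepstra, subtract the training-database speech mean,
    and add the resulting bias to the cepstral speech model."""
    diff_lin = cep_to_linear(mean_cep_speech) - cep_to_linear(mean_cep_nonspeech)
    diff_lin = np.maximum(diff_lin, 1e-8)   # flooring keeps the log defined
    bias = linear_to_cep(diff_lin) - train_mean_cep
    return model_cep + bias
```

Subtracting in the linear spectral domain (rather than directly in cepstra) is the key point: additive noise is additive on the linear spectrum, so removing the non-speech mean there isolates the multiplicative channel component before the cepstral bias is applied to the model.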
Abstract:
Disclosed are a method and an apparatus for reading out a feature parameter and driving sound source information stored in a VCV (vowel-consonant-vowel) speech segment file, sequentially connecting the read-out parameter and the read-out sound source information in accordance with a predetermined rule, and supplying the connected data to a speech synthesizer, thereby generating a speech output. The apparatus includes a memory for storing an average power of each vowel, and a power controller for normalizing each VCV segment so that the powers at both ends of the segment coincide with the average power of the corresponding vowel.
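The power-normalization step above can be sketched as follows: measure the short-time power at both ends of a VCV segment, compute the gains that bring those powers to the stored average vowel powers, and apply a gain that varies across the segment so that concatenated segments meet at matching levels. The linear gain interpolation, the edge-window length, and all names are illustrative assumptions.

```python
import numpy as np

def normalize_vcv(segment, start_vowel_avg, end_vowel_avg, edge=32):
    """Scale a VCV waveform segment so that the mean powers of its
    first and last `edge` samples match the stored average powers of
    the starting and ending vowels, interpolating the gain linearly."""
    seg = np.asarray(segment, dtype=float)
    p_start = np.mean(seg[:edge] ** 2)                 # power at the left edge
    p_end = np.mean(seg[-edge:] ** 2)                  # power at the right edge
    g_start = np.sqrt(start_vowel_avg / max(p_start, 1e-12))
    g_end = np.sqrt(end_vowel_avg / max(p_end, 1e-12))
    gain = np.linspace(g_start, g_end, len(seg))       # smooth gain contour
    return seg * gain
```

Because consecutive VCV segments share a vowel at their junction, normalizing both ends of every segment to the per-vowel average power means adjacent segments arrive at the junction with the same level, avoiding audible power discontinuities.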
Abstract:
An object of the invention is to provide a method of generating a state transition model capable of high-speed voice recognition, and to provide a voice recognition method and apparatus using the state transition model. To this end, a method is provided which generates a state transition model in which the state-shared structure of the model is designed, the method including a step of setting the states of a triphone state transition model in an acoustic space as initial clusters; a clustering step of generating clusters containing the initial clusters by top-down clustering; a step of determining a state-shared structure by assigning a short-distance cluster, among the clusters generated by the clustering step, to the state transition model; and a step of learning a state-shared model by analyzing the states of the triphones in accordance with the determined state-shared structure.
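The top-down clustering step above can be illustrated with a toy divisive routine: start with a single cluster holding all triphone state vectors and repeatedly split the highest-variance cluster with a 2-means split until the desired number of shared states remains. The state vectors, the squared-Euclidean distance, and the variance-based split criterion are all illustrative assumptions; the abstract does not specify the distance measure or split rule.

```python
import numpy as np

rng = np.random.default_rng(1)

def top_down_cluster(states, n_clusters):
    """Toy top-down (divisive) clustering of state vectors.
    Repeatedly splits the cluster with the largest total within-cluster
    variance using a 2-means split, until n_clusters clusters remain.
    Assumes n_clusters <= len(states)."""
    X = np.asarray(states, dtype=float)
    clusters = [list(range(len(X)))]          # start: one cluster of all states
    while len(clusters) < n_clusters:
        # Pick the cluster with the largest total variance to split next.
        i = max(range(len(clusters)),
                key=lambda k: X[clusters[k]].var(axis=0).sum() * len(clusters[k]))
        members = clusters.pop(i)
        pts = X[members]
        # 2-means split seeded with two distinct member points.
        c = pts[rng.choice(len(pts), 2, replace=False)]
        lab = np.zeros(len(pts), dtype=int)
        for _ in range(10):
            d = ((pts[:, None, :] - c[None]) ** 2).sum(-1)  # squared distances
            lab = d.argmin(1)
            for j in range(2):
                if (lab == j).any():
                    c[j] = pts[lab == j].mean(0)
        half0 = [m for m, l in zip(members, lab) if l == 0]
        half1 = [m for m, l in zip(members, lab) if l == 1]
        if not half0 or not half1:            # degenerate split: force one out
            half0, half1 = members[:1], members[1:]
        clusters += [half0, half1]
    return clusters
```

In the scheme of the abstract, the resulting clusters would define the state-shared structure: each triphone state is tied to the cluster it falls in (the nearest, short-distance cluster), and the shared model is then retrained under that tying.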