21.
    Invention patent
    Unknown

    Publication No.: DE3688747T2

    Publication date: 1993-10-28

    Application No.: DE3688747

    Filing date: 1986-12-18

    Applicant: MOTOROLA INC

    Abstract: The present invention describes a method and arrangement for reducing a sequence of initial frames into a reduced set of representative frames by combining the initial frames into a plurality of representative frames, the combining process including generating a distortion measure associated with each representative frame and comparing each distortion measure to a distortion threshold. From these representative frames, a set of mutually exclusive frames is determined to minimize the number of representative frames, whereby each representative frame in the set represents a unique set of contiguous initial frames and has an associated distortion measure which does not exceed the distortion threshold.
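
    Illustrative sketch (not taken from the patent text): one way to read the abstract is as a segmentation problem, where contiguous frames are grouped so that every group stays within the distortion threshold and the number of groups is as small as possible. The sketch below assumes the representative frame is the per-feature mean of its group and the distortion measure is the largest Euclidean distance from that mean to any member frame; both choices, and the names `mean_frame`, `distortion`, and `reduce_frames`, are hypothetical.

```python
from math import dist


def mean_frame(frames):
    """Representative frame: per-feature average (an assumption of this sketch)."""
    n = len(frames)
    return [sum(col) / n for col in zip(*frames)]


def distortion(frames):
    """Assumed distortion measure: worst-case distance from the mean frame."""
    rep = mean_frame(frames)
    return max(dist(rep, f) for f in frames)


def reduce_frames(frames, threshold):
    """Split frames into the fewest contiguous groups whose distortion
    does not exceed threshold. Returns (start, end, representative) tuples."""
    if threshold < 0:
        raise ValueError("threshold must be non-negative")
    n = len(frames)
    inf = float("inf")
    best = [0] + [inf] * n  # best[j]: fewest groups covering frames[:j]
    back = [0] * (n + 1)    # back[j]: start index of the last group in that cover
    for j in range(1, n + 1):
        for i in range(j - 1, -1, -1):
            if distortion(frames[i:j]) > threshold:
                continue  # this candidate group is too distorted
            if best[i] + 1 < best[j]:
                best[j] = best[i] + 1
                back[j] = i
    groups = []
    j = n
    while j > 0:  # walk the back-pointers from the end of the sequence
        i = back[j]
        groups.append((i, j, mean_frame(frames[i:j])))
        j = i
    groups.reverse()
    return groups
```

    Because each group covers a distinct run of contiguous frames, the groups are mutually exclusive by construction. A single frame always has zero distortion under this measure, so a valid cover exists for any non-negative threshold.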

    22.
    Invention patent
    Unknown

    Publication No.: DE3688614D1

    Publication date: 1993-07-29

    Application No.: DE3688614

    Filing date: 1986-12-18

    Applicant: MOTOROLA INC

    Abstract: Described herein is an arrangement and method for processing speech information in a speech recognition system (300). The invention is best employed in a system where the speech information is depicted as words, each word representing a sequence of frames (510), and where the recognition system has means (120) for comparing present input speech to a word template that is stored in template memory and derived from one or more previous input words. The invention describes combining contiguous acoustically similar frames (512) derived from the previous input word or words into representative frames to form a corresponding reduced word template, storing the reduced word template in template memory in an efficient manner, and comparing frames of the present input speech to the representative frames of the reduced word template according to the number of frames combined in the representative frames of the reduced word template. In doing so, a measure of similarity between the present input speech and the word template is generated.
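
    Illustrative sketch (not taken from the patent text): the abstract does not say how the stored frame counts enter the comparison. The sketch below assumes a simple dynamic-programming alignment in which each representative frame may absorb between one and `stretch * count` consecutive input frames, with Euclidean distance as the local cost. The function name, the `stretch` parameter, and the `(frame, count)` template layout are all hypothetical.

```python
from math import dist


def reduced_template_distance(input_frames, template, stretch=2):
    """Average per-frame distance between an input word and a reduced template.

    template is a list of (representative_frame, count) pairs, where count is
    the number of original frames combined into that representative frame.
    Returns float('inf') when no alignment satisfies the dwell limits.
    """
    n = len(input_frames)
    if n == 0:
        raise ValueError("input_frames must not be empty")
    inf = float("inf")
    # cost[t]: best accumulated distance after the representative frames
    # processed so far have absorbed exactly input_frames[:t]
    cost = [0.0] + [inf] * n
    for rep, count in template:
        new_cost = [inf] * (n + 1)
        max_dwell = stretch * count  # assumed upper bound on absorbed frames
        for t in range(1, n + 1):
            acc = 0.0
            for d in range(1, min(max_dwell, t) + 1):
                # rep absorbs the d input frames ending at position t
                acc += dist(input_frames[t - d], rep)
                if cost[t - d] + acc < new_cost[t]:
                    new_cost[t] = cost[t - d] + acc
        cost = new_cost
    return cost[n] / n
```

    A smaller return value means the input is closer to the template. Storing a count per representative frame keeps the template short while still constraining how long each acoustic segment may last during matching.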

    METHOD AND APPARATUS FOR SYNTHESIZING SPEECH FROM SPEECH RECOGNITION TEMPLATES.
    24.
    Published invention application
    Lapsed

    Publication No.: EP0255523A4

    Publication date: 1988-06-23

    Application No.: EP87900604

    Filing date: 1986-12-22

    Applicant: MOTOROLA INC

    CPC classification number: G10L15/22 G10L13/00 H04M1/271 H04M1/6041

    Abstract: A user-interactive control system for an electronic device which synthesizes speech from speech recognition templates to generate voice reply feedback to the user indicative of which template word was recognized. The acoustic features of the user-spoken speech are extracted by the acoustic processor (110) and applied to the training processor (170) to generate word recognition templates stored in the template memory (160). The recognition processor (120) compares the user-spoken features to the recognition templates to provide voice command data for the device controller (130), which controls the operating parameters of the electronic device (150). The device controller also produces device status data for the synthesis processor (140), which synthesizes a speech reply signal from the word recognition templates. In the preferred embodiment, a hands-free user-interactive control system for a mobile radiotelephone is provided utilizing speech synthesis from speech recognition templates.

    METHOD AND APPARATUS FOR SYNTHESIZING SPEECH WITHOUT VOICING OR PITCH INFORMATION.
    25.
    Published invention application
    Lapsed

    Publication No.: EP0255524A4

    Publication date: 1988-06-23

    Application No.: EP87900607

    Filing date: 1986-12-22

    Applicant: MOTOROLA INC

    CPC classification number: G10L19/02

    Abstract: A channel bank speech synthesizer for reconstructing speech from externally-generated acoustic feature information without using externally-generated voicing or pitch information is disclosed. An N-channel pitch-excited channel bank synthesizer (340) is provided having a first low-frequency group of channel gain values (1 to M) and a second high-frequency group of channel gain values (M+1 to N). The first group controls a first group of amplitude modulators (950) excited by a periodic pitch pulse source (920), and the second group controls amplitude modulators excited by a noise source (930). Both groups of modulated excitation signals are applied to the bandpass filters (960) to reconstruct the speech channels, and then combined at the summation network (970) to form a reconstructed synthesized speech signal. Additionally, the pitch pulse source (920) varies the pitch pulse period such that the pitch pulse rate decreases over the length of the word.
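
    Illustrative sketch (not taken from the patent text): the signal flow in the abstract can be mimicked in a few lines. The sketch below assumes a two-pole resonator as a stand-in for each bandpass filter, uniform random noise as the noise source, and a pulse spacing that grows linearly from 60 to 90 samples; these values, the 8 kHz sample rate, the 80-sample frame length, and all function names are hypothetical.

```python
import math
import random


def pulse_train(num_samples, start_period, end_period):
    """Unit pulses whose spacing grows over the word, so the pulse rate falls."""
    out = [0.0] * num_samples
    pos = 0.0
    while pos < num_samples:
        out[int(pos)] = 1.0
        frac = pos / num_samples  # progress through the word, 0.0 to 1.0
        pos += start_period + (end_period - start_period) * frac
    return out


def resonator(signal, center_hz, sample_rate, r=0.95):
    """Two-pole resonator used here as a simple bandpass stand-in."""
    w = 2 * math.pi * center_hz / sample_rate
    a1, a2 = 2 * r * math.cos(w), -r * r
    y1 = y2 = 0.0
    out = []
    for x in signal:
        y = x + a1 * y1 + a2 * y2
        out.append(y)
        y2, y1 = y1, y
    return out


def synthesize(gains, centers_hz, m, frame_len=80, sample_rate=8000, seed=0):
    """gains holds one list of N channel gains per frame. Channels [0, m) are
    pitch-excited and channels [m, N) are noise-excited."""
    rng = random.Random(seed)
    total = len(gains) * frame_len
    pulses = pulse_train(total, start_period=60, end_period=90)
    noise = [rng.uniform(-1.0, 1.0) for _ in range(total)]
    speech = [0.0] * total
    for ch, center in enumerate(centers_hz):
        source = pulses if ch < m else noise
        # amplitude modulation: scale the excitation by the frame's channel gain
        modulated = [gains[i // frame_len][ch] * source[i] for i in range(total)]
        # bandpass the modulated excitation, then add it into the output sum
        for i, y in enumerate(resonator(modulated, center, sample_rate)):
            speech[i] += y
    return speech
```

    No voicing or pitch input appears anywhere in the call: the split between pulse and noise excitation is fixed by channel index `m`, and the pitch contour is generated internally.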

    OPTIMAL METHOD OF DATA REDUCTION IN A SPEECH RECOGNITION SYSTEM.
    26.
    Published invention application
    Lapsed

    Publication No.: EP0252946A4

    Publication date: 1988-05-31

    Application No.: EP87900588

    Filing date: 1986-12-18

    Applicant: MOTOROLA INC

    CPC classification number: G10L15/00

    Abstract: The present invention describes a method and arrangement for reducing a sequence of initial frames into a reduced set of representative frames by combining the initial frames into a plurality of representative frames, the combining process including generating a distortion measure associated with each representative frame and comparing each distortion measure to a distortion threshold. From these representative frames, a set of mutually exclusive frames is determined to minimize the number of representative frames, whereby each representative frame in the set represents a unique set of contiguous initial frames and has an associated distortion measure which does not exceed the distortion threshold.

    FRAME COMPARISON METHOD FOR WORD RECOGNITION IN HIGH NOISE ENVIRONMENTS.
    28.
    Published invention application
    Lapsed

    Publication No.: EP0255529A4

    Publication date: 1988-06-08

    Application No.: EP87900768

    Filing date: 1986-12-29

    Applicant: MOTOROLA INC

    CPC classification number: G10L15/00

    Abstract: Method and arrangement for a speech recognition system using channel bank information to represent speech. The method considers background noise included with the speech. The method includes determining three energy levels for each channel, the first representative of background noise energy (20), the second representative of the input frame energy (16) and the third representative of the word template frame energy (18). Values representing energy level differentials are assigned at each channel. If the second energy level is less than the first energy level, then a predetermined constant value is assigned at that particular channel. These values are combined to generate a distance measure depicting the similarity between the two frames.
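
    Illustrative sketch (not taken from the patent text): the per-channel rule in the abstract can be written as a short function. The sketch below assumes the energy level differential is the absolute difference between input and template energy, that the per-channel values are combined by summation, and that all three energies are on a common scale. The function name, the `masked_value` parameter, and its default are hypothetical.

```python
def frame_distance(noise, input_frame, template_frame, masked_value=1.0):
    """Distance between an input frame and a template frame, given a
    per-channel background-noise energy estimate.

    Channels where the input energy falls below the noise energy contribute
    the constant masked_value instead of an energy differential.
    """
    if not (len(noise) == len(input_frame) == len(template_frame)):
        raise ValueError("all three inputs need one energy value per channel")
    total = 0.0
    for n, x, t in zip(noise, input_frame, template_frame):
        if x < n:
            total += masked_value  # input is buried in noise on this channel
        else:
            total += abs(x - t)    # assumed differential: absolute difference
    return total
```

    For example, `frame_distance([1, 1, 1], [5, 0.5, 3], [4, 9, 3], masked_value=2.0)` gives 3.0: the first channel contributes 1, the second is below the noise level and contributes the constant 2.0, and the third contributes 0.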
