COMPLEX ACOUSTIC RESONANCE SPEECH ANALYSIS SYSTEM
    1.
    发明申请
    COMPLEX ACOUSTIC RESONANCE SPEECH ANALYSIS SYSTEM 审中-公开
    复合声学共振语音分析系统

    公开(公告)号:WO2011068608A2

    公开(公告)日:2011-06-09

    申请号:PCT/US2010054572

    申请日:2010-10-28

    Inventor: KROEKER JOHN P

    CPC classification number: G10L25/15

    Abstract: A method and apparatus are provided for determining an instantaneous frequency and an instantaneous bandwidth of a speech resonance of a speech signal. The method includes receiving a speech signal having a real component; filtering the speech signal so as to generate a plurality of filtered signals such that the real component and an imaginary component of the speech signal are reconstructed; and generating a first estimated frequency and a first estimated bandwidth of a speech resonance of the speech signal based on both a first filtered signal of the plurality of filtered signals and a single-lag delay of the first filtered signal.

    Abstract translation: 提供了一种用于确定语音信号的语音共振的瞬时频率和瞬时带宽的方法和设备。 该方法包括接收具有实分量的语音信号; 对所述语音信号进行滤波以生成多个滤波信号,使得重构所述语音信号的实分量和虚分量; 以及基于所述多个经滤波的信号的第一经滤波的信号和所述第一经滤波的信号的单延迟延迟来产生所述语音信号的语音共振的第一估计频率和第一估计带宽。

    SPEECH-RECOGNITION CIRCUITRY EMPLOYING NONLINEAR PROCESSING, SPEECH ELEMENT MODELING AND PHONEME ESTIMATION

    公开(公告)号:CA2023424C

    公开(公告)日:2001-11-27

    申请号:CA2023424

    申请日:1990-08-16

    Applicant: ELIZA CORP

    Abstract: A phoneme estimator in a speech-recognition system includes energy detect circuitry for detecting the segments of a speech signal that should be analyzed for phoneme content. Speech-element processors then process the speech signal segments, calculating nonlinear representations of the segments. The nonlinear representation data is applied to speech-element modeling circuitry which reduces the data through speech element specific modeling. The reduced data are then subjected to further nonlinear processing. The results of the further nonlinear processing are again applied to speech- element modeling circuitry, producing phoneme isotype estimates. The phoneme isotype estimates are rearranged and consolidated, that is, the estimates are uniformly labeled and duplicated estimates are consolidated, forming estimates of words or phrases containing minimal numbers of phonemes. The estimates may then be compared with stored words or phrases to determine what was spoken.

    COMPLEX ACOUSTIC RESONANCE SPEECH ANALYSIS SYSTEM
    4.
    发明公开
    COMPLEX ACOUSTIC RESONANCE SPEECH ANALYSIS SYSTEM 审中-公开
    声学共振复杂的语言分析系统

    公开(公告)号:EP2507791A4

    公开(公告)日:2014-08-13

    申请号:EP10834909

    申请日:2010-10-28

    Applicant: ELIZA CORP

    Inventor: KROEKER JOHN P

    CPC classification number: G10L25/15

    Abstract: A method and apparatus are provided for determining an instantaneous frequency and an instantaneous bandwidth of a speech resonance of a speech signal. The method includes receiving a speech signal having a real component; filtering the speech signal so as to generate a plurality of filtered signals such that the real component and an imaginary component of the speech signal are reconstructed; and generating a first estimated frequency and a first estimated bandwidth of a speech resonance of the speech signal based on both a first filtered signal of the plurality of filtered signals and a single-lag delay of the first filtered signal.

    A novel approach to speech recognition

    公开(公告)号:AU5295101A

    公开(公告)日:2001-10-08

    申请号:AU5295101

    申请日:2001-03-23

    Applicant: ELIZA CORP

    Inventor: KROEKER JOHN P

    Abstract: A speech recognition system includes a line of service including a first server object coupled to a telephone network for receiving a voice data message from the telephone network, a second server object having a first connection to the first server object for receiving the voice data message from the first server object and converting the voice data message to a phonetic data message, a third server object having a second connection to the second server object for receiving the phonetic data message from the second server object and converting the phonetic data message to a syntactic data message and a fourth server object having a third connection to the third server object for receiving the syntactic data message from the third server object and converting the syntactic data message to a semantic data message, which is representative of the voice data message. The first, second, third and fourth server objects may be remote with respect to each other and the first, second and third connections are formed over a first computer network.

Patent Agency Ranking