CLASS QUANTIZATION FOR DISTRIBUTED SPEECH RECOGNITION
    41.
    发明公开
    CLASS QUANTIZATION FOR DISTRIBUTED SPEECH RECOGNITION 有权
    KLASSENQUANTISIERUNG用于分布式语音识别

    公开(公告)号:EP1595249A4

    公开(公告)日:2007-06-20

    申请号:EP04708622

    申请日:2004-02-05

    Applicant: MOTOROLA INC IBM

    CPC classification number: G10L25/93 G10L15/30 G10L25/90 G10L2025/935

    Abstract: A system, method and computer readable medium for quantizing class information and pitch information of audio is disclosed. The method on an information processing system includes receiving audio and capturing a frame of the audio. The method further includes determining a pitch of the frame and calculating a codeword representing the pitch of the frame, wherein a first codeword value indicates an indefinite pitch. The method further includes determining a class of the frame, wherein the class is any one of at least two classes indicating an indefinite pitch and at least one class indicating a definite pitch. The method further includes calculating a codeword representing the class of the frame, wherein the codeword length is the maximum of the minimum number of bits required to represent the at least two classes and the minimum number of bits required to represent the at least one class.

    PITCH QUANTIZATION FOR DISTRIBUTED SPEECH RECOGNITION
    42.
    发明公开
    PITCH QUANTIZATION FOR DISTRIBUTED SPEECH RECOGNITION 有权
    量化用于分布式语音识别

    公开(公告)号:EP1595244A4

    公开(公告)日:2006-03-08

    申请号:EP04708630

    申请日:2004-02-05

    Applicant: MOTOROLA INC IBM

    CPC classification number: G10L19/09 G10L15/30

    Abstract: A system, method and computer readable medium for quantizing pitch information of audio is disclosed. The method includes capturing audio representing a numbered frame of a plurality of numbered frames. The method further includes calculating a class of the frame, wherein a class is any one of a voiced or unvoiced class. If the frame is a voiced class, a pitch is calculated for the frame. If the frame is an even numbered frame and a voiced class, a codeword of a first length is calculated by absolutely quantizing the frame pitch. If the frame is an odd numbered frame and a voiced class and a reliable frame is available, a codeword of a second length is calculated by differentially quantizing the frame pitch. If there is no reliable frame available, a codeword of the second length is calculated by absolutely quantizing the frame pitch.

Patent Agency Ranking