TRIFURCATED CHANNEL ENCODING FOR COMPRESSED SPEECH
    11.
    Invention application
    TRIFURCATED CHANNEL ENCODING FOR COMPRESSED SPEECH (under examination, published)

    Publication number: WO1997013242A1

    Publication date: 1997-04-10

    Application number: PCT/US1996013394

    Filing date: 1996-08-19

    Applicant: MOTOROLA INC.

    CPC classification number: G10L19/005

    Abstract: A method and apparatus for protecting compressed speech in very low bit rate voice messaging, comprising the steps of: analyzing compressed input speech data to discriminate between data such as heading, pitch, energy, spectral and timing information (506); providing a plurality of channel encoding methods, wherein the information most error-sensitive for speech replication is given the coding method with the greatest protection and progressively less error-sensitive information is encoded using methods of progressively less protection (510), thereby incurring significantly less overhead in the overall channel encoding process than a standard channel encoding scheme would; passing the output of each encoder to a multiplexer (512), which multiplexes the channel encoded data and sends it via a transmission channel to a de-multiplexer, where the channel encoded data is separated and passed to a plurality of decoders, each designed to decode the data of its paired encoder; passing the decoded data to a digital-to-analog converter, where the digital data is converted to analog data; and passing the analog data to a speech synthesizer, which replicates the input speech.
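
The idea of giving the most error-sensitive fields the strongest code can be sketched as follows. This is only an illustration, not the coding scheme of the application: the three protection levels, the use of repetition codes with majority voting, and all names (`REPEAT`, `encode`, `multiplex`, and so on) are assumptions made for the example.

```python
# Hypothetical protection levels: how many times each bit is repeated.
# The application does not name these codes; repetition is an assumption.
REPEAT = {"high": 5, "medium": 3, "low": 1}


def encode(bits, level):
    """Repeat every bit REPEAT[level] times."""
    n = REPEAT[level]
    return [b for b in bits for _ in range(n)]


def decode(coded, level):
    """Majority-vote each group of REPEAT[level] received bits."""
    n = REPEAT[level]
    out = []
    for i in range(0, len(coded), n):
        group = coded[i:i + n]
        out.append(1 if sum(group) * 2 > len(group) else 0)
    return out


def multiplex(fields):
    """fields: list of (level, bits). Returns the stream and its layout."""
    stream, layout = [], []
    for level, bits in fields:
        coded = encode(bits, level)
        layout.append((level, len(coded)))
        stream.extend(coded)
    return stream, layout


def demultiplex(stream, layout):
    """Split the stream by layout and hand each part to its paired decoder."""
    out, pos = [], 0
    for level, length in layout:
        out.append(decode(stream[pos:pos + length], level))
        pos += length
    return out


fields = [("high", [1, 0, 1]), ("medium", [0, 1]), ("low", [1, 1, 0])]
stream, layout = multiplex(fields)
```

In this toy example the 8 payload bits occupy 24 channel bits, where protecting every field at the strongest level would need 40. The strongly protected field survives two bit errors per group, while the unprotected field survives none.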


    LETTER TO SOUND CONVERSION FOR SYNTHESIZED PRONUNCIATION OF A TEXT SEGMENT
    12.
    Invention publication
    LETTER TO SOUND CONVERSION FOR SYNTHESIZED PRONUNCIATION OF A TEXT SEGMENT (in force)

    Publication number: EP1668629A1

    Publication date: 2006-06-14

    Application number: EP04784356.0

    Filing date: 2004-09-17

    Applicant: MOTOROLA, INC.

    CPC classification number: G10L13/08

    Abstract: There is described a method (200) for text-to-speech synthesis. The method (200) includes receiving (220) a text string and selecting at least one word from the string. The word is then segmented (240) into sub-words forming a sub-word sequence, with at least one of the sub-words comprising at least two letters. The step of identifying (250) provides for identifying phonemes for the sub-words, and step (260) concatenates the phonemes into a phoneme sequence. Speech synthesis (280) is then performed on the phoneme sequence.
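
The segment, identify and concatenate steps can be sketched as below. The abstract does not say how sub-words are chosen, so greedy longest-match segmentation is an assumption, and the `SUBWORD_PHONEMES` table with its phoneme labels is a made-up toy lexicon, not data from the patent.

```python
# Hypothetical sub-word-to-phoneme table; entries are illustrative only.
SUBWORD_PHONEMES = {
    "sh": ["SH"], "ee": ["IY"], "th": ["TH"],
    "s": ["S"], "h": ["HH"], "e": ["EH"],
    "p": ["P"], "t": ["T"], "i": ["IH"], "n": ["N"],
}
MAX_LEN = max(len(key) for key in SUBWORD_PHONEMES)


def segment(word):
    """Greedy longest-match segmentation into known sub-words (an assumption)."""
    word = word.lower()
    subwords, i = [], 0
    while i < len(word):
        for size in range(min(MAX_LEN, len(word) - i), 0, -1):
            piece = word[i:i + size]
            if piece in SUBWORD_PHONEMES:
                subwords.append(piece)
                i += size
                break
        else:
            raise ValueError(f"no sub-word covers {word[i]!r}")
    return subwords


def to_phonemes(text):
    """Identify phonemes per sub-word and concatenate them into one sequence."""
    phonemes = []
    for word in text.split():
        for sub in segment(word):
            phonemes.extend(SUBWORD_PHONEMES[sub])
    return phonemes
```

With this table, `segment("sheep")` yields the sub-words `sh`, `ee`, `p`, two of which have at least two letters, and `to_phonemes` returns the phoneme sequence that a synthesizer would then consume.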

    VERY LOW BIT RATE VOICE MESSAGING SYSTEM USING VARIABLE RATE BACKWARD SEARCH INTERPOLATION PROCESSING
    13.
    Invention publication
    VERY LOW BIT RATE VOICE MESSAGING SYSTEM USING VARIABLE RATE BACKWARD SEARCH INTERPOLATION PROCESSING (lapsed)

    Publication number: EP0850471A1

    Publication date: 1998-07-01

    Application number: EP96922667.0

    Filing date: 1996-07-08

    Applicant: MOTOROLA, INC.

    CPC classification number: G10L19/06 G10L2019/0013

    Abstract: A method and apparatus are provided for low bit rate speech transmission. Speech spectral parameter vectors are generated from a voice message and stored in a sequence of speech spectral parameter vectors within a speech spectral parameter matrix (602). A first index identifying a first speech parameter template (614) corresponding to a first speech spectral parameter vector (604) of the sequence of speech spectral parameter vectors is transmitted. A subsequent speech spectral parameter vector (608) of the sequence is selected and a subsequent speech parameter template (618) is determined having a subsequent index. One or more intervening interpolated speech parameter templates (620) are interpolated between the first speech parameter template (614) and the subsequent speech parameter template (618). The one or more intervening speech spectral parameter vectors (606) are compared to the corresponding one or more intervening interpolated speech parameter templates (620) to derive a distance. The subsequent index is transmitted when the distance derived is less than or equal to a predetermined distance.
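
A minimal sketch of the skip-and-interpolate decision follows. The squared Euclidean distance, linear interpolation, the `max_skip` and `max_dist` parameters, and the rule of trying the farthest frame first and then stepping backward are all assumptions chosen for illustration; the abstract fixes none of them.

```python
def dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(codebook, vec):
    """Index of the template closest to vec."""
    return min(range(len(codebook)), key=lambda k: dist(codebook[k], vec))


def interpolate(a, b, t):
    """Linear interpolation between templates a and b at fraction t."""
    return [x + t * (y - x) for x, y in zip(a, b)]


def encode(vectors, codebook, max_skip=4, max_dist=0.05):
    """Return the (frame, index) pairs to transmit; skipped frames are
    left for the receiver to rebuild by interpolation."""
    sent = [(0, nearest(codebook, vectors[0]))]
    cur = 0
    while cur < len(vectors) - 1:
        start = codebook[sent[-1][1]]
        # Try the farthest candidate frame first, then search backward.
        for nxt in range(min(cur + max_skip, len(vectors) - 1), cur, -1):
            idx = nearest(codebook, vectors[nxt])
            span = nxt - cur
            ok = all(
                dist(vectors[cur + j],
                     interpolate(start, codebook[idx], j / span)) <= max_dist
                for j in range(1, span)
            )
            if ok:  # always true when span == 1 (nothing is skipped)
                break
        sent.append((nxt, idx))
        cur = nxt
    return sent
```

When the spectrum changes smoothly, many frames are covered by one transmitted index; when it changes abruptly, the search falls back to sending every frame, which is what makes the rate variable.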


    IMPROVEMENTS TO AN UTTERANCE WAVEFORM CORPUS
    14.
    Invention publication
    IMPROVEMENTS TO AN UTTERANCE WAVEFORM CORPUS (in force)

    Publication number: EP1668630A1

    Publication date: 2006-06-14

    Application number: EP04784432.9

    Filing date: 2004-09-17

    Applicant: MOTOROLA, INC.

    CPC classification number: G10L13/10 G10L13/06

    Abstract: There is described a method (200) for providing a representation of a waveform for a word. The method (200) includes providing (220) transcriptions representing phrases and corresponding sampled and digitized utterance waveforms of the transcriptions, the transcriptions having marked natural phrase boundaries. The method (200) also provides for clustering (230) parts of the waveforms corresponding to identical words in the transcriptions to provide groups of waveforms for the identical words with similar prosodic features, the clustering being effected when the identical words are positioned in the transcriptions at locations relative to natural phrase boundaries. Each of the groups of waveforms for the identical words is then processed (240) to provide a representative utterance waveform for each group.
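
The grouping criterion, identical words at the same location relative to a phrase boundary, can be sketched on the transcription side alone. The `|` boundary marker, the three position classes and the function name are assumptions for the example; the waveform processing itself is not shown.

```python
from collections import defaultdict


def group_by_position(transcriptions, boundary="|"):
    """Map (word, position) to a list of (transcription_no, word_no).

    position is "initial", "medial" or "final" within a phrase; a
    one-word phrase is counted as "initial". The boundary marker and
    the position classes are assumptions, not taken from the patent.
    """
    groups = defaultdict(list)
    for t_no, text in enumerate(transcriptions):
        word_no = 0
        for phrase in text.split(boundary):
            words = phrase.split()
            for i, word in enumerate(words):
                if i == 0:
                    position = "initial"
                elif i == len(words) - 1:
                    position = "final"
                else:
                    position = "medial"
                groups[(word.lower(), position)].append((t_no, word_no))
                word_no += 1
    return dict(groups)
```

Each resulting group points at the waveform segments that would be clustered together and reduced to one representative waveform.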

    METHOD AND APPARATUS FOR MINIMAL REDUNDANCY ERROR DETECTION AND CORRECTION OF VOICE SPECTRUM PARAMETERS
    15.
    Invention publication
    METHOD AND APPARATUS FOR MINIMAL REDUNDANCY ERROR DETECTION AND CORRECTION OF VOICE SPECTRUM PARAMETERS (lapsed)

    Publication number: EP0900482A1

    Publication date: 1999-03-10

    Application number: EP96925314.0

    Filing date: 1996-07-15

    Applicant: MOTOROLA, INC.

    CPC classification number: G10L19/07 G10L19/005

    Abstract: Error detection and correction of a received message, such as a digitized voice message, is achieved by generating (318) interpolated vectors for each error vector corresponding to a codebook index in a sequence of codebook indexes representing parameters of portions of the message. A plurality of error corrected candidate vectors for the vector corresponding to the codebook index in error are generated (322, 324, 326) by flipping one bit in a sequence of bits representing the codebook index in error. The error corrected candidate vector which has a minimal difference from its corresponding interpolated vector is used (338) to replace the error vector. In the case of digital voice, the vectors are spectral vectors which represent spectral information for a time sample of a voice message. An ordering property of vector components is exploited to detect errors in a received codebook index without parity bits.
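
The correction step can be sketched as below. The sketch takes the position of the damaged index as given and does not implement the ordering-based detection. Interpolating as the midpoint of the two neighbouring frames, the squared Euclidean distance, and the name `correct_index` are assumptions for illustration.

```python
def dist(a, b):
    """Squared Euclidean distance between two vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def correct_index(indexes, bad, codebook, bits):
    """Return a corrected codebook index for the interior frame `bad`.

    indexes: received codebook indexes, one per frame
    bits:    number of bits used to transmit one index
    """
    prev_vec = codebook[indexes[bad - 1]]
    next_vec = codebook[indexes[bad + 1]]
    # Interpolated estimate of the damaged frame (midpoint is an assumption).
    target = [(p + n) / 2 for p, n in zip(prev_vec, next_vec)]
    # Candidates: the received index with exactly one bit flipped.
    candidates = [indexes[bad] ^ (1 << b) for b in range(bits)]
    candidates = [c for c in candidates if c < len(codebook)]
    # Keep the candidate whose vector is closest to the interpolated one.
    return min(candidates, key=lambda c: dist(codebook[c], target))
```

Because neighbouring spectral frames tend to be similar, the candidate nearest the interpolated vector is a plausible repair for a single-bit error, and no parity bits need to be sent.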

    METHOD FOR CLASSIFYING SPEECH DATA
    16.
    Invention application

    Publication number: WO2007076279A3

    Publication date: 2007-07-05

    Application number: PCT/US2006/062032

    Filing date: 2006-12-13

    Abstract: A computationally non-intensive method for classifying real-time speech data is useful for improved animations of avatars. The method includes identifying a voiced speech segment of the speech data (step 410). A high-amplitude spectrum is then determined by performing a spectral analysis on a high-amplitude component of the voiced speech segment (step 415). The high-amplitude spectrum is then classified as a vowel phoneme, where the vowel phoneme is selected from a reduced vowel set (step 440).

    IMPROVEMENTS TO AN UTTERANCE WAVEFORM CORPUS
    17.
    Invention application
    IMPROVEMENTS TO AN UTTERANCE WAVEFORM CORPUS (under examination, published)

    Publication number: WO2005034084A1

    Publication date: 2005-04-14

    Application number: PCT/US2004/030569

    Filing date: 2004-09-17

    CPC classification number: G10L13/10 G10L13/06

    Abstract: There is described a method (200) for providing a representation of a waveform for a word. The method (200) includes providing (220) transcriptions representing phrases and corresponding sampled and digitized utterance waveforms of the transcriptions, the transcriptions having marked natural phrase boundaries. The method (200) also provides for clustering (230) parts of the waveforms corresponding to identical words in the transcriptions to provide groups of waveforms for the identical words with similar prosodic features, the clustering being effected when the identical words are positioned in the transcriptions at locations relative to natural phrase boundaries. Each of the groups of waveforms for the identical words is then processed (240) to provide a representative utterance waveform for each group.


    METHOD AND SYSTEM FOR COMPRESSING HANDWRITTEN CHARACTER TEMPLATES
    18.
    Invention application
    METHOD AND SYSTEM FOR COMPRESSING HANDWRITTEN CHARACTER TEMPLATES (under examination, published)

    Publication number: WO2005034026A2

    Publication date: 2005-04-14

    Application number: PCT/US2004/030554

    Filing date: 2004-09-17

    CPC classification number: G06K9/6255 G06K9/00852 G06K9/6223

    Abstract: A method and system for compressing handwritten character templates. The system includes a codebook generator module (105) for generating a codebook (125). The codebook (125) includes vectors defining the centers of clusters (115) of uncompressed model character feature vectors (110) provided from model character templates. A template compression module (120) is connected to the codebook generator module (105) for comparing the uncompressed model character feature vectors (110) with the codebook (125) to provide compressed templates of model characters (135). Optionally, a template matching module (140) is connected to the template compression module (120) for providing candidate characters (150) by comparing the distances between uncompressed input character feature vectors (130) and the model character templates.
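
The codebook generator and template compression modules can be sketched as follows. The abstract only says that codebook vectors are cluster centres; using plain k-means seeded with the first `k` vectors, squared Euclidean distance, and the function names are assumptions for this example.

```python
def dist(a, b):
    """Squared Euclidean distance between two feature vectors."""
    return sum((x - y) ** 2 for x, y in zip(a, b))


def nearest(codebook, vec):
    """Index of the codebook vector closest to vec."""
    return min(range(len(codebook)), key=lambda k: dist(codebook[k], vec))


def build_codebook(vectors, k, rounds=10):
    """Plain k-means; the first k vectors seed the cluster centres."""
    centres = [list(v) for v in vectors[:k]]
    for _ in range(rounds):
        clusters = [[] for _ in range(k)]
        for v in vectors:
            clusters[nearest(centres, v)].append(v)
        for i, members in enumerate(clusters):
            if members:  # keep the old centre if a cluster is empty
                centres[i] = [sum(col) / len(members) for col in zip(*members)]
    return centres


def compress(template, codebook):
    """Replace each feature vector of a template by its codebook index."""
    return [nearest(codebook, v) for v in template]


features = [[0, 0], [10, 10], [0, 1], [10, 11], [1, 0], [11, 10]]
codebook = build_codebook(features, k=2)
compressed = compress([[0, 1], [11, 10]], codebook)
```

A compressed template stores one small integer per feature vector instead of the vector itself, which is where the saving comes from.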


    VERY LOW BIT RATE VOICE MESSAGING SYSTEM USING ASYMMETRIC VOICE COMPRESSION PROCESSING
    19.
    Invention application
    VERY LOW BIT RATE VOICE MESSAGING SYSTEM USING ASYMMETRIC VOICE COMPRESSION PROCESSING (under examination, published)

    Publication number: WO1997010584A1

    Publication date: 1997-03-20

    Application number: PCT/US1996011340

    Filing date: 1996-06-28

    Applicant: MOTOROLA INC.

    CPC classification number: G10L19/0212 G10L25/27

    Abstract: An apparatus and method for processing a voice message to provide low bit rate speech transmission processes the voice message to generate speech parameters which are arranged into a two dimensional parameter matrix (502) including a sequence of parameter frames. The two dimensional parameter matrix (502) is transformed using a predetermined two dimensional matrix transformation function (414) to obtain a two dimensional transform matrix (506). Distance values representing distances between templates of a set of predetermined templates and the two dimensional transform matrix (506) are then derived. The distance values derived are identified by indexes identifying the templates of the set of predetermined templates. The distance values derived are compared, and an index corresponding to a template of the set of predetermined templates having a shortest distance is selected and then transmitted.
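
The transform-then-match step can be sketched as below. The abstract speaks only of a "predetermined two dimensional matrix transformation function"; using an unnormalized two-dimensional DCT-II here is an assumption, as are the squared Euclidean distance and the function names.

```python
import math


def dct_1d(x):
    """Unnormalized DCT-II of a sequence."""
    n = len(x)
    return [
        sum(x[t] * math.cos(math.pi * (t + 0.5) * k / n) for t in range(n))
        for k in range(n)
    ]


def dct_2d(matrix):
    """Apply the 1-D transform to every row, then to every column."""
    rows = [dct_1d(r) for r in matrix]
    cols = [dct_1d(list(c)) for c in zip(*rows)]
    return [list(r) for r in zip(*cols)]


def dist(a, b):
    """Squared Euclidean distance between two equally sized matrices."""
    return sum((x - y) ** 2 for ra, rb in zip(a, b) for x, y in zip(ra, rb))


def select_index(param_matrix, templates):
    """Transform the parameter matrix and return the index of the
    template (already in the transform domain) at the shortest distance."""
    transformed = dct_2d(param_matrix)
    distances = [dist(transformed, t) for t in templates]
    return min(range(len(templates)), key=distances.__getitem__)


templates = [dct_2d([[1, 2], [3, 4]]), dct_2d([[9, 9], [9, 9]])]
index = select_index([[1.1, 2.0], [3.0, 3.9]], templates)
```

Only the selected index is transmitted, so the cost per block of frames is the number of bits needed to address the template set.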


    LETTER-TO-SOUND CONVERSION FOR SYNTHESIZED PRONUNCIATION OF A TEXT SEGMENT
    20.
    Granted invention patent
    LETTER-TO-SOUND CONVERSION FOR SYNTHESIZED PRONUNCIATION OF A TEXT SEGMENT (in force)

    Publication number: EP1668629B1

    Publication date: 2009-03-11

    Application number: EP04784356.0

    Filing date: 2004-09-17

    Applicant: MOTOROLA, INC.

    CPC classification number: G10L13/08

    Abstract: There is described a method (200) for text-to-speech synthesis. The method (200) includes receiving (220) a text string and selecting at least one word from the string. The word is then segmented (240) into sub-words forming a sub-word sequence, with at least one of the sub-words comprising at least two letters. The step of identifying (250) provides for identifying phonemes for the sub-words, and step (260) concatenates the phonemes into a phoneme sequence. Speech synthesis (280) is then performed on the phoneme sequence.
