Method and apparatus for high performance low bit-rate coding of unvoiced speech
    Type: Granted invention patent
    Status: In force

    Publication number: US06947888B1

    Publication date: 2005-09-20

    Application number: US09690915

    Filing date: 2000-10-17

    Applicant: Pengjun Huang

    Inventor: Pengjun Huang

    CPC classification number: G10L19/12 G10L19/083 G10L19/18 G10L25/93

    Abstract: A low-bit-rate coding technique for unvoiced segments of speech, without loss of quality compared to the conventional Code Excited Linear Prediction (CELP) method operating at a much higher bit rate. A set of gains are derived from a residual signal after whitening the speech signal by a linear prediction filter. These gains are then quantized and applied to a randomly generated sparse excitation. The excitation is filtered, and its spectral characteristics are analyzed and compared to the spectral characteristics of the original residual signal. Based on this analysis, a filter is chosen to shape the spectral characteristics of the excitation to achieve optimal performance.
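The pipeline in the abstract above (whiten, derive gains, quantize, scale a sparse random excitation, pick a shaping filter) can be sketched roughly as follows. This is a minimal illustration, not the patented method: the LPC order, subframe count, 2 dB quantizer step, 25% sparsity, the three candidate filters, and the use of a high-band energy fraction as the "spectral characteristic" are all assumptions made for the example.

```python
import numpy as np

def lpc(x, order=10):
    """LP coefficients a[1..order] by the autocorrelation method."""
    r = np.array([np.dot(x[:len(x) - k], x[k:]) for k in range(order + 1)])
    R = np.array([[r[abs(i - j)] for j in range(order)] for i in range(order)])
    R = R + (1e-9 * r[0] + 1e-12) * np.eye(order)  # tiny ridge keeps the solve stable
    return np.linalg.solve(R, r[1:])

def residual(x, a):
    """Whiten x with the prediction-error filter A(z) = 1 - sum a[k] z^-k."""
    pred = np.zeros_like(x)
    for k, ak in enumerate(a, start=1):
        pred[k:] += ak * x[:-k]
    return x - pred

def subframe_gains(res, n_sub=8):
    """One RMS gain per subframe of the residual (assumed gain definition)."""
    return np.array([np.sqrt(np.mean(s ** 2)) for s in np.array_split(res, n_sub)])

def quantize_gains(g, step_db=2.0):
    """Uniform quantizer in the log domain (hypothetical step size)."""
    db = 20 * np.log10(np.maximum(g, 1e-9))
    return 10 ** (np.round(db / step_db) * step_db / 20)

def sparse_excitation(n, gains, keep=0.25, seed=0):
    """Random noise with only the largest `keep` fraction of samples retained,
    then scaled so each subframe has the requested RMS gain."""
    noise = np.random.default_rng(seed).standard_normal(n)
    noise[np.abs(noise) < np.quantile(np.abs(noise), 1 - keep)] = 0.0
    out = []
    for g, s in zip(gains, np.array_split(noise, len(gains))):
        rms = np.sqrt(np.mean(s ** 2)) or 1.0  # guard against an all-zero subframe
        out.append(g * s / rms)
    return np.concatenate(out)

def high_band_fraction(x):
    """Share of signal energy in the upper half of the spectrum."""
    p = np.abs(np.fft.rfft(x)) ** 2
    return p[len(p) // 2:].sum() / max(p.sum(), 1e-12)

# Hypothetical candidate shaping filters (FIR taps).
CANDIDATE_FILTERS = {"flat": [1.0], "highpass": [1.0, -0.6], "lowpass": [1.0, 0.6]}

def choose_shaping_filter(res, exc):
    """Pick the candidate whose output best matches the residual's spectral tilt."""
    target = high_band_fraction(res)
    def mismatch(name):
        shaped = np.convolve(exc, CANDIDATE_FILTERS[name])[:len(exc)]
        return abs(high_band_fraction(shaped) - target)
    return min(CANDIDATE_FILTERS, key=mismatch)
```

In this sketch only the quantized gains and the chosen filter name would need to be transmitted, since a decoder could regenerate the same excitation from a shared seed. That seed sharing is itself an assumption of the example.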

    METHOD AND APPARATUS FOR IMPROVED DETECTION OF RATE ERRORS IN VARIABLE RATE RECEIVERS
    Type: Invention patent application
    Status: In force

    Publication number: US20100036668A1

    Publication date: 2010-02-11

    Application number: US12537906

    Filing date: 2009-08-07

    CPC classification number: H04L1/08 H04L1/0046 H04L1/201

    Abstract: A system and method for detection of rate determination algorithm errors in variable rate communications system receivers. The disclosed embodiments prevent rate determination algorithm errors from causing audible artifacts such as screeches or beeps. The disclosed system and method detects frames with incorrectly determined data rates and performs frame erasure processing and/or memory state clean up to prevent propagation of distortion across multiple frames. Frames with incorrectly determined data rates are detected by checking illegal rate transitions, reserved bits, validating unused filter type bit combinations and analyzing relationships between fixed code-book gains and linear prediction coefficient gains.
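The checks listed in the abstract above could be combined along the following lines. The abstract does not give the actual rules, so everything concrete here is hypothetical: the rate names, the "at most one rate step down per frame" rule, the zero-valued reserved bits, the set of valid filter-type codes, and the gain plausibility bound are all placeholders chosen for illustration.

```python
from dataclasses import dataclass

RATES = ("eighth", "quarter", "half", "full")  # assumed rate set, lowest to highest

@dataclass
class Frame:
    rate: str            # rate chosen by the receiver's rate determination algorithm
    reserved_bits: int   # assumed to be transmitted as zero
    filter_type: int     # assumed index into a filter table
    fcb_gain: float      # fixed codebook gain
    lpc_gain: float      # linear prediction coefficient gain

VALID_FILTER_TYPES = {0, 1, 2}  # hypothetical: code 3 is unused
MAX_GAIN_PRODUCT = 50.0         # hypothetical plausibility bound

def illegal_transition(prev_rate, rate):
    """Hypothetical rule: the encoder never drops more than one rate step per frame."""
    return RATES.index(prev_rate) - RATES.index(rate) > 1

def rate_error_suspected(prev_rate, frame):
    """True if any consistency check suggests the rate was determined incorrectly."""
    return any((
        illegal_transition(prev_rate, frame.rate),
        frame.reserved_bits != 0,
        frame.filter_type not in VALID_FILTER_TYPES,
        frame.fcb_gain * frame.lpc_gain > MAX_GAIN_PRODUCT,
    ))

def decode_or_erase(prev_rate, frame):
    """Treat a suspect frame as an erasure instead of decoding it."""
    return "erase" if rate_error_suspected(prev_rate, frame) else "decode"
```

Treating a suspect frame as an erasure is what keeps a single misclassified frame from corrupting decoder memory for the frames that follow.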

    Low-complexity encoding/decoding of quantized MDCT spectrum in scalable speech and audio codecs
    Type: Invention patent application
    Status: In force

    Publication number: US20090234644A1

    Publication date: 2009-09-17

    Application number: US12255604

    Filing date: 2008-10-21

    CPC classification number: G10L19/24 G10L19/038

    Abstract: A scalable speech and audio codec is provided that implements combinatorial spectrum encoding. A residual signal is obtained from a Code Excited Linear Prediction (CELP)-based encoding layer, where the residual signal is a difference between an original audio signal and a reconstructed version of the original audio signal. The residual signal is transformed at a Discrete Cosine Transform (DCT)-type transform layer to obtain a corresponding transform spectrum having a plurality of spectral lines. The transform spectrum spectral lines are transformed using a combinatorial position coding technique. The combinatorial position coding technique includes generating a lexicographical index for a selected subset of spectral lines, where each lexicographic index represents one of a plurality of possible binary strings representing the positions of the selected subset of spectral lines. The lexicographical index represents non-zero spectral lines in a binary string in fewer bits than the length of the binary string.
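The position-coding idea in the abstract above can be illustrated with the standard combinatorial ranking of fixed-weight binary strings. This sketch assumes the number of non-zero lines `k` is known to both encoder and decoder and covers positions only (not magnitudes or signs). The function names are made up for the example and do not come from the patent.

```python
from math import comb

def encode_positions(positions, n):
    """Lexicographic rank ('0' < '1') of the n-bit string with ones at `positions`
    among all n-bit strings that have the same number of ones."""
    ones = set(positions)
    remaining = len(ones)
    index = 0
    for i in range(n):
        if i in ones:
            # Every string sharing this prefix but with a 0 here sorts earlier.
            index += comb(n - i - 1, remaining)
            remaining -= 1
    return index

def decode_positions(index, n, k):
    """Inverse of encode_positions: recover the k one-positions from the rank."""
    positions = []
    remaining = k
    for i in range(n):
        if remaining == 0:
            break
        zeros_first = comb(n - i - 1, remaining)
        if index >= zeros_first:
            positions.append(i)
            index -= zeros_first
            remaining -= 1
    return positions
```

For 4 non-zero lines among 16 positions there are `comb(16, 4) = 1820` possible patterns, so the index fits in 11 bits instead of the 16 bits of the raw binary string.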

    Method and apparatus for high performance low bit-rate coding of unvoiced speech

    Publication number: US20050143980A1

    Publication date: 2005-06-30

    Application number: US11066356

    Filing date: 2005-02-24

    Applicant: Pengjun Huang

    Inventor: Pengjun Huang

    CPC classification number: G10L19/12 G10L19/083 G10L19/18 G10L25/93

    Abstract: A low-bit-rate coding technique for unvoiced segments of speech, without loss of quality compared to the conventional Code Excited Linear Prediction (CELP) method operating at a much higher bit rate. A set of gains are derived from a residual signal after whitening the speech signal by a linear prediction filter. These gains are then quantized and applied to a randomly generated sparse excitation. The excitation is filtered, and its spectral characteristics are analyzed and compared to the spectral characteristics of the original residual signal. Based on this analysis, a filter is chosen to shape the spectral characteristics of the excitation to achieve optimal performance.
