A VOICE DETECTOR AND A METHOD FOR SUPPRESSING SUB-BANDS IN A VOICE DETECTOR
    4.
    发明公开
    A VOICE DETECTOR AND A METHOD FOR SUPPRESSING SUB-BANDS IN A VOICE DETECTOR 有权
    同意探测器和方法子带在语音探测器还原

    公开(公告)号:EP1982324A4

    公开(公告)日:2012-01-25

    申请号:EP07709334

    申请日:2007-02-09

    Inventor: SEHLSTEDT MARTIN

    Abstract: Embodiments of the present invention relate to a voice detector receiving an input signal that is divided into sub-signals that represent a frequency sub-band. The voice detector calculates, for each sub-band, a signal-to-noise (SNR) value based on a corresponding sub-signal for each sub-band and a background signal for each sub-band. The voice detector also calculates a power SNR value for each sub-band, where at least one of the power SNR values is calculated based on a non-linear function. The voice detector forms a single value based on the calculated power SNR values and compares the single value and a given threshold value to make a voice activity decision presented on an output port.

    ENERGY CONSERVATIVE MULTI-CHANNEL AUDIO CODING
    5.
    发明公开
    ENERGY CONSERVATIVE MULTI-CHANNEL AUDIO CODING 有权
    节能多声道音频CODING

    公开(公告)号:EP2345027A4

    公开(公告)日:2016-10-12

    申请号:EP09819478

    申请日:2009-09-25

    CPC classification number: G10L19/008

    Abstract: The invention relates to the technical field of audio encoding and/or decoding technologies, and thus concerns an overall encoding procedure and associated decoding procedure. The encoding procedure involves at least two signal encoding processes (S1-S3) operating on signal representations of a set of audio input channels, as well as residual encoding (S7-S8). It also involves a dedicated process (S4-S6) to estimate and encode energies of the audio input channels. Each encoding process is associated with a corresponding decoding process. In the overall decoding procedure the decoded signals from each encoding process are preferably combined such that the output channels are close to the input channels in terms of energy and/or quality. Normally, the combination step also adapts to the possible loss of one or more signal representation in part or in whole, such that the energy and quality is optimized with the signals at hand in the decoder. In this way, the overall quality of the output channels is improved.

    A VOICE DETECTOR AND A METHOD FOR SUPPRESSING SUB-BANDS IN A VOICE DETECTOR
    8.
    发明申请
    A VOICE DETECTOR AND A METHOD FOR SUPPRESSING SUB-BANDS IN A VOICE DETECTOR 审中-公开
    一种语音检测器和一种在语音检测器中抑制子带的方法

    公开(公告)号:WO2007091956A2

    公开(公告)日:2007-08-16

    申请号:PCT/SE2007000118

    申请日:2007-02-09

    Inventor: SEHLSTEDT MARTIN

    Abstract: The present invention relates to a voice detector 30; 51; 61 being responsive to an input signal being divided into sub-signals representing a frequency sub-band, comprising: means to calculate 20, for each sub-band, an SNR value snr[n] based on a corresponding sub-signal for each sub-band and a background signal for each sub-band. The voice detector 30; 51; 61 further comprises: means to calculate 31 n , 21 a power SNR value for each sub-band, wherein at least one of said power SNR values is calculated based on a non- linear function, means to form 22 a single value snr_sum based on the calculated power SNR values, and means to compare 23 said single value snr_sum and a given threshold value vad_thr to make a voice activity decision vad_prim presented on an output port. The invention also relates to a voice activity detector, a node and a method for selectively suppressing sub-bands in a voice detector.

    Abstract translation: 本发明涉及一种语音检测器30; 51; 61响应于被分成表示频率子带的子信号的输入信号,包括:对于每个子带,基于每个子信号的相应子信号计算20的SNR值snr [n] 带和每个子带的背景信号。 语音检测器30; 51; 61还包括:用于计算每个子带的功率SNR值的方法,其中基于非线性函数计算所述功率SNR值中的至少一个,用于形成 22基于所计算的功率SNR值的单值snr_sum,以及将所述单个值snr_sum和给定阈值vad_thr进行比较的装置,以在输出端口上呈现语音活动决策vad_prim。 本发明还涉及一种用于选择性地抑制语音检测器中的子带的语音活动检测器,节点和方法。

    Estimation de forme spectrale à partir de coefficients mdct

    公开(公告)号:MA53768A1

    公开(公告)日:2021-11-30

    申请号:MA53768

    申请日:2020-02-20

    Abstract: L'invention concerne un procédé, un décodeur et un code de programme pour commander un procédé de dissimulation pour une trame audio perdue. Une première trame audio et une seconde trame audio d'un signal audio reçu sont décodées pour obtenir des coefficients de transformée en cosinus discrète modifiée (mdct). Des valeurs d'une première forme spectrale basées sur les coefficients mdct décodés à partir de la première trame audio décodée et des valeurs d'une seconde forme spectrale basées sur des coefficients mdct décodés à partir de la seconde trame audio décodée sont déterminées, les formes spectrales comprenant chacune un certain nombre de sous-bandes. Les valeurs des formes spectrales et des énergies de trame de la première trame audio et de la seconde trame audio sont transformées en représentations d'analyses spectrales basées sur des fft. Une condition transitoire est détectée sur la base des représentations des fft. En réponse à la détection de la condition transitoire, le procédé de dissimulation est modifié par réglage sélectif d'une amplitude de spectre d'un spectre de trame de substitution.

    Estimación de ruido de fondo en señales de audio

    公开(公告)号:ES2819032T3

    公开(公告)日:2021-04-14

    申请号:ES18195924

    申请日:2014-12-01

    Inventor: SEHLSTEDT MARTIN

    Abstract: Un método para la estimación de ruido de fondo en un segmento de señal de audio que comprende una pluralidad de subbandas, comprendiendo el método: calcular una posible estimación de nuevo ruido de subbanda y actualizar una estimación de ruido de subbanda actual con la estimación de nuevo ruido de subbanda si el nuevo valor es menor que el valor actual; y cuando el nivel de energía del segmento de señal de audio es menor que un umbral más alto (202:2) que un nivel de energía mínimo a largo plazo It_min, pero no se detecta ninguna pausa (204:1) en el segmento de señal de audio: - determinar (203) si el segmento de señal de audio comprende música; y - reducir (206) la estimación de ruido de subbanda actual si se determina que el segmento de señal de audio (203:2) comprende música y la estimación de ruido de subbanda actual excede un valor mínimo (205:1).

Patent Agency Ranking