21.
    发明专利
    未知

    公开(公告)号:AT329347T

    公开(公告)日:2006-06-15

    申请号:AT03745516

    申请日:2003-02-14

    Applicant: MOTOROLA INC

    Abstract: A system or method for modeling a signal, such as a speech signal, in which harmonic frequencies and amplitudes are identified and the harmonic magnitudes are interpolated to obtain spectral magnitudes at a set of fixed frequencies. An inverse transform is applied to the spectral magnitudes to obtain a pseudo auto-correlation sequence, from which linear prediction coefficients are calculated. From the linear prediction coefficients, model harmonic magnitudes are generated by sampling the spectral envelope defined by the linear prediction coefficients. A set of scale factors are then calculated as the ratio of the harmonic magnitudes to the model harmonic magnitudes and interpolated to obtain a second set of scale factors at the set of fixed frequencies. The spectral envelope magnitudes at the set of fixed frequencies are multiplied by the second set of scale factors to obtain new spectral magnitudes and the process is iterated to obtain final linear prediction coefficients. The signal is modeled by the linear prediction coefficients.

    22.
    发明专利
    未知

    公开(公告)号:BRPI0406765A

    公开(公告)日:2005-12-20

    申请号:BRPI0406765

    申请日:2004-01-13

    Applicant: MOTOROLA INC

    Abstract: A method and apparatus for speech reconstruction within a distributed speech recognition system is provided herein. Missing MFCCs are reconstructed and utilized to generate speech. Particularly, partial recovery of the missing MFCCs is achieved by exploiting the dependence of the missing MFCCs on the transmitted pitch period P as well as on the transmitted MFCCs. Harmonic magnitudes are then obtained from the transmitted and reconstructed MFCCs, and the speech is reconstructed utilizing these harmonic magnitudes.

    METHOD FOR MODELING SPEECH HARMONIC MAGNITUDES

    公开(公告)号:AU2003216276A1

    公开(公告)日:2003-10-13

    申请号:AU2003216276

    申请日:2003-02-14

    Applicant: MOTOROLA INC

    Abstract: A system or method for modeling a signal, such as a speech signal, in which harmonic frequencies and amplitudes are identified and the harmonic magnitudes are interpolated to obtain spectral magnitudes at a set of fixed frequencies. An inverse transform is applied to the spectral magnitudes to obtain a pseudo auto-correlation sequence, from which linear prediction coefficients are calculated. From the linear prediction coefficients, model harmonic magnitudes are generated by sampling the spectral envelope defined by the linear prediction coefficients. A set of scale factors are then calculated as the ratio of the harmonic magnitudes to the model harmonic magnitudes and interpolated to obtain a second set of scale factors at the set of fixed frequencies. The spectral envelope magnitudes at the set of fixed frequencies are multiplied by the second set of scale factors to obtain new spectral magnitudes and the process is iterated to obtain final linear prediction coefficients. The signal is modeled by the linear prediction coefficients.

    26.
    发明专利
    未知

    公开(公告)号:BR0016127A

    公开(公告)日:2002-08-06

    申请号:BR0016127

    申请日:2000-11-30

    Applicant: MOTOROLA INC

    Abstract: A method and apparatus for suppressing acoustic background noise in a communication system. An operating signal-to-noise ratio (SNR) level is reliably evaluated from channel energy (293) and background noise energy (294) values by a SNR level estimator (295). A minimum gain factor and a gain slope are adapted (290) depending on the operating SNR level. Using these adapted values and the channel SNR, the channel gain is selected (233). When the channel SNR is below a certain threshold, the channel is completely noise-like and the gain factor selected is minimum so that the channel is maximally attenuated. When the channel SNR is fairly high, the channel gain selected is 0 dB. For intermediate values of channel SNR, the gain factor selected lies between minimum and 0 dB.

    27.
    发明专利
    未知

    公开(公告)号:FR2739481A1

    公开(公告)日:1997-04-04

    申请号:FR9611654

    申请日:1996-09-25

    Applicant: MOTOROLA INC

    Abstract: A signal that includes noise (301) is sampled to provide a plurality of digital information samples (303). A predetermined number of the digital information samples are grouped as a set (305). Noise suppression is performed on the signal using the following steps. One or more digital representations of silence is attached to the set, forming an extended set (401). A Fourier transform is performed on the extended set, yielding a set of frequency domain coefficients (403), at least some of which are scaled (405). An inverse Fourier transform is performed on the set of scaled frequency domain coefficients to provide a set of time domain samples (407), which are partially overlapped in time and added with a previously formed set of time domain samples (409 and 411), which result is provided with the non-overlapping time domain samples as a noise suppressed version of the signal (413).

    METHOD AND APPARATUS FOR SPEECH RECONSTRUCTION IN A DISTRIBUTED SPEECH RECOGNITION SYSTEM
    29.
    发明公开
    METHOD AND APPARATUS FOR SPEECH RECONSTRUCTION IN A DISTRIBUTED SPEECH RECOGNITION SYSTEM 有权
    方法和设备语音重建分布式语音识别系统

    公开(公告)号:EP1395978A4

    公开(公告)日:2005-09-21

    申请号:EP02709089

    申请日:2002-01-18

    Applicant: MOTOROLA INC

    CPC classification number: G10L15/30 G10L19/00 G10L19/093 G10L25/18

    Abstract: A method of reconstructing speech input at a communication device comprises receiving, at the communication device, encoded data that includes encoded spectral data and encoded energy data of the speech input, the encoded spectral data being encoded as a series of mel-frequency cepstral coefficients. The method further comprises decoding, at the communication device, the encoded spectral data and encoded energy data to determine the spectral data and energy data, wherein decoding comprises: performing an inverse discrete cosine transform on the mel-frequency cepstral coefficients at harmonic mel-frequencies corresponding to a pitch period of the speech input to determine log-spectral magnitudes of the speech input at the harmonic mel-frequencies, and exponentiating the log-spectral magnitudes to determine the spectral magnitudes of the speech input. The method also comprises combining the spectral data and energy data to reconstruct the speech input at the communication device. A communication device for use in distributed speech recognition system is also disclosed.

    Método y aparato para estimar la energía de banda alta en un sistema de extensión del ancho de banda para señales de audio

    公开(公告)号:ES2467966T3

    公开(公告)日:2014-06-13

    申请号:ES09707285

    申请日:2009-02-05

    Abstract: Un método de extensión del ancho de banda que comprende: recibir una señal de audio digital de entrada que comprende una señal de banda estrecha en un primer intervalo de frecuencias; determinar un nivel de energía de banda alta estimado en un segundo intervalo de frecuencias, correspondientes a la señal de audio digital de entrada, donde el segundo intervalo de frecuencias es mayor en frecuencia que el primer intervalo de frecuencias y a la energía de banda alta estimada le falta información para ser estimada y utilizada en la extensión del ancho de banda; y modificar el nivel de energía de banda alta estimado sobre la base de las características de la señal de banda estrecha; donde la etapa de modificar el nivel de energía de banda alta estimado comprende la etapa de modificar el nivel de energía de banda alta estimado sobre la base de una ocurrencia de un ataque / sonido oclusivo; donde los niveles de energía de banda alta estimados de una secuencia de Kmax tramas que empieza en una trama en la cual se ha detectado el ataque / sonido oclusivo son modificados; donde las primeras Kmin tramas son ajustadas a un nivel de energía lo más bajo posible Emin; donde la modificación de los niveles de energía de banda alta estimados continúa hasta la trama Kmax-ésima siempre que el nivel de voz de una trama dentro de la secuencia de Kmax tramas excede un umbral; y donde la modificación del nivel de energía de banda alta estimado viene dada por la disminución del nivel de energía de banda alta en una cantidad fija hasta una trama KT en la que el nivel de voz de la trama excede un umbral y es aumentado de nuevo hacia la energía de banda alta estimada.

Patent Agency Ranking