31.
    发明专利
    未知

    公开(公告)号:DE69712230D1

    公开(公告)日:2002-05-29

    申请号:DE69712230

    申请日:1997-05-08

    Abstract: An audio decoder solution is here provided where a reduction in computing power is required. The proposed method consists of forcing the multiple output channels to only one type of inverse transformation format. A format of long transform length is more suitable for input signals whose spectrum remains stationary or quasi-stationary. This provides a greater frequency resolution, improved coding performance and a reduction of computing power required. Another format of two or more short transform lengths, possessing greater time resolution, is more desirable for rapidly changing signals with time. The computer power required for two or more short transforms should be higher than for only one transformation. The time versus frequency resolution trade-off should be considered when selecting a transform block length. Advantage is taken of human hearing behaviour to reduce the computing power of a processing engine (e.g. DSP) when downmixing from an M-channel input to a P-channel output is required. The encoder provides spectral information concerning the transmitted audio signal frame. This information corresponds to signals which are stationary/quasi-stationary or changing rapidly with time. Some analysis is required to decide which input channels are forced to long or short block conversion prior to frequency-domain downmixing and transformation.

    33.
    发明专利
    未知

    公开(公告)号:DE602004015409D1

    公开(公告)日:2008-09-11

    申请号:DE602004015409

    申请日:2004-09-27

    Abstract: Pitch detection of speech signals finds numerous applications in karaoke, voice recognition and scoring applications. While most of the existing techniques rely on time domain methods, the invention utilises frequency domain methods. There is provided a method and system for determining the pitch of speech from a speech signal; the method including the steps of: producing or obtaining the speech signal; distinguishing the speech signal into voiced, unvoiced or silence sections using speech signal energy levels; applying a Fourier Transform to the speech signal and obtaining speech signal parameters; determining peaks of the Fourier transformed speech signal; tracking the speech signal parameters of the determined peaks to select partials; and, determining the pitch from the selected partials using a two-way mismatch error calculation.

    ENVIRONMENTAL EFFECTS GENERATOR FOR DIGITAL AUDIO SIGNALS

    公开(公告)号:SG142294A1

    公开(公告)日:2008-05-28

    申请号:SG2007175755

    申请日:2007-11-07

    Abstract: ENVIRONMENTAL EFFECTS GENERATOR FOR DIGITAL AUDIO SIGNALS An device and method of generating environmental reverberation effects for digital audio signals is presented. The device includes a reverberation controller. The reverberation controller pre-processes one or more predetermined characteristics of a first audio signal to produce a pre- processed signal and generates a plurality of delayed outputs from the pre- processed signal, each output having a predetermined delay. The reverberation controller also produces a plurality of reflection outputs from the plurality of delayed outputs and combines the plurality of reflection outputs to produce a second audio signal having a desired reverberation response.

    35.
    发明专利
    未知

    公开(公告)号:DE602004004225D1

    公开(公告)日:2007-02-22

    申请号:DE602004004225

    申请日:2004-09-27

    Abstract: A method for determining whether a data frame of a coded speech signal corresponds to voice or to noise, including the steps of determining the cross-correlation of the data of said data frame; determining the periodicity of the cross-correlation; determining the variance of the periodicity; determining said data frame corresponds to noise if the cross-correlation is lower than a predetermined cross-correlation value; and determining the data corresponds to voice if the variance is less than a predetermined variance value.

    Pitch detection of speech signals
    36.
    发明专利

    公开(公告)号:SG120121A1

    公开(公告)日:2006-03-28

    申请号:SG200305743

    申请日:2003-09-26

    Abstract: Pitch detection of speech signals finds numerous applications in karaoke, voice recognition and scoring applications. While most of the existing techniques rely on time domain methods, the invention utilises frequency domain methods. There is provided a method and system for determining the pitch of speech from a speech signal; the method including the steps of: producing or obtaining the speech signal; distinguishing the speech signal into voiced, unvoiced or silence sections using speech signal energy levels; applying a Fourier Transform to the speech signal and obtaining speech signal parameters; determining peaks of the Fourier transformed speech signal; tracking the speech signal parameters of the determined peaks to select partials; and, determining the pitch from the selected partials using a two-way mismatch error calculation.

    A device and process for encoding audio data

    公开(公告)号:SG120118A1

    公开(公告)日:2006-03-28

    申请号:SG200305637

    申请日:2003-09-15

    Abstract: An MPEG-1 layer 3 audio encoder, including a scalefactor generator for determining first scalefactors for encoding a block of audio data if a temporal masking transient is not detected in said block of audio data; and for selecting the maximum of said scalefactors for encoding said block of audio data if a temporal masking transient is detected in said block of audio data to enable greater compression of said audio data. Increases in quantization error due to use of the maximum scalefactor are pre-masked or post-masked by the temporal masking transient. In cases where the last portion of a block includes a temporal masking transient that masks the preceding portions of the block, the maximum scalefactor is only used to encode the block if the resulting increase in quantization error is less than 30% of the quantization error for the block.

    40.
    发明专利
    未知

    公开(公告)号:DE69808146T2

    公开(公告)日:2003-05-15

    申请号:DE69808146

    申请日:1998-01-12

    Abstract: A method and apparatus for coding audio data in a frequency transform digital audio coder employing differential frequency coefficient exponent coding. Differential coding of exponents places constraints on possible values an exponent can take, which can lead to distortion in the decoded and reconstructed audio signal. The method and apparatus herein can overcome this restriction by mapping the input exponent set to a new set of values which satisfy the differential constraint as well as reducing information loss, thereby minimizing overall signal distortion due to coding restrictions.

Patent Agency Ranking