METHOD AND SYSTEM OF NEURAL NETWORK DYNAMIC NOISE SUPPRESSION FOR AUDIO PROCESSING

    公开(公告)号:US20250124938A1

    公开(公告)日:2025-04-17

    申请号:US18999380

    申请日:2024-12-23

    Abstract: A method and system of neural network dynamic noise suppression (DNS) is provided for audio processing. The system is a down-scaled DNS model that uses grouping techniques at pointwise convolutional layers to reduce the number of network parameters. According to one technique, audio signal data can be coded into an input vector that that is split into multiple groups, each groups having multiple channels. At a pointwise convolution layer, an output is generated for each group. The outputs can be concatenated to form a single input vector for a next layer of the model. Each group is treated as a channel, such that the reduction in the number of channels reduces the number of parameters used by the neural network. In some examples, the groups are weight sharing groups.

    MICROPHONE CHANNEL SELF-NOISE SILENCING
    4.
    发明公开

    公开(公告)号:US20240223948A1

    公开(公告)日:2024-07-04

    申请号:US18147777

    申请日:2022-12-29

    CPC classification number: H04R3/04 H04R2410/03

    Abstract: A user computing device includes a microphone to generate an audio signal and a self-noise silencer to generate a feature set corresponding to the audio signal, where the input feature identifies, for each of a plurality of frequency components in the audio signal, a respective magnitude value. At least a portion of the feature set is provided as an input to a machine learning model trained to infer frequencies contributing to self-noise generated at the microphone. An attenuation mask is generated, based on an output of the machine learning model, that identifies an attenuation value for at least a subset of the plurality of frequency components. The attenuation mask is applied to at least the subset of the magnitude values of the plurality of frequency components to remove self-noise from the audio signal and generate a denoised version of the audio signal.

    REDUCED LATENCY STREAMING DYNAMIC NOISE SUPPRESSION USING CONVOLUTIONAL NEURAL NETWORKS

    公开(公告)号:US20220084535A1

    公开(公告)日:2022-03-17

    申请号:US17535759

    申请日:2021-11-26

    Abstract: Techniques are provided for dynamic noise suppression. A methodology implementing the techniques according to an embodiment includes generating a magnitude spectrum and a phase spectrum of an input audio signal comprising speech and dynamic noise. The method also includes employing a temporal convolution network (TCN) to generate a separation mask based on the magnitude spectrum. The TCN comprises depth-wise (DW) convolution layers, each DW convolution layer including a state buffer to store a number of previous states of the associated DW convolution layer. The number of stored previous states is based on a dilation factor of the associated DW convolution layer. The method further includes multiplying the separation mask with the magnitude spectrum to separate the speech from the dynamic noise to obtain a denoised magnitude spectrum. The method further includes reconstructing the input audio signal with reduced dynamic noise based on the denoised magnitude spectrum and the phase spectrum.

Patent Agency Ranking