REAL-TIME DYNAMIC NOISE REDUCTION USING CONVOLUTIONAL NETWORKS

    公开(公告)号:US20210012767A1

    公开(公告)日:2021-01-14

    申请号:US17033605

    申请日:2020-09-25

    Abstract: A system, method and computer readable medium for dynamic noise reduction in a voice call. The system includes an encoder having a short-time Fourier transform module to determine a magnitude spectrum and a phase spectrum of an input audio signal. The input audio signal includes speech and dynamic noise. A separator is coupled to the encoder. The separator comprises a temporal convolution network (TCN) used to develop a separation mask using the magnitude spectrum as input. The TCN is trained using a frequency SNR function used to calculate loss during training. A mixer is coupled to the separator to multiply the separation mask with the magnitude spectrum to separate the speech from the dynamic noise to obtain a denoise magnitude spectrum. The system also includes a decoder coupled to the mixer and the encoder. The decoder includes an inverse short-time Fourier transform module to reconstruct the input audio signal without the dynamic noise using the denoise magnitude spectrum and the phase spectrum.

    Reliable reverberation estimation for improved automatic speech recognition in multi-device systems

    公开(公告)号:US10529353B2

    公开(公告)日:2020-01-07

    申请号:US15837223

    申请日:2017-12-11

    Abstract: A mechanism is described for facilitating multi-device reverberation estimation according to one embodiment. An apparatus of embodiments, as described herein, includes detection and capture logic to facilitate a microphone of a first voice-enabled device of multiple voice-enabled devices to detect a command from a user. The apparatus further includes calculation logic to facilitate a second voice-enabled device and a third voice-enabled device to calculate speech to reverberation modulation energy ratio (SRMR) values based on the command, where the calculation logic us further to estimate reverberation times (RTs) based on the SRMR values. The apparatus further includes decision and application logic to perform dereverberation based on the estimated RTs of the reverberations.

Patent Agency Ranking