Recovering method of target speech based on split spectra using sound sources' locational information
    Invention application
    Status: Lapsed

    Publication No.: US20040040621A1

    Publication date: 2004-03-04

    Application No.: US10435135

    Filing date: 2003-05-09

    CPC classification number: G10L21/0208 G10L2021/02165

    Abstract: The present invention relates to a method for recovering target speech from mixed signals, which include the target speech and noise observed in a real-world environment, based on split spectra using the sound sources' locational information. The method includes: a first step of receiving target speech from a target speech source and noise from a noise source, and forming mixed signals of the target speech and the noise at a first microphone and at a second microphone; a second step of Fourier-transforming the mixed signals from the time domain to the frequency domain, decomposing the mixed signals into two separated signals UA and UB by Independent Component Analysis, and, based on the transmission path characteristics of the four different paths from the target speech source and the noise source to the first and second microphones, generating from the separated signal UA a pair of split spectra vA1 and vA2, as received at the first and second microphones respectively, and from the separated signal UB another pair of split spectra vB1 and vB2, as received at the first and second microphones respectively; and a third step of extracting a recovered spectrum of the target speech, in which the split spectra are analyzed by applying criteria based on sound transmission characteristics that depend on the four different distances between the first and second microphones and the target speech and noise sources, and then inverse-Fourier-transforming the recovered spectrum from the frequency domain to the time domain to recover the target speech.
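The second and third steps can be illustrated for a single frequency bin with the following minimal Python sketch. It is not taken from the patent text. It assumes that an ICA stage has already produced a 2x2 unmixing matrix `W`, that each split-spectrum pair is obtained by zeroing one separated signal and mapping back through the inverse of `W`, and that the target source is nearer the first microphone while the noise source is nearer the second. The function names `split_spectra` and `pick_target`, the magnitude-difference criterion, and all numeric values are hypothetical choices made for illustration.

```python
import numpy as np

def split_spectra(W, x):
    """Split one frequency bin into per-source, per-microphone components.

    W: (2, 2) complex unmixing matrix for this bin (assumed to come from ICA).
    x: (2,) mixed spectra observed at microphones 1 and 2.
    Returns (vA, vB); each is [component at mic 1, component at mic 2].
    """
    u = W @ x                      # separated signals UA, UB
    W_inv = np.linalg.inv(W)
    vA = W_inv @ np.array([u[0], 0.0])   # what UA contributed at each mic
    vB = W_inv @ np.array([0.0, u[1]])   # what UB contributed at each mic
    return vA, vB

def pick_target(vA, vB):
    """Assumed criterion: the target is nearer mic 1, so its split spectrum
    is stronger at mic 1 than at mic 2; the noise shows the opposite."""
    dA = abs(vA[0]) - abs(vA[1])
    dB = abs(vB[0]) - abs(vB[1])
    return vA[0] if dA > dB else vB[0]

# Synthetic bin: H[i, j] is the path gain from source j to microphone i+1.
H = np.array([[1.0, 0.4],
              [0.5, 1.0]], dtype=complex)
s = np.array([2 + 1j, -1 + 0.5j])        # [target, noise] spectra
x = H @ s                                 # mixed spectra at the two mics

# ICA recovers the sources only up to permutation and scaling; emulate that.
P = np.array([[0, 1], [1, 0]], dtype=complex)   # outputs swapped
D = np.diag([3j, 0.2])                            # arbitrary scaling
W = D @ P @ np.linalg.inv(H)

vA, vB = split_spectra(W, x)
target_at_mic1 = pick_target(vA, vB)
```

In this synthetic case the split spectra cancel the arbitrary scaling in `W`, and the magnitude comparison resolves the swapped output order, so `target_at_mic1` equals the target's contribution at the first microphone, `H[0, 0] * s[0]`. A full implementation would repeat this for every frequency bin and frame, then apply the inverse Fourier transform to the recovered spectrum.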

