Multi-microphone speech recognition systems and related techniques

    公开(公告)号:US10013981B2

    公开(公告)日:2018-07-03

    申请号:US14732711

    申请日:2015-06-06

    Applicant: Apple Inc.

    CPC classification number: G10L15/32 G10L15/20

    Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.

    PSYCHOACOUSTIC ADAPTIVE NOTCH FILTERING

    公开(公告)号:US20130329909A1

    公开(公告)日:2013-12-12

    申请号:US13790666

    申请日:2013-03-08

    Applicant: APPLE INC.

    CPC classification number: H04R3/002 G10L21/0324 H04R3/00 H04R5/04

    Abstract: Improved systems and methods for psychoacoustic adaptive notch filtering are provided. By accounting for psychoacoustic properties of an audio signal as well as finer characteristics of noise which may be present in the audio signal (e.g., the shape of the spectral density of the noise), more effective strategies for dealing with undesirable components of the audio signal may be realized.

    Abstract translation: 提供了用于心理声学自适应陷波滤波的改进的系统和方法。 通过考虑音频信号的心理声学特性以及可能存在于音频信号中的更精细的噪声特性(例如,噪声的频谱密度的形状),用于处理音频信号的不期望的分量的更有效的策略 可以实现。

    Spatial Audio Controller
    15.
    发明申请

    公开(公告)号:US20250080933A1

    公开(公告)日:2025-03-06

    申请号:US18949726

    申请日:2024-11-15

    Applicant: Apple Inc.

    Abstract: A method performed a local device that is communicatively coupled with several remote devices, the method includes: receiving, from each remote device with which the local device is engaged in a communication session, an input audio stream; receiving, for each remote device, a set parameters; determining, for each input audio stream, whether the input audio stream is to be 1) rendered individually or 2) rendered as a mix of input audio streams based on the set of parameters; for each input audio stream that is determined to be rendered individually, spatially rendering the input audio stream as an individual virtual sound source that contains only that input audio stream; and for input audio streams that are determined to be rendered as the mix of input audio streams, spatially rendering the mix of input audio streams as a single virtual sound source that contains the mix of input audio streams.

    Multi-microphone speech recognition systems and related techniques

    公开(公告)号:US10304462B2

    公开(公告)日:2019-05-28

    申请号:US15871836

    申请日:2018-01-15

    Applicant: Apple Inc.

    Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.

Patent Agency Ranking