-
公开(公告)号:US10013981B2
公开(公告)日:2018-07-03
申请号:US14732711
申请日:2015-06-06
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Harvey D. Thornburg , Arvindh Krishnaswamy , Aram M. Lindahl
Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.
-
公开(公告)号:US20180033447A1
公开(公告)日:2018-02-01
申请号:US15225707
申请日:2016-08-01
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Esge B. Andersen , Joshua D. Atkins , Sorin V. Dusan , Vasu Iyengar , Tarun Pruthi , Lalin S. Theverapperuma
IPC: G10L21/028 , G10L25/21 , G10L21/0232
Abstract: An audio system has a housing in which are integrated a number of microphones. A programmed processor accesses the microphone signals and produces a number of acoustic pick up beams based groups of microphones, an estimation of voice activity and an estimation of noise characteristics on each beam. Two or more beams including a voice beam that is used to pick up a desired voice and a noise beam that is used to provide information to estimate ambient noise are adaptively selected from among the plurality of beams, based on thresholds for voice separation and thresholds for noise-matching. Other embodiments are also described and claimed.
-
公开(公告)号:US09865265B2
公开(公告)日:2018-01-09
申请号:US14732715
申请日:2015-06-06
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Harvey D. Thornburg , Arvindh Krishnaswamy , Aram M. Lindahl
CPC classification number: G10L15/34 , G10L15/16 , G10L15/20 , G10L2015/022 , G10L2021/02166
Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.
-
公开(公告)号:US20130329909A1
公开(公告)日:2013-12-12
申请号:US13790666
申请日:2013-03-08
Applicant: APPLE INC.
Inventor: Arvindh Krishnaswamy , Sean A. Ramprashad
IPC: H04R3/00
CPC classification number: H04R3/002 , G10L21/0324 , H04R3/00 , H04R5/04
Abstract: Improved systems and methods for psychoacoustic adaptive notch filtering are provided. By accounting for psychoacoustic properties of an audio signal as well as finer characteristics of noise which may be present in the audio signal (e.g., the shape of the spectral density of the noise), more effective strategies for dealing with undesirable components of the audio signal may be realized.
Abstract translation: 提供了用于心理声学自适应陷波滤波的改进的系统和方法。 通过考虑音频信号的心理声学特性以及可能存在于音频信号中的更精细的噪声特性(例如,噪声的频谱密度的形状),用于处理音频信号的不期望的分量的更有效的策略 可以实现。
-
公开(公告)号:US20250080933A1
公开(公告)日:2025-03-06
申请号:US18949726
申请日:2024-11-15
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Peter D. Callaway , Jae Woo Chang , Martin E. Johnson , Daniel K. Boothe , Kostyantyn Komarov , Patrick Miauton , Christopher M. Garrido , Austin W. Shyu , Karthick Santhanam
IPC: H04S3/00 , G06F3/0487 , H04R3/00 , H04S5/00 , H04S7/00
Abstract: A method performed a local device that is communicatively coupled with several remote devices, the method includes: receiving, from each remote device with which the local device is engaged in a communication session, an input audio stream; receiving, for each remote device, a set parameters; determining, for each input audio stream, whether the input audio stream is to be 1) rendered individually or 2) rendered as a mix of input audio streams based on the set of parameters; for each input audio stream that is determined to be rendered individually, spatially rendering the input audio stream as an individual virtual sound source that contains only that input audio stream; and for input audio streams that are determined to be rendered as the mix of input audio streams, spatially rendering the mix of input audio streams as a single virtual sound source that contains the mix of input audio streams.
-
公开(公告)号:US11456006B2
公开(公告)日:2022-09-27
申请号:US17232027
申请日:2021-04-15
Applicant: Apple Inc.
Inventor: Joseph M. Williams , Sean A. Ramprashad , Nathan de Vries , Nicholas Felton
IPC: G10L25/51 , G10L21/0232 , H04R29/00 , G10L25/06 , H04R1/08 , G10L21/0208
Abstract: A method performed by a processor of an audio source device. The method drives an audio output device of the audio source device to output a sound with an audio output signal. The method obtains a microphone signal from a microphone of the audio source device, the microphone signal capturing the outputted sound. The method determines whether the audio output device is a headset or a loudspeaker based on the microphone signal and configures an acoustic dosimetry process based on the determination.
-
公开(公告)号:US20220180889A1
公开(公告)日:2022-06-09
申请号:US17677850
申请日:2022-02-22
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Lance Jabr , Matthew S. Connolly , Robert D. Silfvast , Sean A. Ramprashad , Carlos Avendano , Miquel Espi Marques
IPC: G10L21/0388 , G10L21/0208 , G10L19/008 , G10L21/0272
Abstract: A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.
-
公开(公告)号:US20210035597A1
公开(公告)日:2021-02-04
申请号:US16940792
申请日:2020-07-28
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Lance Jabr , Matthew S. Connolly , Robert D. Silfvast , Sean A. Ramprashad , Carlos Avendano , Miquel Espi Marques
IPC: G10L21/0388 , G10L21/0272 , G10L19/008 , G10L21/0208
Abstract: A first device obtains, from the array, several audio signals and processes the audio signals to produce a speech signal and one or more ambient signals. The first device processes the ambient signals to produce a sound-object sonic descriptor that has metadata describing a sound object within an acoustic environment. The first device transmits, over a communication data link, the speech signal and the descriptor to a second electronic device that is configured to spatially reproduce the sound object using the descriptor mixed with the speech signal, to produce several mixed signals to drive several speakers.
-
公开(公告)号:US10482899B2
公开(公告)日:2019-11-19
申请号:US15225707
申请日:2016-08-01
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Esge B. Andersen , Joshua D. Atkins , Sorin V. Dusan , Vasu Iyengar , Tarun Pruthi , Lalin S. Theverapperuma
IPC: G10L21/028 , G10L25/21 , G10L21/0216
Abstract: An audio system has a housing in which are integrated a number of microphones. A programmed processor accesses the microphone signals and produces a number of acoustic pick up beams based groups of microphones, an estimation of voice activity and an estimation of noise characteristics on each beam. Two or more beams including a voice beam that is used to pick up a desired voice and a noise beam that is used to provide information to estimate ambient noise are adaptively selected from among the plurality of beams, based on thresholds for voice separation and thresholds for noise-matching. Other embodiments are also described and claimed.
-
公开(公告)号:US10304462B2
公开(公告)日:2019-05-28
申请号:US15871836
申请日:2018-01-15
Applicant: Apple Inc.
Inventor: Sean A. Ramprashad , Harvey D. Thornburg , Arvindh Krishnaswamy , Aram M. Lindahl
Abstract: A speech recognition system for resolving impaired utterances can have a speech recognition engine configured to receive a plurality of representations of an utterance and concurrently to determine a plurality of highest-likelihood transcription candidates corresponding to each respective representation of the utterance. The recognition system can also have a selector configured to determine a most-likely accurate transcription from among the transcription candidates. As but one example, the plurality of representations of the utterance can be acquired by a microphone array, and beamforming techniques can generate independent streams of the utterance across various look directions using output from the microphone array.
-
-
-
-
-
-
-
-
-