-
公开(公告)号:US20190007780A1
公开(公告)日:2019-01-03
申请号:US15639191
申请日:2017-06-30
Applicant: Apple Inc.
Inventor: Jonathan D. Sheaffer , Joshua D. Atkins , Martin E. Johnson , Stuart J. Wood
IPC: H04S7/00 , G10L19/008 , G06T7/20
Abstract: Image analysis of a video signal is performed to produce first metadata, and audio analysis of a multi-channel sound track associated with the video signal is performed to produce second metadata. A number of time segments of the sound track are processed, wherein each time segment is processed by either (i) spatial filtering of the audio signals or (ii) spatial rendering of the audio signals, not both, wherein for each time segment a decision was made to select between the spatial filtering or the spatial rendering, in accordance with the first and second metadata. A mix of the processed sound track and the video signal is generated. Other embodiments are also described and claimed.
-
公开(公告)号:US12264931B1
公开(公告)日:2025-04-01
申请号:US16783929
申请日:2020-02-06
Applicant: Apple Inc.
Inventor: Katherine S. Shigeoka , Jonathan D. Sheaffer , Andrew P. Bright
Abstract: A method for providing navigation assistance using a head-worn device that has a camera, several microphones, and several speakers. The method captures sound in an environment as a plurality of microphone audio signals, captures, using the camera, a scene of the environment as a digital image, and processes the digital image to detect an object therein. The method selects, in response to the detection of the object, one of several navigation audio rendering modes. The several navigation audio rendering modes include a first mode that activates an acoustic transparency function to cause the speakers to reproduce the sound of the environment, a second mode that sonifies the object and activates the acoustic transparency function, and a third mode that sonifies the object, partially activates the acoustic transparency function, and an activates active noise cancellation function.
-
公开(公告)号:US12149919B2
公开(公告)日:2024-11-19
申请号:US17677808
申请日:2022-02-22
Applicant: Apple Inc.
Inventor: Jonathan D. Sheaffer , Martin E. Johnson , Andrew P. Bright
IPC: H04S7/00 , G10L25/21 , G10L25/60 , H04R1/10 , H04R1/40 , H04R3/00 , H04R5/033 , H04R5/04 , H04W76/14
Abstract: Disclosed are systems and methods for automatically transitioning between communication modes of wearable audio output devices based solely on acoustic analysis. The audio output devices may operate in one of three electroacoustic modes. In the transparency mode, an audio output device may pass through the speech signal of a nearby user. In the peer-to-peer mode, the audio output device may establish a direct low-latency radio frequency (RF) link to another audio output device. In the telephony mode, the audio output device may communicate with another audio output device using networked telephony. The disclosed methods and systems perform acoustic analysis of the near-field speech signal of a local wearer of the audio output device and the far-field speech signal of a remote talker to determine the best mode for the audio output device to use and to seamlessly transition between the modes as the acoustic environment between the wearers changes.
-
公开(公告)号:US20240365040A1
公开(公告)日:2024-10-31
申请号:US18622543
申请日:2024-03-29
Applicant: Apple Inc.
Inventor: Ashok Masilamani , David M. Fischer , John Woodruff , Jonathan D. Sheaffer , Sanket S. Dave , Deepak Iyer , Andrew W. Malta , Erik D. Hornberger
IPC: H04R1/10 , G10K11/00 , G10L21/0208 , G10L21/028 , G10L25/84 , H04S7/00
CPC classification number: H04R1/1016 , G10K11/002 , G10L21/0208 , G10L21/028 , G10L25/84 , H04S7/304 , H04R2460/13 , H04S2400/15
Abstract: A conversation detector processes microphone signals and other sensor signals of a headphone to declare a conversation and configures a filter block to activate a transparency audio signal. It then declares an end to the conversation based on processing one or more of the microphone signals and the other sensor signals, and in response deactivates the transparency audio signal. The conversation detector monitors an idle duration in which an OVAD and a TVAD are both or simultaneously indicating no activity and declares the end to the conversation in response to the idle duration being longer than an idle threshold. Other aspects are also described and claimed.
-
公开(公告)号:US11812194B1
公开(公告)日:2023-11-07
申请号:US16904342
申请日:2020-06-17
Applicant: Apple Inc.
Inventor: James W. Vandyke , Nikolas T. Vitt , Sean A. Ramprashad , Neal D. Evans , Jonathan D. Sheaffer
CPC classification number: H04N7/157 , G02B27/017 , G06F3/013 , H04L12/1822 , H04L63/0428 , H04L65/4038
Abstract: A method performed by a processor of an electronic device. The method presents a computer-generated reality (CGR) setting including a first user and several other users. The method obtains, from a microphone, an audio signal that contains speech of the first user. The method obtains, from a sensor, sensor data that represents a physical characteristic of the first user. The method determines, based on the sensor data, whether to initiate a private conversation between the first user and a second user of the other users, and in accordance with a determination to initiate the private conversation, initiates the private conversation by providing the audio signal to the second user.
-
公开(公告)号:US11726161B1
公开(公告)日:2023-08-15
申请号:US17464559
申请日:2021-09-01
Applicant: Apple Inc.
Inventor: Christopher T. Eubank , Martin E. Johnson , Daniel K. Boothe , Jonathan D. Sheaffer
CPC classification number: G01S3/8083 , G06F16/686 , H04R3/005 , H04S7/301
Abstract: Disclosed are techniques for a multimedia device with audio and video capturing capability to identify an audio device based on acoustic playback signal if the audio device cannot be identified from captured video. The multimedia device may assemble a list of candidate audio devices that are a possible match for the observed audio device from a database of previously recognized audio devices and may transmit commands to the candidate audio devices to play acoustic identification signals. The acoustic identification signals may be audible sound or ultrasonic tone sequences with embedded identification information unique to each audio device. The multimedia device may record and analyze the acoustic identification signals received from any of the candidate audio devices to construct metrics to select the most likely candidate for the observed audio device. The metrics may include time of flight, direction of arrival, received amplitude, direct-to-reverberant ratio (DRR) of the acoustic identification signals.
-
公开(公告)号:US11631393B1
公开(公告)日:2023-04-18
申请号:US17469255
申请日:2021-09-08
Applicant: Apple Inc.
Inventor: Soenke Pelzer , David E. Romblom , Dirk Schroeder , Jonathan D. Sheaffer
IPC: G10K15/08
Abstract: Ray tracing is performed with sound sources and a listener position in a listening environment, to generate impulse responses associated with each of the sound sources. The impulse responses are combined to form a combined impulse response. One or more filters are determined, each corresponding to the sound sources, based on the combined impulse response and the impulse responses. Each filter serves as a correction factor that holds unique acoustic information for the sound source that the filter is associated with. The combined impulse response and the filters can be applied to one or more audio signals that contain the sound sources, resulting in audio having reverberation that is tailored to the various sound sources. Other aspects are described and claimed.
-
公开(公告)号:US20220279305A1
公开(公告)日:2022-09-01
申请号:US17677808
申请日:2022-02-22
Applicant: Apple Inc.
Inventor: Jonathan D. Sheaffer , Martin E. Johnson , Andrew P. Bright
Abstract: Disclosed are systems and methods for automatically transitioning between communication modes of wearable audio output devices based solely on acoustic analysis. The audio output devices may operate in one of three electroacoustic modes. In the transparency mode, an audio output device may pass through the speech signal of a nearby user. In the peer-to-peer mode, the audio output device may establish a direct low-latency radio frequency (RF) link to another audio output device. In the telephony mode, the audio output device may communicate with another audio output device using networked telephony. The disclosed methods and systems perform acoustic analysis of the near-field speech signal of a local wearer of the audio output device and the far-field speech signal of a remote talker to determine the best mode for the audio output device to use and to seamlessly transition between the modes as the acoustic environment between the wearers changes.
-
公开(公告)号:US20220076662A1
公开(公告)日:2022-03-10
申请号:US17525801
申请日:2021-11-12
Applicant: Apple Inc.
Inventor: Robert D. Silfvast , Neal D. Evans , Nikolas T. Vitt , Jonathan D. Sheaffer
Abstract: A method for routing audio content through an electronic device that is to be worn by a user. The method obtains a communication and determines whether the communication is private. In response to determining that the communication is private, the method drives a bone conduction transducer of the electronic device with an audio signal associated with the communication. In response to determining that the communication is not private, however, the method drives a speaker of the electronic device with the audio signal.
-
公开(公告)号:US11252525B2
公开(公告)日:2022-02-15
申请号:US17128910
申请日:2020-12-21
Applicant: Apple Inc.
Inventor: Gaetan R. Lorho , Jonathan D. Sheaffer , Symeon Delikaris Manias , Frank Baumgarte
Abstract: Transfer functions can describe responses of microphones or ears to sounds at different locations on a sphere. The transfer functions can be compressed by determining, based on transfer functions, a) one or more basis transfer functions, and b) spherical harmonics coefficients that describe variations of the transfer functions with respect to spherical coordinates. Other aspects are described and claimed.
-
-
-
-
-
-
-
-
-