-
公开(公告)号:US11533577B2
公开(公告)日:2022-12-20
申请号:US17326208
申请日:2021-05-20
Applicant: Apple Inc.
Inventor: Hassan Taherian , Jonathan Huang , Carlos M. Avendano
Abstract: A method performed by an electronic device in a room. The method performs an enrollment process in which a spatial profile of a location of an artificial sound source is created and performs an identification process that determines whether a sound event within the room is produced by the artificial sound source by 1) capturing the sound event using a microphone array and 2) determining a likelihood that the sound event occurred at the location of the artificial sound source.
-
公开(公告)号:US20240005903A1
公开(公告)日:2024-01-04
申请号:US18346085
申请日:2023-06-30
Applicant: Apple Inc.
Inventor: Yang Lu , Carlos M. Avendano , Tony S. Verma
IPC: G10K11/178 , H04R1/10
CPC classification number: G10K11/17881 , H04R1/1083 , G10L2021/02166
Abstract: Microphone signals of a primary headphone are processed and either a first transparency mode of operation is activated or a second transparency mode of operation. In another aspect, a processor enters different configurations in response to estimated ambient acoustic noise being lower or higher than a threshold, wherein in a first configuration a transparency audio signal is adapted via target voice and wearer voice processing (TVWVP) of a microphone signal to boost detected speech frequencies in the transparency audio signal, and in a second configuration the TVWVP is controlled to, as the estimated ambient acoustic noise increases, reduce boosting of, or not boost at all, the detected speech frequencies in the transparency audio signal. Other aspects are also described and claimed.
-
公开(公告)号:US11810588B2
公开(公告)日:2023-11-07
申请号:US17589889
申请日:2022-01-31
Applicant: Apple Inc.
Inventor: Carlos M. Avendano , John Woodruff , Jonathan Huang , Mehrez Souden , Andreas Koutrouvelis
IPC: H04R29/00 , G10L21/028 , H04R1/10 , G06N20/00 , G10L21/0232
CPC classification number: G10L21/028 , G06N20/00 , G10L21/0232 , H04R1/1016 , H04R1/1041 , H04R1/1083 , H04R2420/07
Abstract: Implementations of the subject technology provide systems and methods for providing audio source separation for audio input, such as for audio devices having limited power and/or computing resources. The subject technology may allow an audio device to leverage processing and/or power resources of a companion device that is communicatively coupled to the audio device. The companion device may identify a noise condition of the audio device, select a source separation model based on the noise condition, and provide the source separation model to the audio device. In this way, the audio device can provide audio source separation functionality using a relatively small footprint source separation model that is specific to the noise condition in which the audio device is operated.
-
公开(公告)号:US20230164507A1
公开(公告)日:2023-05-25
申请号:US18061753
申请日:2022-12-05
Applicant: Apple Inc.
Inventor: Hassan Taherian , Jonathan Huang , Carlos M. Avendano
CPC classification number: H04S7/302 , H04R3/005 , H04S2400/11
Abstract: A method performed by an electronic device in a room. The method performs an enrollment process in which a spatial profile of a location of an artificial sound source is created and performs an identification process that determines whether a sound event within the room is produced by the artificial sound source by 1) capturing the sound event using a microphone array and 2) determining a likelihood that the sound event occurred at the location of the artificial sound source.
-
公开(公告)号:US12272370B2
公开(公告)日:2025-04-08
申请号:US18376438
申请日:2023-10-03
Applicant: Apple Inc.
Inventor: Carlos M. Avendano , John Woodruff , Jonathan Huang , Mehrez Souden , Andreas Koutrouvelis
IPC: H04R29/00 , G06N20/00 , G10L21/0232 , G10L21/028 , H04R1/10
Abstract: Implementations of the subject technology provide systems and methods for providing audio source separation for audio input, such as for audio devices having limited power and/or computing resources. The subject technology may allow an audio device to leverage processing and/or power resources of a companion device that is communicatively coupled to the audio device. The companion device may identify a noise condition of the audio device, select a source separation model based on the noise condition, and provide the source separation model to the audio device. In this way, the audio device can provide audio source separation functionality using a relatively small footprint source separation model that is specific to the noise condition in which the audio device is operated.
-
公开(公告)号:US11941968B2
公开(公告)日:2024-03-26
申请号:US18103486
申请日:2023-01-30
Applicant: Apple Inc.
Inventor: Hyung-Suk Kim , Daniel C. Klingler , Miquel Espi Marques , Carlos M. Avendano
CPC classification number: G08B21/182 , G01H3/005 , G06N20/00 , G08B7/06 , G10L25/51
Abstract: An electronic device includes a processor, and a memory containing instructions that, when executed by the processor, cause the electronic device to learn a sound emitted by a legacy device and to issue an output when the electronic device subsequently hears the sound. For example, the electronic device can receive a training input and extract a compact representation of a sound in the training input, which the device stores. The device can receive an audio signal corresponding to an observed acoustic scene and extract a representation of the observed acoustic scene from the audio signal. The electronic device can determine whether the sound is present in the observed acoustic scene at least in part from a comparison of the representation of the observed acoustic scene with the representation of the sound. The electronic device emits a selected output responsive to determining that the sound is present in the acoustic scene.
-
公开(公告)号:US20220377483A1
公开(公告)日:2022-11-24
申请号:US17326208
申请日:2021-05-20
Applicant: Apple Inc.
Inventor: Hassan Taherian , Jonathan Huang , Carlos M. Avendano
Abstract: A method performed by an electronic device in a room. The method performs an enrollment process in which a spatial profile of a location of an artificial sound source is created and performs an identification process that determines whether a sound event within the room is produced by the artificial sound source by 1) capturing the sound event using a microphone array and 2) determining a likelihood that the sound event occurred at the location of the artificial sound source.
-
公开(公告)号:US20210020018A1
公开(公告)日:2021-01-21
申请号:US16872168
申请日:2020-05-11
Applicant: Apple Inc.
Inventor: Hyung-Suk Kim , Daniel C. Klingler , Miquel Espi Marques , Carlos M. Avendano
Abstract: An electronic device includes a processor, and a memory containing instructions that, when executed by the processor, cause the electronic device to learn a sound emitted by a legacy device and to issue an output when the electronic device subsequently hears the sound. For example, the electronic device can receive a training input and extract a compact representation of a sound in the training input, which the device stores. The device can receive an audio signal corresponding to an observed acoustic scene and extract a representation of the observed acoustic scene from the audio signal. The electronic device can determine whether the sound is present in the observed acoustic scene at least in part from a comparison of the representation of the observed acoustic scene with the representation of the sound. The electronic device emits a selected output responsive to determining that the sound is present in the acoustic scene.
-
公开(公告)号:US10861210B2
公开(公告)日:2020-12-08
申请号:US16033111
申请日:2018-07-11
Applicant: Apple Inc.
Inventor: Carlos M. Avendano , Sean A. Ramprashad
IPC: G10L21/013 , G06T13/20 , G06T13/40 , G10L21/003
Abstract: Embodiments of the present disclosure can provide systems, methods, and computer-readable medium for providing audio and/or video effects based at least in part on facial features and/or voice feature characteristics of the user. For example, video and/or an audio signal of the user may be recorded by a device. Voice audio features and facial feature characteristics may be extracted from the voice audio signal and the video, respectively. The facial features of the user may be used to modify features of a virtual avatar to emulate the facial feature characteristics of the user. The extracted voice audio features may modified to generate an adjusted audio signal or an audio signal may be composed from the voice audio features. The adjusted/composed audio signal may simulate the voice of the virtual avatar. A preview of the modified video/audio may be provided at the user's device.
-
公开(公告)号:US20200090644A1
公开(公告)日:2020-03-19
申请号:US16564775
申请日:2019-09-09
Applicant: Apple Inc.
Inventor: Daniel C. Klingler , Carlos M. Avendano , Hyung-Suk Kim , Miquel Espi Marques
Abstract: An electronic device has one or more microphones that pick up a sound. At least one feature extractor processes the audio signals from the microphones, that contain the picked up the sound, to determine several features for the sound. The electronic device also includes a classifier that has a machine learning model which is configured to determine a sound classification, such as artificial versus natural for the sound, based upon at least one of the determined features. Other aspects are also described and claimed.
-
-
-
-
-
-
-
-
-