Abstract:
A method of detecting a user's voice activity in a mobile device is described herein. The method starts with a voice activity detector (VAD) generating a VAD output based on (i) acoustic signals received from microphones included in the mobile device and (ii) data output by an inertial sensor that is included in an earphone portion of the mobile device. The inertial sensor may detect vibration of the user's vocal chords modulated by the user's vocal tract based on vibrations in bones and tissue of the user's head. A noise suppressor may then receive the acoustic signals from the microphones and the VAD output and suppress the noise included in the acoustic signals received from the microphones based on the VAD output. The method may also include steering one or more beamformers based on the VAD output. Other embodiments are also described.
Abstract:
A media processing system and device 100 with improved power usage characteristics, improved audio functionality and improved media security is provided. Embodiments of the media processing system 100 include an audio processing subsystem 301 that operates independently of the host processor 304 for long periods of time, allowing the host processor 304 to enter a low power state. Other aspects of the media processing system 100 provide for enhanced audio effects such as mixing stored audio samples into real-time telephone audio. Still other aspects of the media processing system 100 provide for improved media security due to the isolation of decrypted audio data from the host processor 304.
Abstract:
This is directed to processing voice inputs received by an electronic device while prompts are provided. In particular, this is directed to providing a sequence of prompts to a user (e.g., voice over prompts) while monitoring for a voice input. When the voice input is received, a characteristic time stamp can be identified for the voice input, and can be compared to periods or windows associated with each of the provided prompts. The electronic device can then determine that the prompt corresponding to a window that includes the characteristic time stamp was the prompt to which the user wished to apply the voice input. The device can process the voice input to extract a user instruction, and apply the instruction to the identified prompt (e.g., and perform an operation associated with the prompt).
Abstract:
Disclosed is a hierarchical spatial resolution codec that adaptively adjusts the representations of immersive audio content as the target bandwidth for delivering the audio content changes. The audio content may be represented by an adaptive number of content types such as channels/objects, higher-order ambisonics (HOA), and encoded by adaptive spatial coding techniques to support the target bitrate of a transmission channel or user. Adaptive spatial coding techniques may include adaptive channel/object spatial encoding techniques to generate an adaptive number of channels/objects, and adaptive HOA spatial encoding or HOA compression techniques to generate an adaptive order of the HOA. The adaptation may be a function of the target bitrate that is associated with a desired quality, and an analysis that determines the priority of the channels, objects, and HOA. High priority channels/objects may be encoded into a high quality bit-stream while low priority channels/objects may be converted and encoded as HOA.
Abstract:
Various techniques for adaptively varying audio feedback data on an electronic device 10 are provided. In one embodiment, an audio user interface 38 implementing certain aspects of the present disclosure may devolve or evolve the verbosity of audio feedback in response to user interface events based at least partially upon the verbosity level of audio feedback provided during previous occurrences of the user interface event. In another embodiment, an audio user interface 38 may be configured to vary the verbosity of audio feedback associated with a navigable list of items based at least partially upon the speed at which a user navigates the list. In a further embodiment, an audio user interface 38 may be configured to vary audio feedback verbosity based upon the contextual importance of a user interface event. Electronic devices 10 implementing the present techniques provide an improved user experience with regard to audio user interfaces.
Abstract:
An electronic device for providing tactile feedback is provided. The electronic device may provide tactile feedback using any suitable approach, including for example vibration, heat, electrical, visual, or any other type of feedback. The electronic device may provide tactile feedback in response to detecting any particular status of the electronic device, receiving any particular input, or detecting any suitable communication received by the electronic device. For example, the electronic device may provide tactile feedback in response to identifying the current network of the device, the status of a particular electronic device component, or any other electronic device status. As another example, the electronic device may provide tactile feedback in response to receiving a particular type of communication, or in response to receiving a communication from a particular contact. As still another example, the electronic device may provide tactile feedback in response to receiving a particular user input, or to detecting a user's finger on a particular portion of the electronic device.
Abstract:
Systems, methods, and devices for user-specific noise suppression are provided. For example, when a voice-related feature of an electronic device (10) is in use, the electronic device (10) may receive an audio signal that includes a user voice. Since noise, such as ambient sounds (60), also may be received by the electronic device (10) at this time, the electronic device (10) may suppress such noise in the audio signal. In particular, the electronic device (10) may suppress the noise in the audio signal while substantially preserving the user voice via user-specific noise suppression parameters. These user-specific noise suppression parameters may be based at least in part on a user noise suppression preference or a user voice profile, or a combination thereof.
Abstract:
Various techniques relating to the buffering of a live audio broadcast on an electronic device 10 and the subsequently playback the buffered data are provided. In one embodiment, the playback speed of the buffered data may be increased relative to the actual speed at which the data was originally broadcasted 126. If the buffered playback (using the increased playback speed) synchronizes or catches up to the live broadcast, the electronic device may disable buffering and output the live stream instead 128. This decreases processing demands by lowering processing cycles required for buffering (encoding, etc.) and playback of the buffered data (decoding, etc.), thereby reducing power consumption.
Abstract:
A media processing system and device 100 with improved power usage characteristics, improved audio functionality and improved media security is provided. Embodiments of the media processing system 100 include an audio processing subsystem 301 that operates independently of the host processor 304 for long periods of time, allowing the host processor 304 to enter a low power state. Other aspects of the media processing system 100 provide for enhanced audio effects such as mixing stored audio samples into realtime telephone audio. Still other aspects of the media processing system 100 provide for improved media security due to the isolation of decrypted audio data from the host processor 304.
Abstract:
Various techniques for adaptively varying audio feedback data on an electronic device 10 are provided. In one embodiment, an audio user interface 38 implementing certain aspects of the present disclosure may devolve or evolve the verbosity of audio feedback in response to user interface events based at least partially upon the verbosity level of audio feedback provided during previous occurrences of the user interface event. In another embodiment, an audio user interface 38 may be configured to vary the verbosity of audio feedback associated with a navigable list of items based at least partially upon the speed at which a user navigates the list. In a further embodiment, an audio user interface 38 may be configured to vary audio feedback verbosity based upon the contextual importance of a user interface event. Electronic devices 10 implementing the present techniques provide an improved user experience with regard to audio user interfaces.