SYSTEM AND METHOD OF DETECTING A USER'S VOICE ACTIVITY USING AN ACCELEROMETER
    1.
    发明申请
    SYSTEM AND METHOD OF DETECTING A USER'S VOICE ACTIVITY USING AN ACCELEROMETER 审中-公开
    使用加速度计检测用户声音活动的系统和方法

    公开(公告)号:WO2014051969A1

    公开(公告)日:2014-04-03

    申请号:PCT/US2013/058551

    申请日:2013-09-06

    Applicant: APPLE INC.

    CPC classification number: H04R3/005 G10L25/78 G10L2021/02165

    Abstract: A method of detecting a user's voice activity in a mobile device is described herein. The method starts with a voice activity detector (VAD) generating a VAD output based on (i) acoustic signals received from microphones included in the mobile device and (ii) data output by an inertial sensor that is included in an earphone portion of the mobile device. The inertial sensor may detect vibration of the user's vocal chords modulated by the user's vocal tract based on vibrations in bones and tissue of the user's head. A noise suppressor may then receive the acoustic signals from the microphones and the VAD output and suppress the noise included in the acoustic signals received from the microphones based on the VAD output. The method may also include steering one or more beamformers based on the VAD output. Other embodiments are also described.

    Abstract translation: 这里描述了在移动设备中检测用户的语音活动的方法。 该方法从基于(i)从移动设备中包括的麦克风接收的声信号产生VAD输出的语音活动检测器(VAD)开始,以及(ii)包括在移动设备的耳机部分中的惯性传感器输出的数据 设备。 惯性传感器可以基于用户头部的骨骼和组织中的振动来检测用户的声带调制的用户的声带的振动。 然后,噪声抑制器可以从麦克风和VAD输出接收声信号,并且基于VAD输出抑制包括在从麦克风接收的声信号中的噪声。 该方法还可以包括基于VAD输出来操纵一个或多个波束形成器。 还描述了其它实施例。

    MEDIA PROCESSING METHOD AND DEVICE
    2.
    发明申请
    MEDIA PROCESSING METHOD AND DEVICE 审中-公开
    媒体处理方法和装置

    公开(公告)号:WO2010017034A1

    公开(公告)日:2010-02-11

    申请号:PCT/US2009/051436

    申请日:2009-07-22

    Abstract: A media processing system and device 100 with improved power usage characteristics, improved audio functionality and improved media security is provided. Embodiments of the media processing system 100 include an audio processing subsystem 301 that operates independently of the host processor 304 for long periods of time, allowing the host processor 304 to enter a low power state. Other aspects of the media processing system 100 provide for enhanced audio effects such as mixing stored audio samples into real-time telephone audio. Still other aspects of the media processing system 100 provide for improved media security due to the isolation of decrypted audio data from the host processor 304.

    Abstract translation: 提供了具有改进的功率使用特性,改进的音频功能和改进的媒体安全性的媒体处理系统和设备100。 媒体处理系统100的实施例包括音频处理子系统301,其独立于主处理器304长时间运行,允许主机处理器304进入低功率状态。 媒体处理系统100的其他方面提供增强的音频效果,例如将存储的音频样本混合成实时电话音频。 媒体处理系统100的其他方面由于从主机处理器304隔离解密的音频数据来提供改善的媒体安全性。

    PROCESSING OF VOICE INPUTS
    3.
    发明申请
    PROCESSING OF VOICE INPUTS 审中-公开
    语音输入的处理

    公开(公告)号:WO2011088038A1

    公开(公告)日:2011-07-21

    申请号:PCT/US2011/020825

    申请日:2011-01-11

    CPC classification number: G10L15/22 G06F3/167 G10L2015/228

    Abstract: This is directed to processing voice inputs received by an electronic device while prompts are provided. In particular, this is directed to providing a sequence of prompts to a user (e.g., voice over prompts) while monitoring for a voice input. When the voice input is received, a characteristic time stamp can be identified for the voice input, and can be compared to periods or windows associated with each of the provided prompts. The electronic device can then determine that the prompt corresponding to a window that includes the characteristic time stamp was the prompt to which the user wished to apply the voice input. The device can process the voice input to extract a user instruction, and apply the instruction to the identified prompt (e.g., and perform an operation associated with the prompt).

    Abstract translation: 这是针对在提供提示时处理由电子设备接收的语音输入。 特别地,这旨在在监视语音输入的同时向用户提供一系列提示(例如,语音提示)。 当接收到语音输入时,可以识别用于语音输入的特征时间戳,并且可以与与所提供的提示中的每一个相关联的周期或窗口进行比较。 电子设备然后可以确定对应于包括特征时间戳的窗口的提示是用户希望应用语音输入的提示。 设备可以处理语音输入以提取用户指令,并将该指令应用于所识别的提示(例如,并执行与提示相关的操作)。

    Hierarchical Spatial Resolution Codec
    4.
    发明申请

    公开(公告)号:WO2022066370A1

    公开(公告)日:2022-03-31

    申请号:PCT/US2021/048354

    申请日:2021-08-31

    Applicant: APPLE INC.

    Abstract: Disclosed is a hierarchical spatial resolution codec that adaptively adjusts the representations of immersive audio content as the target bandwidth for delivering the audio content changes. The audio content may be represented by an adaptive number of content types such as channels/objects, higher-order ambisonics (HOA), and encoded by adaptive spatial coding techniques to support the target bitrate of a transmission channel or user. Adaptive spatial coding techniques may include adaptive channel/object spatial encoding techniques to generate an adaptive number of channels/objects, and adaptive HOA spatial encoding or HOA compression techniques to generate an adaptive order of the HOA. The adaptation may be a function of the target bitrate that is associated with a desired quality, and an analysis that determines the priority of the channels, objects, and HOA. High priority channels/objects may be encoded into a high quality bit-stream while low priority channels/objects may be converted and encoded as HOA.

    ADAPTIVE AUDIO FEEDBACK SYSTEM AND METHOD
    5.
    发明申请
    ADAPTIVE AUDIO FEEDBACK SYSTEM AND METHOD 审中-公开
    自适应音频反馈系统和方法

    公开(公告)号:WO2011087953A1

    公开(公告)日:2011-07-21

    申请号:PCT/US2011/020350

    申请日:2011-01-06

    CPC classification number: G06F3/167 G06F3/0482 G10L13/00 G10L15/00

    Abstract: Various techniques for adaptively varying audio feedback data on an electronic device 10 are provided. In one embodiment, an audio user interface 38 implementing certain aspects of the present disclosure may devolve or evolve the verbosity of audio feedback in response to user interface events based at least partially upon the verbosity level of audio feedback provided during previous occurrences of the user interface event. In another embodiment, an audio user interface 38 may be configured to vary the verbosity of audio feedback associated with a navigable list of items based at least partially upon the speed at which a user navigates the list. In a further embodiment, an audio user interface 38 may be configured to vary audio feedback verbosity based upon the contextual importance of a user interface event. Electronic devices 10 implementing the present techniques provide an improved user experience with regard to audio user interfaces.

    Abstract translation: 提供了用于在电子设备10上自适应地改变音频反馈数据的各种技术。 在一个实施例中,实现本公开的某些方面的音频用户界面38可以至少部分地基于在先前出现的用户界面中提供的音频反馈的详细级别来响应于用户界面事件来排放或演变音频反馈的冗长度 事件。 在另一个实施例中,音频用户接口38可以被配置为至少部分地基于用户浏览列表的速度来改变与可导航的项目列表相关联的音频反馈的冗长度。 在另一个实施例中,音频用户接口38可以被配置为基于用户界面事件的上下文重要性来改变音频反馈冗长度。 实现本技术的电子设备10提供了关于音频用户界面的改进的用户体验。

    TACTILE FEEDBACK IN AN ELECTRONIC DEVICE
    6.
    发明申请
    TACTILE FEEDBACK IN AN ELECTRONIC DEVICE 审中-公开
    电子设备中的触发反馈

    公开(公告)号:WO2009085378A1

    公开(公告)日:2009-07-09

    申请号:PCT/US2008/080938

    申请日:2008-10-23

    Abstract: An electronic device for providing tactile feedback is provided. The electronic device may provide tactile feedback using any suitable approach, including for example vibration, heat, electrical, visual, or any other type of feedback. The electronic device may provide tactile feedback in response to detecting any particular status of the electronic device, receiving any particular input, or detecting any suitable communication received by the electronic device. For example, the electronic device may provide tactile feedback in response to identifying the current network of the device, the status of a particular electronic device component, or any other electronic device status. As another example, the electronic device may provide tactile feedback in response to receiving a particular type of communication, or in response to receiving a communication from a particular contact. As still another example, the electronic device may provide tactile feedback in response to receiving a particular user input, or to detecting a user's finger on a particular portion of the electronic device.

    Abstract translation: 提供了一种用于提供触觉反馈的电子设备。 电子设备可以使用任何合适的方法提供触觉反馈,包括例如振动,热,电,视觉或任何其他类型的反馈。 电子设备可以响应于检测电子设备的任何特定状态,接收任何特定输入或检测由电子设备接收的任何合适的通信来提供触觉反馈。 例如,电子设备可以响应于识别设备的当前网络,特定电子设备组件的状态或任何其他电子设备状态来提供触觉反馈。 作为另一示例,电子设备可以响应于接收到特定类型的通信或响应于从特定联系人接收到通信而提供触觉反馈。 作为另一示例,电子设备可以响应于接收特定用户输入或者在电子设备的特定部分上检测用户的手指来提供触觉反馈。

    USER-SPECIFIC NOISE SUPPRESSION FOR VOICE QUALITY IMPROVEMENTS
    7.
    发明申请
    USER-SPECIFIC NOISE SUPPRESSION FOR VOICE QUALITY IMPROVEMENTS 审中-公开
    用户特定的噪音抑制声音质量改进

    公开(公告)号:WO2011152993A1

    公开(公告)日:2011-12-08

    申请号:PCT/US2011/037014

    申请日:2011-05-18

    CPC classification number: G10L21/0208

    Abstract: Systems, methods, and devices for user-specific noise suppression are provided. For example, when a voice-related feature of an electronic device (10) is in use, the electronic device (10) may receive an audio signal that includes a user voice. Since noise, such as ambient sounds (60), also may be received by the electronic device (10) at this time, the electronic device (10) may suppress such noise in the audio signal. In particular, the electronic device (10) may suppress the noise in the audio signal while substantially preserving the user voice via user-specific noise suppression parameters. These user-specific noise suppression parameters may be based at least in part on a user noise suppression preference or a user voice profile, or a combination thereof.

    Abstract translation: 提供了用于用户特定噪声抑制的系统,方法和设备。 例如,当使用电子设备(10)的语音相关特征时,电子设备(10)可以接收包括用户语音的音频信号。 由于此时电子设备(10)也可以接收诸如环境声音(60)的噪声,所以电子设备(10)可以抑制音频信号中的这种噪声。 特别地,电子设备(10)可以抑制音频信号中的噪声,同时通过用户特定的噪声抑制参数基本上保留用户语音。 这些用户特定噪声抑制参数可以至少部分地基于用户噪声抑制偏好或用户语音简档或其组合。

    SYNCHRONIZATION OF BUFFERED AUDIO DATA WITH LIVE BROADCAST
    8.
    发明申请
    SYNCHRONIZATION OF BUFFERED AUDIO DATA WITH LIVE BROADCAST 审中-公开
    缓冲音频数据与实时广播同步

    公开(公告)号:WO2011019946A1

    公开(公告)日:2011-02-17

    申请号:PCT/US2010/045363

    申请日:2010-08-12

    Abstract: Various techniques relating to the buffering of a live audio broadcast on an electronic device 10 and the subsequently playback the buffered data are provided. In one embodiment, the playback speed of the buffered data may be increased relative to the actual speed at which the data was originally broadcasted 126. If the buffered playback (using the increased playback speed) synchronizes or catches up to the live broadcast, the electronic device may disable buffering and output the live stream instead 128. This decreases processing demands by lowering processing cycles required for buffering (encoding, etc.) and playback of the buffered data (decoding, etc.), thereby reducing power consumption.

    Abstract translation: 提供了与电子设备10上的实时音频广播的缓冲有关的各种技术,并且随后重放缓冲的数据。 在一个实施例中,缓冲数据的重放速度可以相对于最初广播数据的实际速度而增加126.如果缓冲播放(使用增加的播放速度)同步或赶上直播,则电子 设备可以禁用缓冲并输出直播流128.这通过降低缓冲(编码等)和缓冲数据的回放(解码等)所需的处理周期来降低处理需求,从而降低功耗。

    ADAPTIVE AUDIO FEEDBACK SYSTEM AND METHOD
    10.
    发明公开
    ADAPTIVE AUDIO FEEDBACK SYSTEM AND METHOD 审中-公开
    系统和方法自适应音频反馈

    公开(公告)号:EP2524298A1

    公开(公告)日:2012-11-21

    申请号:EP11701893.7

    申请日:2011-01-06

    Applicant: Apple Inc.

    CPC classification number: G06F3/167 G06F3/0482 G10L13/00 G10L15/00

    Abstract: Various techniques for adaptively varying audio feedback data on an electronic device 10 are provided. In one embodiment, an audio user interface 38 implementing certain aspects of the present disclosure may devolve or evolve the verbosity of audio feedback in response to user interface events based at least partially upon the verbosity level of audio feedback provided during previous occurrences of the user interface event. In another embodiment, an audio user interface 38 may be configured to vary the verbosity of audio feedback associated with a navigable list of items based at least partially upon the speed at which a user navigates the list. In a further embodiment, an audio user interface 38 may be configured to vary audio feedback verbosity based upon the contextual importance of a user interface event. Electronic devices 10 implementing the present techniques provide an improved user experience with regard to audio user interfaces.

Patent Agency Ranking