MEDIA/VOICE BINDING PROTOCOL AND RELATED USER INTERFACES
    1.
    发明申请
    MEDIA/VOICE BINDING PROTOCOL AND RELATED USER INTERFACES 审中-公开
    媒体/语音绑定协议及相关用户界面

    公开(公告)号:WO2012021529A1

    公开(公告)日:2012-02-16

    申请号:PCT/US2011/047101

    申请日:2011-08-09

    Abstract: One or more media items can be bound to a voice call using a binding protocol. The binding protocol allows call participants to more easily transfer media items to other call participants using one or more user interfaces. A call participant can initiate a media transfer by selecting the media and a communication modality for transferring the media. The binding protocol can be active or lazy. In lazy binding, the call participant can select the desired media for transfer before the voice call is established, and subsequently mark the media for binding with the voice call. In active binding, the call participant can select and transfer the desired media item during the voice call, and the media item is automatically bound to the voice call. The media item can be transferred using a user-selected communication modality over an independent data communication channel.

    Abstract translation: 可以使用绑定协议将一个或多个媒体项目绑定到语音呼叫。 绑定协议允许呼叫参与者使用一个或多个用户界面更容易地将媒体项目传送到其他呼叫参与者。 呼叫参与者可以通过选择媒体和用于传送媒体的通信模式来发起媒体传送。 绑定协议可以是活动的或懒惰的。 在延迟绑定中,呼叫参与方可以在语音呼叫建立之前选择所需要的媒体进行传输,随后标记媒体用于与语音呼叫绑定。 在主动绑定中,呼叫参与者可以在语音呼叫期间选择和传送所需的媒体项目,并且媒体项目被自动绑定到语音呼叫。 可以通过独立的数据通信信道使用用户选择的通信模式传送媒体项目。

    SYSTEMS AND METHODS FOR NAME PRONUNCIATION
    2.
    发明申请
    SYSTEMS AND METHODS FOR NAME PRONUNCIATION 审中-公开
    用于名字发音的系统和方法

    公开(公告)号:WO2013130878A2

    公开(公告)日:2013-09-06

    申请号:PCT/US2013/028412

    申请日:2013-02-28

    Applicant: APPLE INC.

    Inventor: NAIK, Devang K.

    CPC classification number: G10L13/08 G10L13/086 G10L15/187

    Abstract: Systems and methods are provided for associating a phonetic pronunciation with a name by receiving the name, mapping the name to a plurality of monosyllabic components that are combinable to construct the phonetic pronunciation of the name, receiving a user input to select one or more of the plurality, and combining the selected one or more of the plurality of monosyllabic components to construct the phonetic pronunciation of the name.

    Abstract translation: 提供了系统和方法,用于通过接收名称来将语音发音与名称相关联,将该名称映射到可组合以构建名称的语音发音的多个单音节组件,接收用户 输入以选择所述多个单词中的一个或多个,并且组合所选择的多个单音节成分中的一个或多个单音节成分以构建该名字的语音发音。

    ROBUST END-POINTING OF SPEECH SIGNALS USING SPEAKER RECOGNITION
    4.
    发明公开
    ROBUST END-POINTING OF SPEECH SIGNALS USING SPEAKER RECOGNITION 审中-公开
    使用扬声器识别进行语音信号的鲁棒性端点定位

    公开(公告)号:EP3158561A1

    公开(公告)日:2017-04-26

    申请号:EP15723406.3

    申请日:2015-05-06

    Applicant: Apple Inc.

    Abstract: Systems and processes for robust end-pointing of speech signals using speaker recognition are provided. In one example process, a stream of audio having a spoken user request can be received. A first likelihood that the stream of audio includes user speech can be determined. A second likelihood that the stream of audio includes user speech spoken by an authorized user can be determined. A start-point or an end-point of the spoken user request can be determined based at least in part on the first likelihood and the second likelihood.

    Abstract translation: 提供了使用说话者识别来稳健地结束语音信号的系统和过程。 在一个示例过程中,可以接收具有口头用户请求的音频流。 可以确定音频流包括用户语音的第一种可能性。 可以确定音频流包括授权用户说出的用户语音的第二种可能性。 可以至少部分地基于第一可能性和第二可能性来确定口头用户请求的起点或终点。

    SYSTEMS AND METHODS FOR NAME PRONUNCIATION
    7.
    发明公开
    SYSTEMS AND METHODS FOR NAME PRONUNCIATION 有权
    系统方法ZUR BESTIMMUNG DER AUSSPRACHE VON NAMEN

    公开(公告)号:EP2815397A2

    公开(公告)日:2014-12-24

    申请号:EP13709284.7

    申请日:2013-02-28

    Applicant: Apple Inc.

    Inventor: NAIK, Devang K.

    CPC classification number: G10L13/08 G10L13/086 G10L15/187

    Abstract: A method comprising: providing a plurality of pronunciation guessers, each of the plurality of pronunciation guessers being associated with a respective phonetic alphabet of a language or a locale; determining a user language or a user locale; associating a first phonetic alphabet with the user language or the user locale; receiving at each pronunciation guesser a representation of a name; guessing, at each pronunciation guesser, a phonetic pronunciation of one or more components of the name; mapping the phonetic pronunciation of the one or more components of the name guessed by each of the plurality of pronunciation guessers to the first phonetic alphabet to generate a list of guessed pronunciations; receiving an audio pronunciation of the name; and selecting a combination of components from the list of guessed pronunciations that, when pronounced, substantially matches the audio pronunciation of the name.

    Abstract translation: 一种方法,包括:提供多个发音猜测器,所述多个发音猜测器中的每一个与语言或语言环境的相应语音字母表相关联; 确定用户语言或用户区域设置; 将第一个语音字母与用户语言或用户区域相关联; 在每个发音猜测器接收一个名字的表示; 在每个发音猜测器上猜测一个或多个组成部分的语音发音; 将由多个发音猜测器中的每一个猜测的名称的一个或多个组件的语音发音映射到第一语音字母表以生成猜测发音的列表; 接收该名称的音频发音; 并且从被猜测的发音的列表中选择组件的组合,当发音时,与该名称的音频发音基本匹配。

Patent Agency Ranking