Patent search ap:("APPLE INC.") AND inv:"KIM Page Yoon"

1.

Invention application
FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES (Under examination, published)

Publication (announcement) No.: WO2018213415A1

Publication (announcement) date: 2018-11-22

Application No.: PCT/US2018/032919

Application date: 2018-05-16

Applicant: APPLE INC.

Inventor: KIM, Yoon; SRISUWANANUKORN, Charles; CARSON, David A.; GRUBER, Thomas R.; BINDER, Justin G.

IPC: G10L15/30, G10L15/22, G06F3/16

Abstract: Systems and processes for operating an intelligent automated assistant to provide extension of digital assistant services are provided. An example method includes, at an electronic device having one or more processors, receiving, from a first user, a first speech input representing a user request. The method further includes obtaining an identity of the first user; and in accordance with the user identity, providing a representation of the user request to at least one of a second electronic device or a third electronic device. The method further includes receiving, based on a determination of whether the second electronic device or the third electronic device, or both, is to provide the response to the first electronic device, the response to the user request from the second electronic device or the third electronic device. The method further includes providing a representation of the response to the first user.

2.

Invention application
DETECTING A TRIGGER OF A DIGITAL ASSISTANT (Under examination, published)

Publication (announcement) No.: WO2018212953A1

Publication (announcement) date: 2018-11-22

Application No.: PCT/US2018/029474

Application date: 2018-04-25

Applicant: APPLE INC.

Inventor: KIM, Yoon; BRIDLE, John; ATKINS, Joshua D.; LI, Feipeng; SOUDEN, Mehrez

IPC: G10L15/22, G10L15/30, G10L21/0216, H04R3/00, G10L25/51, G10L15/18

Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, forgoing initiating a session of the digital assistant.

3.

Invention application
ZERO LATENCY DIGITAL ASSISTANT (Under examination, published)

Publication (announcement) No.: WO2017044160A1

Publication (announcement) date: 2017-03-16

Application No.: PCT/US2016/031550

Application date: 2016-05-09

Applicant: APPLE INC.; STASIOR, William F.; CARSON, David; DASARI, Rohit; KIM, Yoon

Inventor: STASIOR, William F.; CARSON, David; DASARI, Rohit; KIM, Yoon

IPC: G06F15/16, G06F17/00, G06F17/20, G06F17/30, G06F19/00, G10L15/00

CPC classification number: G06F3/167, G06F3/038, G06F3/0481, G06F3/0604, G06F3/0656, G06F3/0673, G10L15/22, G10L15/285, G10L15/32, G10L2015/088, G10L2015/223, H04M2201/40, H04M2250/74

Abstract: An electronic device can implement a zero-latency digital assistant by capturing audio input from a microphone and using a first processor to write audio data representing the captured audio input to a memory buffer. In response to detecting a user input while capturing the audio input, the device can determine whether the user input meets a predetermined criteria. If the user input meets the criteria, the device can use a second processor to identify and execute a task based on at least a portion of the contents of the memory buffer.
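The buffering idea in this abstract can be illustrated with a small sketch. It is not the implementation described in the application: the class and function names are hypothetical, the two processors are modeled as ordinary function calls, and the "predetermined criteria" is assumed here to be a minimum button-hold time.

```python
from collections import deque


class AudioRingBuffer:
    """Fixed-capacity buffer; the oldest frames are dropped once it is full."""

    def __init__(self, capacity):
        self._frames = deque(maxlen=capacity)

    def write(self, frame):
        # Stands in for the "first processor" continuously writing audio data.
        self._frames.append(frame)

    def snapshot(self):
        # Copy of the current buffer contents, oldest frame first.
        return list(self._frames)


def meets_criteria(press_duration_s, min_hold_s=0.5):
    # Hypothetical criterion: the user held a button for at least min_hold_s.
    return press_duration_s >= min_hold_s


def handle_user_input(buffer, press_duration_s, recognizer):
    """Stands in for the "second processor": act on buffered audio only
    when the user input meets the criterion, otherwise do nothing."""
    if not meets_criteria(press_duration_s):
        return None
    return recognizer(buffer.snapshot())
```

Because audio is already in the buffer when the user input arrives, the recognizer (any callable in this sketch) can start from speech captured before the input was detected, which is where the latency saving would come from.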

4.

Invention application
DYNAMIC THRESHOLDS FOR ALWAYS LISTENING SPEECH TRIGGER (Under examination, published)

Publication (announcement) No.: WO2016039992A1

Publication (announcement) date: 2016-03-17

Application No.: PCT/US2015/047064

Application date: 2015-08-27

Applicant: APPLE INC.

Inventor: KIM, Yoon; GRUBER, Thomas R.; BRIDLE, John

IPC: G10L15/20

CPC classification number: G06F3/167, G10L15/20, G10L15/22, G10L2015/223

Abstract: Systems and processes are disclosed for dynamically adjusting a speech trigger threshold, which can be used in triggering a virtual assistant. Audio input can be received via a microphone. The received audio input can be sampled, and a confidence level can be determined of whether the sampled audio input includes a portion of a spoken trigger. In response to the confidence level exceeding a threshold, a virtual assistant can be triggered to receive a user command from the audio input. The threshold can be dynamically adjusted in response to perceived events (e.g., events indicating a user may be more or less likely to initiate speech interactions, events indicating a trigger may be difficult to detect, events indicating a trigger was missed, etc.), thereby minimizing both missed triggers and false positive triggering events.
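As a rough illustration of the threshold adjustment this abstract describes, the sketch below keeps a confidence threshold that perceived events nudge up or down within fixed bounds. The event names, step sizes, and bounds are all assumptions made for the example; the abstract does not specify them.

```python
class TriggerDetector:
    """Confidence threshold that perceived events raise or lower."""

    # Hypothetical events and step sizes; negative values lower the
    # threshold (easier to trigger), positive values raise it.
    EVENT_DELTAS = {
        "user_likely_to_speak": -0.10,
        "trigger_hard_to_detect": -0.05,
        "trigger_missed": -0.10,
        "user_unlikely_to_speak": 0.10,
    }

    def __init__(self, base=0.7, min_threshold=0.4, max_threshold=0.95):
        self.threshold = base
        self.min_threshold = min_threshold
        self.max_threshold = max_threshold

    def on_event(self, event):
        # Unknown events leave the threshold unchanged.
        delta = self.EVENT_DELTAS.get(event, 0.0)
        # Clamp so repeated events cannot push the threshold out of range.
        self.threshold = min(
            self.max_threshold, max(self.min_threshold, self.threshold + delta)
        )

    def should_trigger(self, confidence):
        # Trigger only when the confidence exceeds the current threshold.
        return confidence > self.threshold
```

With these assumed values, a confidence of 0.65 does not trigger the assistant at the starting threshold, but does after a single "trigger_missed" event lowers the threshold.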

5.

Invention application
PROVIDING AN AUDITORY-BASED INTERFACE OF A DIGITAL ASSISTANT (Under examination, published)

Publication (announcement) No.: WO2018212861A1

Publication (announcement) date: 2018-11-22

Application No.: PCT/US2018/027363

Application date: 2018-04-12

Applicant: APPLE INC.

Inventor: PIERCY, Aimee; IRANI, Cyrus Daniel; GRAHAM, David Chance; COFFMAN, Patrick L.; KIM, Yoon

IPC: G10L15/22, G10L13/033, G10L13/027, G10L15/30

Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors and memory, receiving a natural-language speech input indicative of a request to the digital assistant; obtaining, by the digital assistant, context information; determining, by the digital assistant, a text-to-speech mode from a plurality of text-to-speech modes based on the obtained context information; and providing, by the digital assistant, an audio output with the determined text-to-speech mode, where the audio output is indicative of a speech response to the user request.

6.

Invention application
PROVIDING AN INDICATION OF THE SUITABILITY OF SPEECH RECOGNITION (Under examination, published)

Publication (announcement) No.: WO2016053530A1

Publication (announcement) date: 2016-04-07

Application No.: PCT/US2015/047553

Application date: 2015-08-28

Applicant: APPLE INC.

Inventor: KIM, Yoon

IPC: G10L15/22

CPC classification number: G10L15/01, G10L15/22, G10L25/60, H04R29/008

Abstract: This relates to providing an indication of the suitability of an acoustic environment for performing speech recognition. One process can include receiving an audio input and determining a speech recognition suitability based on the audio input. The speech recognition suitability can include a numerical, textual, graphical, or other representation of the suitability of an acoustic environment for performing speech recognition. The process can further include displaying a visual representation of the speech recognition suitability to indicate the likelihood that a spoken user input will be interpreted correctly. This allows a user to determine whether to proceed with the performance of a speech recognition process, or to move to a different location having a better acoustic environment before performing the speech recognition process. In some examples, the user device can disable operation of a speech recognition process in response to determining that the speech recognition suitability is below a threshold suitability.
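One way to picture the suitability indication is sketched below. The abstract does not say how suitability is computed, so this example assumes a simple mapping from background-noise level (RMS of normalized samples) to a score between 0 and 1; the function names, reference levels, and cut-offs are hypothetical.

```python
import math


def rms(samples):
    """Root-mean-square level of a non-empty list of samples in [-1, 1]."""
    if not samples:
        raise ValueError("samples must not be empty")
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def suitability_score(noise_samples, quiet_rms=0.01, loud_rms=0.3):
    """Map the noise level to a score: 1.0 is quiet, 0.0 is very noisy.
    quiet_rms and loud_rms are assumed reference levels."""
    frac = (rms(noise_samples) - quiet_rms) / (loud_rms - quiet_rms)
    return 1.0 - min(1.0, max(0.0, frac))


def describe(score, disable_below=0.2):
    """Textual representation; below the cut-off, recognition is disabled."""
    if score < disable_below:
        return "disabled"
    if score < 0.6:
        return "poor"
    return "good"
```

A user interface could show the returned label (or a graphical equivalent) before the user speaks, and refuse to start recognition when the label is "disabled".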

7.

Invention application
SPEAKER IDENTIFICATION AND UNSUPERVISED SPEAKER ADAPTATION TECHNIQUES (Under examination, published)

Publication (announcement) No.: WO2016053523A1

Publication (announcement) date: 2016-04-07

Application No.: PCT/US2015/047281

Application date: 2015-08-27

Applicant: APPLE INC.

Inventor: KIM, Yoon; KAJAKEKAR, Sachin S.

IPC: G10L17/04, G10L17/06, G10L15/18, G10L15/26

CPC classification number: G10L17/26, G10L15/1822, G10L15/26, G10L17/04, G10L17/06

Abstract: Systems and processes for generating a speaker profile for use in performing speaker identification for a virtual assistant are provided. One example process can include receiving an audio input including user speech and determining whether a speaker of the user speech is a predetermined user based on a speaker profile for the predetermined user. In response to determining that the speaker of the user speech is the predetermined user, the user speech can be added to the speaker profile and operation of the virtual assistant can be triggered. In response to determining that the speaker of the user speech is not the predetermined user, the user speech can be added to an alternate speaker profile and operation of the virtual assistant may not be triggered. In some examples, contextual information can be used to verify results produced by the speaker identification process.
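The profile-update flow in this abstract can be sketched as follows. This is only an illustration under stated assumptions: each utterance is assumed to arrive as a fixed-length feature vector, a profile is modeled as a running mean of those vectors, and the match test is a cosine-similarity threshold. None of these choices comes from the abstract.

```python
import math


class SpeakerProfile:
    """Running mean of the feature vectors added so far."""

    def __init__(self, dim):
        self.mean = [0.0] * dim
        self.count = 0

    def add(self, vec):
        # Incremental mean update: no need to store past utterances.
        self.count += 1
        self.mean = [m + (v - m) / self.count for m, v in zip(self.mean, vec)]


def cosine(a, b):
    dot = sum(x * y for x, y in zip(a, b))
    na = math.sqrt(sum(x * x for x in a))
    nb = math.sqrt(sum(y * y for y in b))
    return dot / (na * nb) if na and nb else 0.0


def process_utterance(vec, user_profile, alternate_profile, threshold=0.8):
    """Return True (trigger the assistant) if vec matches the user profile.
    Either way, the utterance is added to the profile it was assigned to."""
    is_user = cosine(vec, user_profile.mean) >= threshold
    (user_profile if is_user else alternate_profile).add(vec)
    return is_user
```

Adding every utterance to one of the two profiles is what makes the adaptation unsupervised in this sketch: the profiles keep improving without the user explicitly enrolling new samples.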

8.

Invention publication
DETECTING A TRIGGER OF A DIGITAL ASSISTANT (Under examination, published)

Publication (announcement) No.: EP3570277A3

Publication (announcement) date: 2020-01-01

Application No.: EP19182046.3

Application date: 2018-04-25

Applicant: Apple Inc.

Inventor: KIM, Yoon; BRIDLE, John; ATKINS, Joshua D.; LI, Feipeng; SOUDEN, Mehrez

IPC: G10L15/22, G10L15/30, G10L21/0216, H04R3/00, G10L25/51, G10L15/18, H04R27/00

Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, forgoing initiating a session of the digital assistant.

9.

Invention publication
DETECTING A TRIGGER OF A DIGITAL ASSISTANT (Under examination, published)

Publication (announcement) No.: EP3570277A2

Publication (announcement) date: 2019-11-20

Application No.: EP19182046.3

Application date: 2018-04-25

Applicant: Apple Inc.

Inventor: KIM, Yoon; BRIDLE, John; ATKINS, Joshua D.; LI, Feipeng; SOUDEN, Mehrez

IPC: G10L15/22, G10L15/30, G10L21/0216, H04R3/00, G10L25/51, G10L15/18, H04R27/00

Abstract: Systems and processes for operating an intelligent automated assistant are provided. In accordance with one example, a method includes, at an electronic device with one or more processors, memory, and a plurality of microphones, sampling, at each of the plurality of microphones of the electronic device, an audio signal to obtain a plurality of audio signals; processing the plurality of audio signals to obtain a plurality of audio streams; and determining, based on the plurality of audio streams, whether any of the plurality of audio signals corresponds to a spoken trigger. The method further includes, in accordance with a determination that the plurality of audio signals corresponds to the spoken trigger, initiating a session of the digital assistant; and in accordance with a determination that the plurality of audio signals does not correspond to the spoken trigger, forgoing initiating a session of the digital assistant.

10.

Invention publication
FAR-FIELD EXTENSION FOR DIGITAL ASSISTANT SERVICES (Under examination, published)

Publication (announcement) No.: EP3459076A1

Publication (announcement) date: 2019-03-27

Application No.: EP18732187.2

Application date: 2018-05-16

Applicant: Apple Inc.

Inventor: KIM, Yoon; SRISUWANANUKORN, Charles; CARSON, David A.; GRUBER, Thomas R.; BINDER, Justin G.

IPC: G10L15/30, G10L15/22, G06F3/16

Abstract: Systems and processes for operating an intelligent automated assistant to provide extension of digital assistant services are provided. An example method includes, at an electronic device having one or more processors, receiving, from a first user, a first speech input representing a user request. The method further includes obtaining an identity of the first user; and in accordance with the user identity, providing a representation of the user request to at least one of a second electronic device or a third electronic device. The method further includes receiving, based on a determination of whether the second electronic device or the third electronic device, or both, is to provide the response to the first electronic device, the response to the user request from the second electronic device or the third electronic device. The method further includes providing a representation of the response to the first user.
