-
Publication No.: US11670289B2
Publication date: 2023-06-06
Application No.: US17127394
Filing date: 2020-12-18
Applicant: Apple Inc.
Inventor: Thomas R. Gruber, Harry J. Saddler, Jerome Rene Bellegarda, Bryce H. Nyeggen, Alessandro Sabatelli
CPC classification number: G10L15/1822, G06F40/205, G10L15/1815, G10L15/26, G10L15/28, G10L2015/088, G10L2015/221, G10L2015/223, G10L2015/225, G10L2015/228
Abstract: Systems and processes are disclosed for handling a multi-part voice command for a virtual assistant. Speech input can be received from a user that includes multiple actionable commands within a single utterance. A text string can be generated from the speech input using a speech transcription process. The text string can be parsed into multiple candidate substrings based on domain keywords, imperative verbs, predetermined substring lengths, or the like. For each candidate substring, a probability can be determined indicating whether the candidate substring corresponds to an actionable command. Such probabilities can be determined based on semantic coherence, similarity to user request templates, querying services to determine manageability, or the like. If the probabilities exceed a threshold, the user intent of each substring can be determined, processes associated with the user intents can be executed, and an acknowledgment can be provided to the user.
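The parse-score-threshold flow described in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: the verb list, the connector list, the keyword templates, the overlap-based score standing in for a probability, and the 0.6 threshold are all hypothetical choices made for the example.

```python
# Hypothetical vocabulary; a real assistant would use much richer resources.
IMPERATIVE_VERBS = {"call", "text", "play", "set", "remind", "navigate"}
CONNECTORS = {"and", "then"}
# Hypothetical user request templates, reduced to keyword sets.
TEMPLATES = [{"call"}, {"text"}, {"play"}, {"set", "alarm"}, {"set", "timer"}]


def split_candidates(text):
    """Split a transcript into candidate substrings at imperative verbs."""
    candidates, current = [], []
    for word in text.lower().split():
        if word in IMPERATIVE_VERBS and current:
            candidates.append(current)
            current = []
        current.append(word)
    if current:
        candidates.append(current)
    cleaned = []
    for words in candidates:
        # Drop trailing connectors such as "and" left over from the split.
        while words and words[-1] in CONNECTORS:
            words = words[:-1]
        if words:
            cleaned.append(" ".join(words))
    return cleaned


def score(candidate):
    """Crude stand-in for a probability: best keyword overlap with a template."""
    words = set(candidate.split())
    return max(len(t & words) / len(t) for t in TEMPLATES)


def actionable_commands(text, threshold=0.6):
    """Return the candidate substrings whose score reaches the threshold."""
    return [c for c in split_candidates(text) if score(c) >= threshold]


print(actionable_commands("call mom and set an alarm for seven and play some jazz"))
```

Under these assumptions, "set the table" matches only half of the keywords of any "set" template, so it falls below the threshold and is not treated as an actionable command.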
-
Publication No.: US11636869B2
Publication date: 2023-04-25
Application No.: US17150513
Filing date: 2021-01-15
Applicant: Apple Inc.
Inventor: Justin Binder, Samuel D. Post, Onur Tackin, Thomas R. Gruber
IPC: G10L15/00, G10L21/16, G10L15/22, G06F3/16, G10L17/24, G10L15/26, G10L15/30, G10L25/51, G10L25/84, G10L15/02
Abstract: A method for operating a voice trigger is provided. In some implementations, the method is performed at an electronic device including one or more processors and memory storing instructions for execution by the one or more processors. The method includes receiving a sound input. The sound input may correspond to a spoken word or phrase, or a portion thereof. The method includes determining whether at least a portion of the sound input corresponds to a predetermined type of sound, such as a human voice. The method includes, upon a determination that at least a portion of the sound input corresponds to the predetermined type, determining whether the sound input includes predetermined content, such as a predetermined trigger word or phrase. The method also includes, upon a determination that the sound input includes the predetermined content, initiating a speech-based service, such as a voice-based digital assistant.
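The staged checks in this abstract (sound type first, trigger content second, service last) can be sketched as a simple cascade. This is only an illustration under stated assumptions: the RMS energy test is a crude stand-in for a real voice detector, and `transcribe`, `start_service`, the trigger phrase, and the energy threshold are hypothetical names and values introduced for the example.

```python
import math


def rms(samples):
    """Root-mean-square energy of a frame of audio samples."""
    return math.sqrt(sum(s * s for s in samples) / len(samples))


def looks_like_voice(samples, energy_threshold=0.1):
    """Stage 1 (assumed): a cheap energy check standing in for voice detection."""
    return bool(samples) and rms(samples) >= energy_threshold


def contains_trigger(transcript, trigger_phrase="hey assistant"):
    """Stage 2 (assumed): look for a predetermined trigger phrase."""
    return trigger_phrase in transcript.lower()


def process_sound_input(samples, transcribe, start_service):
    """Run the cascade; the costlier stage runs only if the cheaper one passes."""
    if not looks_like_voice(samples):
        return False
    if not contains_trigger(transcribe(samples)):
        return False
    start_service()
    return True


# Usage with hypothetical stand-ins for the recognizer and the assistant.
started = []
loud_frame = [0.5, -0.5, 0.4, -0.4]
result = process_sound_input(
    loud_frame,
    transcribe=lambda samples: "Hey Assistant, what time is it",
    start_service=lambda: started.append("assistant"),
)
print(result, started)
```

Ordering the checks from cheapest to most expensive means near-silent input is rejected before any transcription is attempted.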
-
Publication No.: US11151899B2
Publication date: 2021-10-19
Application No.: US14213852
Filing date: 2014-03-14
Applicant: Apple Inc.
Inventor: Donald W. Pitschel, Thomas R. Gruber
Abstract: The method includes receiving, from a user, a first speech input spoken in a first language; inferring a user intent based on at least the first speech input in the first language; based on the inferred user intent, generating one or more alternative expressions of the first speech input in the first language; and providing feedback to the user introducing the alternative expressions as a more preferred input to express the inferred user intent than the first speech input provided by the user.
-
Publication No.: US10748546B2
Publication date: 2020-08-18
Application No.: US16267146
Filing date: 2019-02-04
Applicant: Apple Inc.
Inventor: Yoon Kim, Charles Srisuwananukorn, David A. Carson, Thomas R. Gruber, Justin G. Binder
Abstract: Systems and processes for operating an intelligent automated assistant to provide extension of digital assistant services are provided. An example method includes, at an electronic device having one or more processors, receiving, from a first user, a first speech input representing a user request. The method further includes obtaining an identity of the first user; and in accordance with the user identity, providing a representation of the user request to at least one of a second electronic device or a third electronic device. The method further includes receiving, based on a determination of whether the second electronic device or the third electronic device, or both, is to provide the response to the first electronic device, the response to the user request from the second electronic device or the third electronic device. The method further includes providing a representation of the response to the first user.
-
Publication No.: US10705794B2
Publication date: 2020-07-07
Application No.: US13913421
Filing date: 2013-06-08
Applicant: Apple Inc.
Inventor: Thomas R. Gruber, Harry J. Saddler, Lia T. Napolitano, Emily Clark Schubert, Brian Conrad Sumner
Abstract: The method includes automatically, without user input and without regard to whether a digital assistant application has been separately invoked by a user, determining that the electronic device is in a vehicle. In some implementations, determining that the electronic device is in a vehicle comprises detecting that the electronic device is in communication with the vehicle (e.g., via wired or wireless communication techniques and/or protocols). The method also includes, responsive to the determining, invoking a listening mode of a virtual assistant implemented by the electronic device. In some implementations, the method also includes limiting the ability of a user to view visual output presented by the electronic device, provide typed input to the electronic device, and the like.
-
Publication No.: US10679605B2
Publication date: 2020-06-09
Application No.: US13913423
Filing date: 2013-06-08
Applicant: Apple Inc.
Inventor: Thomas R. Gruber, Harry J. Saddler, Lia T. Napolitano, Emily Clark Schubert, Brian Conrad Sumner
IPC: G10L13/00
Abstract: Systems and methods for providing hands-free reading of content comprising: identifying a plurality of data items for presentation to a user, the plurality of data items associated with a domain-specific item type and sorted according to a particular order; based on the domain-specific item type, generating a speech-based overview of the plurality of data items; for each of the plurality of data items, generating a respective speech-based, item-specific paraphrase for the data item based on respective content of the data item; and providing, to a user through the speech-enabled dialogue interface, the speech-based overview, followed by the respective speech-based, item-specific paraphrases for at least a subset of the plurality of data items in the particular order.
-
Publication No.: US10383568B2
Publication date: 2019-08-20
Application No.: US14871887
Filing date: 2015-09-30
Applicant: Apple Inc.
Inventor: Guy L. Tribble, Roy J. Raymann, Wren N. Dougherty, Divya Nag, Deborah M. Lambert, Stephanie Greer, Thomas R. Gruber
Abstract: In some implementations, a computing device can confirm a sleep determination for a user based on secondary indicia of user activity. For example, the computing device can be a user's primary computing device. The primary device can predict and/or determine when the user is sleeping based on the user's use (e.g., primary indicia), or lack of use, of the primary device. After the primary device determines that the user is sleeping, the primary device can confirm that the user is asleep based on secondary indicia of user activity. In some implementations, the secondary indicia can include user activity reported to the primary computing device by other secondary computing devices (e.g., a second user device, a household appliance, etc.). In some implementations, the secondary indicia can include user activity detected by sensors of the primary computing device (e.g., sound, light, movement, etc.).
-
Publication No.: US20190213060A1
Publication date: 2019-07-11
Application No.: US16353784
Filing date: 2019-03-14
Applicant: Apple Inc.
Inventor: Roy J. Raymann, Wren N. Dougherty, Divya Nag, Deborah M. Lambert, Stephanie M. Greer, Thomas R. Gruber
Abstract: In some implementations, a computing device may detect that a user of the computing device intends to sleep. The computing device may cause a reminder notification to be presented on a display of the computing device that prompts the user to prepare one or more secondary devices for sleep. The computing device may obtain, for each of the one or more secondary devices, a desired state for sleep specified by the user. The computing device may cause, for each of the one or more secondary devices, a current state to change to the desired state for sleep. In some implementations, the user activities may be detected by receiving sensor data from one or more sensor devices of the computing device and identifying the user activities based on the received sensor data. In some implementations, the computing device may automatically change the current state to the desired state for sleep.
-
Publication No.: US10169329B2
Publication date: 2019-01-01
Application No.: US15220276
Filing date: 2016-07-26
Applicant: Apple Inc.
Inventor: Richard L. Futrell, Thomas R. Gruber
Abstract: Systems and processes for exemplar-based natural language processing are provided. In one example process, a first text phrase can be received. It can be determined whether editing the first text phrase to match a second text phrase requires one or more of inserting, deleting, and substituting a word of the first text phrase. In response to determining that editing the first text phrase to match the second text phrase requires one or more of inserting, deleting, and substituting a word of the first text phrase, one or more of an insertion cost, a deletion cost, and a substitution cost can be determined. A semantic edit distance between the first text phrase and the second text phrase in a semantic space can be determined based on one or more of the insertion cost, the deletion cost, and the substitution cost.
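A word-level edit distance with semantically weighted costs, in the spirit of this abstract, might look like the sketch below. It is an illustration under assumptions rather than the claimed process: the two-dimensional toy vectors, the use of cosine distance as the substitution cost, and the constant insertion and deletion costs of 1.0 are all hypothetical.

```python
import math

# Hypothetical toy word vectors; a real system would use a trained semantic space.
VECTORS = {
    "show": (1.0, 0.0),
    "display": (0.9, 0.1),
    "weather": (0.0, 1.0),
    "forecast": (0.1, 0.9),
    "the": (0.5, 0.5),
}


def cosine_distance(a, b):
    """1 minus the cosine similarity of two vectors."""
    dot = sum(x * y for x, y in zip(a, b))
    norm = math.sqrt(sum(x * x for x in a)) * math.sqrt(sum(y * y for y in b))
    return 1.0 - dot / norm


def substitution_cost(w1, w2):
    """Assumed cost: 0 for identical words, cosine distance if both are known, else 1."""
    if w1 == w2:
        return 0.0
    if w1 in VECTORS and w2 in VECTORS:
        return cosine_distance(VECTORS[w1], VECTORS[w2])
    return 1.0


def semantic_edit_distance(first, second, insert_cost=1.0, delete_cost=1.0):
    """Dynamic-programming edit distance over words with weighted costs."""
    a, b = first.split(), second.split()
    dp = [[0.0] * (len(b) + 1) for _ in range(len(a) + 1)]
    for i in range(1, len(a) + 1):
        dp[i][0] = i * delete_cost
    for j in range(1, len(b) + 1):
        dp[0][j] = j * insert_cost
    for i in range(1, len(a) + 1):
        for j in range(1, len(b) + 1):
            dp[i][j] = min(
                dp[i - 1][j] + delete_cost,   # delete a word of the first phrase
                dp[i][j - 1] + insert_cost,   # insert a word of the second phrase
                dp[i - 1][j - 1] + substitution_cost(a[i - 1], b[j - 1]),
            )
    return dp[-1][-1]


print(semantic_edit_distance("show weather", "display forecast"))
print(semantic_edit_distance("show the weather", "show weather"))
```

With these toy vectors, swapping words for near-synonyms costs far less than inserting or deleting a word, which is the property that lets a new phrase be matched to a stored exemplar.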
-
Publication No.: US09966065B2
Publication date: 2018-05-08
Application No.: US14724623
Filing date: 2015-05-28
Applicant: Apple Inc.
Inventor: Thomas R. Gruber, Harry J. Saddler, Jerome Rene Bellegarda, Bryce H. Nyeggen, Alessandro Sabatelli
CPC classification number: G10L15/1822, G06F17/2705, G10L15/1815, G10L15/26, G10L15/28, G10L2015/088, G10L2015/221, G10L2015/223, G10L2015/225, G10L2015/228
Abstract: Systems and processes are disclosed for handling a multi-part voice command for a virtual assistant. Speech input can be received from a user that includes multiple actionable commands within a single utterance. A text string can be generated from the speech input using a speech transcription process. The text string can be parsed into multiple candidate substrings based on domain keywords, imperative verbs, predetermined substring lengths, or the like. For each candidate substring, a probability can be determined indicating whether the candidate substring corresponds to an actionable command. Such probabilities can be determined based on semantic coherence, similarity to user request templates, querying services to determine manageability, or the like. If the probabilities exceed a threshold, the user intent of each substring can be determined, processes associated with the user intents can be executed, and an acknowledgment can be provided to the user.