-
71.
Publication No.: US20240321277A1
Publication date: 2024-09-26
Application No.: US18677629
Filing date: 2024-05-29
Applicant: GOOGLE LLC
Inventor: Victor Carbune , Krishna Sapkota , Behshad Behzadi , Julia Proskurnia , Jacopo Sannazzaro Natta , Justin Lu , Magali Boizot-Roche , Marius Sajgalik , Nicolo D'Ercole , Zaheed Sabur , Luv Kothari
CPC classification number: G10L15/26 , G10L15/22 , G10L2015/223
Abstract: Implementations described herein relate to an application and/or automated assistant that can identify arrangement operations to perform for arranging text during speech-to-text operations—without a user having to expressly identify the arrangement operations. In some instances, a user that is dictating a document (e.g., an email, a text message, etc.) can provide a spoken utterance to an application in order to incorporate textual content. However, in some of these instances, certain corresponding arrangements are needed for the textual content in the document. The textual content that is derived from the spoken utterance can be arranged by the application based on an intent, vocalization features, and/or contextual features associated with the spoken utterance and/or a type of the application associated with the document, without the user expressly identifying the corresponding arrangements. In this way, the application can infer content arrangement operations from a spoken utterance that only specifies the textual content.
-
Publication No.: US12093609B2
Publication date: 2024-09-17
Application No.: US18388465
Filing date: 2023-11-09
Applicant: GOOGLE LLC
Inventor: Srikanth Pandiri , Luv Kothari , Behshad Behzadi , Zaheed Sabur , Domenico Carbotta , Akshay Kannan , Qi Wang , Gokay Baris Gultekin , Angana Ghosh , Xu Liu , Yang Lu , Steve Cheng
IPC: G06F3/048 , G06F3/0481 , G06F3/0484 , G06F3/04886 , G06F3/16 , G06F40/117 , G06F40/143 , G06F40/174 , G06F40/30 , G10L15/22 , G10L15/26
CPC classification number: G06F3/167 , G06F3/0481 , G06F3/0484 , G06F3/04886 , G06F40/117 , G06F40/143 , G06F40/174 , G06F40/30 , G10L15/22 , G10L15/26
Abstract: Implementations set forth herein relate to an automated assistant that can selectively determine whether to incorporate a verbatim interpretation of portions of spoken utterances into an entry field and/or incorporate synonymous content into the entry field. For instance, a user can be accessing an interface that provides an entry field (e.g., address field) for receiving user input. In order to provide input for the entry field, the user can select the entry field and/or access a GUI keyboard to initialize an automated assistant for assisting with filling the entry field. Should the user provide a spoken utterance, the user can elect to provide a spoken utterance that embodies the intended input (e.g., an actual address) or a reference to the intended input (e.g., a name). In response to the spoken utterance, the automated assistant can fill the entry field with the intended input without necessitating further input from the user.
-
73.
Publication No.: US20240256611A1
Publication date: 2024-08-01
Application No.: US18635960
Filing date: 2024-04-15
Applicant: GOOGLE LLC
Inventor: Michael Schaer , Alexandru Tudor , Ori Gershony , Fredrik Bergenlid , Behshad Behzadi , Tomislav Grbin
IPC: G06F16/9032 , G06F3/0482 , G06F3/04842 , G06F16/25 , G06F16/9535 , H04L51/046 , H04L51/216
CPC classification number: G06F16/90324 , G06F3/0482 , G06F3/04842 , G06F16/252 , G06F16/9535 , H04L51/046 , H04L51/216
Abstract: Providing at least one contextually relevant suggestion to one or more users of an ongoing message exchange thread between the users. The suggestion is provided for presentation to the user(s) via user interface output device(s) of computing device(s) of the user(s). The suggestion indicates a query that can be submitted to an automated assistant to cause the automated assistant to incorporate, into the message exchange thread, content that is responsive to the query. In some implementations, the suggestion is a selectable suggestion and content that is responsive to the query is incorporated into the message exchange thread in response to user interface input that is directed to the selectable suggestion. In some implementations, the suggestion is determined based on one or more messages that have already been communicated between users of the message exchange thread.
-
Publication No.: US11893350B2
Publication date: 2024-02-06
Application No.: US17902543
Filing date: 2022-09-02
Applicant: GOOGLE LLC
Inventor: Nathan David Howard , Gabor Simko , Andrei Giurgiu , Behshad Behzadi , Marcin M. Nowak-Przygodzki
IPC: G06F40/284 , G06F16/903 , G06F16/901 , G06N5/02 , G10L15/22 , G10L25/51 , G10L15/08
CPC classification number: G06F40/284 , G06F16/9024 , G06F16/90335 , G06N5/02 , G10L15/08 , G10L15/22 , G10L25/51
Abstract: Methods, systems, and apparatus, including computer programs encoded on a computer storage medium, for detecting a continued conversation are disclosed. In one aspect, a method includes the actions of receiving first audio data of a first utterance. The actions further include obtaining a first transcription of the first utterance. The actions further include receiving second audio data of a second utterance. The actions further include obtaining a second transcription of the second utterance. The actions further include determining whether the second utterance includes a query directed to a query processing system based on analysis of the second transcription and the first transcription or a response to the first query. The actions further include configuring the data routing component to provide the second transcription of the second utterance to the query processing system as a second query or bypass routing the second transcription.
-
Publication No.: US11790207B2
Publication date: 2023-10-17
Application No.: US17982815
Filing date: 2022-11-08
Applicant: GOOGLE LLC
Inventor: Yariv Adan , Vladimir Vuskovic , Behshad Behzadi
IPC: G06N3/006 , G06Q10/0631 , G10L15/22 , G06Q10/02 , G06F3/16 , G06F16/332 , G10L13/00 , H04M3/493
CPC classification number: G06N3/006 , G06F3/167 , G06F16/3329 , G06Q10/02 , G06Q10/063114 , G10L15/22 , G10L13/00 , G10L2015/223 , H04M3/4936 , H04M2203/355
Abstract: An example method includes receiving, by a computational assistant executing at one or more processors, a representation of an utterance spoken at a computing device; identifying, based on the utterance, a task to be performed by the computational assistant; responsive to determining, by the computational assistant, that complete performance of the task will take more than a threshold amount of time, outputting, for playback by one or more speakers operably connected to the computing device, synthesized voice data that informs a user of the computing device that complete performance of the task will not be immediate; and performing, by the computational assistant, the task.
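Illustrative note: the threshold check described in this abstract can be sketched roughly as below. This is a minimal sketch, not the claimed method. The `Task` type, the `estimated_seconds` field, the 5-second default threshold, and the message wording are all hypothetical and do not come from the patent.

```python
from dataclasses import dataclass
from typing import List


@dataclass
class Task:
    name: str
    estimated_seconds: float  # hypothetical estimate of time to complete the task


def handle_task(task: Task, threshold_seconds: float = 5.0) -> List[str]:
    """Return the spoken outputs an assistant might produce for a task."""
    outputs = []
    # If the task is expected to exceed the threshold, first tell the user
    # that completion will not be immediate.
    if task.estimated_seconds > threshold_seconds:
        outputs.append(f"Working on {task.name}; this will take a moment.")
    # The task itself is performed either way (simulated here by a message).
    outputs.append(f"Done: {task.name}")
    return outputs
```

Under these assumptions, a long task such as `Task("book a table", 30.0)` yields a notice followed by the completion message, while a short task yields only the completion message.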
-
76.
Publication No.: US20230125662A1
Publication date: 2023-04-27
Application No.: US18086263
Filing date: 2022-12-21
Applicant: GOOGLE LLC
Inventor: Denis Burakov , Sergey Nazarov , Behshad Behzadi , Mario Bertschler , Bohdan Vlasyuk , Daniel Cotting , Michael Golikov , Lucas Mirelmann , Steve Cheng , Zaheed Sabur , Okan Kolak , Yan Zhong , Vinh Quoc Ly
Abstract: Implementations set forth herein allow a user to access a first application in a foreground of a graphical interface, and simultaneously employ an automated assistant to respond to notifications arising from a second application. The user can provide an input, such as a spoken utterance, while viewing the first application in the foreground in order to respond to notifications from the second application without performing certain intervening steps that can arise under certain circumstances. Such intervening steps can include providing a user confirmation, which can be bypassed, and/or time-limited according to a timer, which can be displayed in response to the user providing a responsive input directed at the notification. A period for the timer can be set according to one or more characteristics that are associated with the notification, the user, and/or any other information that can be associated with the user receiving the notification.
-
Publication No.: US20230013581A1
Publication date: 2023-01-19
Application No.: US17944712
Filing date: 2022-09-14
Applicant: GOOGLE LLC
Inventor: Marcin Nowak-Przygodzki , Jan Lamecki , Behshad Behzadi
Abstract: Techniques are described related to enabling automated assistants to enter into a “conference mode” in which they can “participate” in meetings between multiple human participants and perform various functions described herein. In various implementations, an automated assistant implemented at least in part on conference computing device(s) may be set to a conference mode in which the automated assistant performs speech-to-text processing on multiple distinct spoken utterances, provided by multiple meeting participants, without requiring explicit invocation prior to each utterance. The automated assistant may perform semantic processing on first text generated from the speech-to-text processing of one or more of the spoken utterances, and generate, based on the semantic processing, data that is pertinent to the first text. The data may be output to the participants at conference computing device(s). The automated assistant may later determine that the meeting has concluded, and may be set to a non-conference mode.
-
78.
Publication No.: US11545151B2
Publication date: 2023-01-03
Application No.: US17045273
Filing date: 2019-06-05
Applicant: Google LLC
Inventor: Denis Burakov , Sergey Nazarov , Behshad Behzadi , Mario Bertschler , Bohdan Vlasyuk , Daniel Cotting , Michael Golikov , Lucas Mirelmann , Steve Cheng , Zaheed Sabur , Okan Kolak , Yan Zhong , Vinh Quoc Ly
Abstract: Implementations set forth herein allow a user to access a first application in a foreground of a graphical interface, and simultaneously employ an automated assistant to respond to notifications arising from a second application. The user can provide an input, such as a spoken utterance, while viewing the first application in the foreground in order to respond to notifications from the second application without performing certain intervening steps that can arise under certain circumstances. Such intervening steps can include providing a user confirmation, which can be bypassed, and/or time-limited according to a timer, which can be displayed in response to the user providing a responsive input directed at the notification. A period for the timer can be set according to one or more characteristics that are associated with the notification, the user, and/or any other information that can be associated with the user receiving the notification.
-
79.
Publication No.: US20220092120A1
Publication date: 2022-03-24
Application No.: US17542042
Filing date: 2021-12-03
Applicant: GOOGLE LLC
Inventor: Michael Schaer , Alexandru Tudor , Ori Gershony , Fredrik Bergenlid , Behshad Behzadi , Tomislav Grbin
IPC: G06F16/9032 , G06F16/25 , G06F16/9535 , G06F3/0482 , G06F3/0484 , H04L12/58
Abstract: Providing at least one contextually relevant suggestion to one or more users of an ongoing message exchange thread between the users. The suggestion is provided for presentation to the user(s) via user interface output device(s) of computing device(s) of the user(s). The suggestion indicates a query that can be submitted to an automated assistant to cause the automated assistant to incorporate, into the message exchange thread, content that is responsive to the query. In some implementations, the suggestion is a selectable suggestion and content that is responsive to the query is incorporated into the message exchange thread in response to user interface input that is directed to the selectable suggestion. In some implementations, the suggestion is determined based on one or more messages that have already been communicated between users of the message exchange thread.
-
Publication No.: US11204927B2
Publication date: 2021-12-21
Application No.: US15815349
Filing date: 2017-11-16
Applicant: Google LLC
Inventor: Gökhan Hasan Bakir , Károly Csalogány , Behshad Behzadi
IPC: G06F7/00 , G06F16/2457 , G06F16/43 , G06F16/951
Abstract: Techniques for contextual search on multimedia content are provided. An example method includes extracting entities associated with multimedia content, wherein the entities include values characterizing one or more objects represented in the multimedia content, generating one or more query rewrite candidates based on the extracted entities and one or more terms in a query related to the multimedia content, providing the one or more query rewrite candidates to a search engine, scoring the one or more query rewrite candidates, ranking the scored one or more query rewrite candidates based on their respective scores, rewriting the query related to the multimedia content based on a particular ranked query rewrite candidate and providing for display, responsive to the query related to the multimedia content, a result set from the search engine based on the rewritten query.
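Illustrative note: the generate, score, rank, and rewrite sequence in this abstract can be sketched roughly as below. This is a simplified sketch under stated assumptions, not the patented technique. Entities are assumed to arrive as a hypothetical mapping from entity name to a relevance weight, a candidate is formed by appending the entity to the query, and the weight itself stands in for the score. The patent does not specify any of these choices.

```python
from typing import Dict, List, Tuple


def rank_candidates(query: str, entities: Dict[str, float]) -> List[Tuple[str, float]]:
    """Build one rewrite candidate per entity and rank them by score."""
    # Hypothetical candidate generation: append each extracted entity to the query.
    # Hypothetical scoring: reuse the entity's weight as the candidate's score.
    candidates = [(f"{query} {name}", weight) for name, weight in entities.items()]
    # Rank candidates from highest to lowest score.
    return sorted(candidates, key=lambda c: c[1], reverse=True)


def rewrite_query(query: str, entities: Dict[str, float]) -> str:
    """Return the top-ranked rewrite, or the original query if there are no entities."""
    ranked = rank_candidates(query, entities)
    return ranked[0][0] if ranked else query
```

For example, with the query "who directed this" and entities `{"Casablanca": 0.9, "piano": 0.2}`, the sketch picks the rewrite built from the higher-weighted entity. A real system would score candidates using search-engine signals rather than a fixed weight.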