Patent search ap:("SRI International") AND inv:"Harry Bratt" Page 1

1.

发明授权
Method and apparatus for tailoring the output of an intelligent automated assistant to a user 有权

公开(公告)号：US09501743B2

公开(公告)日：2016-11-22

申请号：US14957286

申请日：2015-12-02

Applicant: SRI International

Inventor： Gokhan Tur , Horacio E. Franco , Elizabeth Shriberg , Gregory K. Myers , William S. Mark , Norman D. Winarsky , Andreas Stolcke , Bart Peintner , Michael J. Wolverton , Luciana Ferrer , Martin Graciarena , Neil Yorke-Smith , Harry Bratt

IPC: G06N5/00 , G06F1/00 , G06N5/04 , G06F9/44 , G06N5/02 , G06N7/00

CPC classification number: G06N7/005 , G06F9/453 , G06N5/022 , G06N5/04 , G06N99/005

Abstract: The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the user, making a set of inferences about the user in accordance with the data, and tailoring an output to be delivered to the user in accordance with the set of inferences.

2.

发明授权
Autonomous intelligent radio 有权

公开(公告)号：US11152016B2

公开(公告)日：2021-10-19

申请号：US16407009

申请日：2019-05-08

Applicant: SRI International

Inventor： Aaron D. Lawson , Harry Bratt , Mitchell L. McLaren , Martin Graciarena

IPC: G10L25/84 , H04L27/00 , H04H60/48 , H04H60/70 , G10L15/16 , G06N20/00 , G10L15/22 , G10L15/08 , G10L15/00

Abstract: Embodiments of the disclosed technologies include finding content of interest in an RF spectrum by automatically scanning the RF spectrum; detecting, in a range of frequencies of the RF spectrum that includes one or more undefined channels, a candidate RF segment; where the candidate RF segment includes a frequency-bound time segment of electromagnetic energy; executing a machine learning-based process to determine, for the candidate RF segment, signal characterization data indicative of one or more of: a frequency range, a modulation type, a timestamp; using the signal characterization data to determine whether audio contained in the candidate RF segment corresponds to a search criterion; in response to determining that the candidate RF segment corresponds to the search criterion, outputting, through an electronic device, data indicative of the candidate RF segment; where the data indicative of the candidate RF segment is output in a real-time time interval after the candidate RF segment is detected.

3.

发明授权
Vehicle personal assistant 有权
Title translation: 车辆个人助理

公开(公告)号：US09085303B2

公开(公告)日：2015-07-21

申请号：US13678213

申请日：2012-11-15

Applicant: SRI International

Inventor： Michael J. Wolverton , William S. Mark , Harry Bratt , Douglas A. Bercow

IPC: B60W50/10 , B60W50/08 , G10L15/22 , B60K35/00 , B60W40/08

CPC classification number: B60W50/10 , B60K35/00 , B60K37/06 , B60K2350/10 , B60K2350/1044 , B60K2350/1052 , B60K2350/2013 , B60W50/085 , B60W2040/089 , G10L15/22

Abstract: A vehicle personal assistant to engage a user in a conversational dialog about vehicle-related topics, such as those commonly found in a vehicle owner's manual, includes modules to interpret spoken natural language input, search a vehicle knowledge base and/or other data sources for pertinent information, and respond to the user's input in a conversational fashion. The dialog may be initiated by the user or more proactively by the vehicle personal assistant based on events that may be currently happening in relation to the vehicle. The vehicle personal assistant may use real-time inputs obtained from the vehicle and/or non-verbal inputs from the user to enhance its understanding of the dialog and assist the user in a variety of ways.

Abstract translation: 车辆个人助理将用户引入关于车辆相关主题（例如车辆用户手册中常见的那些）的对话对话中，包括解释口语自然语言输入，搜索车辆知识库和/或其他数据源的模块相关信息，并以对话方式响应用户的输入。该对话可由用户或由车辆个人助理根据当前可能相对于该车辆发生的事件主动发起。车辆个人助理可以使用从车辆获得的实时输入和/或来自用户的非语言输入来增强对对话的理解并以各种方式帮助用户。

4.

发明申请
VEHICLE PERSONAL ASSISTANT 有权

公开(公告)号：US20140136187A1

公开(公告)日：2014-05-15

申请号：US13678209

申请日：2012-11-15

Applicant: SRI INTERNATIONAL

Inventor： Michael J. Wolverton , William S. Mark , Harry Bratt , Douglas A. Bercow

IPC: G06F17/20

CPC classification number: G06F17/30654 , G10L15/22 , G10L2015/226 , G10L2015/227 , G10L2015/228

Abstract: A vehicle personal assistant to engage a user in a conversational dialog about vehicle-related topics, such as those commonly found in a vehicle owner's manual, includes modules to interpret spoken natural language input, search a vehicle knowledge base and/or other data sources for pertinent information, and respond to the user's input in a conversational fashion. The dialog may be initiated by the user or more proactively by the vehicle personal assistant based on events that may be currently happening in relation to the vehicle. The vehicle personal assistant may use real-time inputs obtained from the vehicle and/or non-verbal inputs from the user to enhance its understanding of the dialog and assist the user in a variety of ways.

5.

发明申请
CONTROLLABLE, NATURAL PARALINGUISTICS FOR TEXT TO SPEECH SYNTHESIS 有权

公开(公告)号：US20220406292A1

公开(公告)日：2022-12-22

申请号：US17776145

申请日：2020-12-29

Applicant: SRI International

Inventor： Harry Bratt , Colleen Richey , Maneesh Yadav

IPC: G10L13/10 , G10L13/047 , G06F40/40 , G06F40/169 , G06F40/117 , G10L13/033

Abstract: A speech recognition module receives training data of speech and creates a representation for individual words, non-words, phonemes, and any combination. A set of speech processing detectors analyze the training data of speech from humans communicating. The set of speech processing detectors detect speech parameters that are indicative of paralinguistic effects on top of enunciated words, phonemes, and non-words in the audio stream. One or more machine learning models undergo supervised machine learning on their neural network to train on how to associate one or more mark-up markers with a textual representation, for each individual word, individual non-word, individual phoneme, and any combinations of these, that was enunciated with a particular paralinguistic effect. Each mark-up marker can correspond to its own paralinguistic effect.

6.

发明授权
Real-time class recognition for an audio stream 有权

公开(公告)号：US11024291B2

公开(公告)日：2021-06-01

申请号：US16366751

申请日：2019-03-27

Applicant: SRI International

Inventor： Diego Castan Lavilla , Harry Bratt , Mitchell Leigh McLaren

IPC: G10L17/00 , G10L15/08 , G10L15/04 , G10L17/18 , G10L15/00 , G10L15/16 , G10L17/06

Abstract: In an embodiment, the disclosed technologies include automatically recognizing speech content of an audio stream that may contain multiple different classes of speech content, by receiving, by an audio capture device, an audio stream; outputting, by one or more classifiers, in response to an inputting to the one or more classifiers of digital data that has been extracted from the audio stream, score data; where a score of the score data indicates a likelihood that a particular time segment of the audio stream contains speech of a particular class; where the one or more classifiers use one or more machine-learned models that have been trained to recognize audio of one or more particular classes to determine the score data; using a sliding time window process, selecting particular scores from the score data; using the selected particular scores, determining and outputting one or more decisions as to whether one or more particular time segments of the audio stream contain speech of one or more particular classes; where the one or more decisions are outputted within a real-time time interval of the receipt of the audio stream; where the one or more decisions are used by downstream processing of the audio stream to control any one or more of the following: labeling the audio stream, segmenting the audio stream, diarizing the audio stream.

7.

发明申请
METHOD AND APPARATUS FOR TAILORING THE OUTPUT OF AN INTELLIGENT AUTOMATED ASSISTANT TO A USER 审中-公开
Title translation: 将智能自动化助理输出给用户的方法和装置

公开(公告)号：US20170061316A1

公开(公告)日：2017-03-02

申请号：US15352852

申请日：2016-11-16

Applicant: SRI INTERNATIONAL

Inventor： Gokhan Tur , Horacio E. Franco , Elizabeth Shriberg , Gregory K. Myers , William S. Mark , Norman D. Winarsky , Andreas Stolcke , Bart Peintner , Michael J. Wolverton , Luciana Ferrer , Martin Graciarena , Neil Yorke-Smith , Harry Bratt

IPC: G06N7/00 , G06N99/00

CPC classification number: G06N7/005 , G06F9/453 , G06N5/022 , G06N5/04 , G06N20/00

Abstract: The present invention relates to a method and apparatus for tailoring the output of an intelligent automated assistant. One embodiment of a method for conducting an interaction with a human user includes collecting data about the user using a multimodal set of sensors positioned in a vicinity of the user, making a set of inferences about the user in accordance with the data, and tailoring an output to be delivered to the user in accordance with the set of inferences.

Abstract translation: 本发明涉及一种用于定制智能自动化助理的输出的方法和装置。用于与人类用户进行交互的方法的一个实施例包括使用定位在用户附近的多模式传感器来收集关于用户的数据，根据数据做出关于用户的一组推断，并且定制输出将根据推导集合被传递给用户。

8.

发明申请
Method, System and Apparatus for Understanding and Generating Human Conversational Cues 有权

公开(公告)号：US20220115001A1

公开(公告)日：2022-04-14

申请号：US17418193

申请日：2020-05-07

Applicant: SRI International

Inventor： Harry Bratt , Kristin Precoda , Dimitra Vergyri

IPC: G10L13/10 , G10L15/22 , G10L15/18 , G10L25/63 , G10L13/027

Abstract: A voice-based digital assistant (VDA) uses a conversation intelligence (CI) manager module having a rule-based engine on conversational intelligence to process information from one or more modules to make determinations on both i) understanding the human conversational cues and ii) generating the human conversational cues, including at least understanding and generating a backchannel utterance, in a flow and exchange of human communication in order to at least one of grab or yield a conversational floor between a user and the VDA. The CI manager module uses the rule-based engine to analyze and make a determination on a conversational cue of, at least, prosody in a user's flow of speech to generate the backchannel utterance to signal any of i) an understanding, ii) a correction, iii) a confirmation, and iv) a questioning of verbal communications conveyed by the user in the flow of speech during a time frame when the user still holds the conversational floor.

9.

发明申请
REAL-TIME CLASS RECOGNITION FOR AN AUDIO STREAM 审中-公开

公开(公告)号：US20200160845A1

公开(公告)日：2020-05-21

申请号：US16366751

申请日：2019-03-27

Applicant: SRI International

Inventor： Diego Castan Lavilla , Harry Bratt , Mitchell Leigh McLaren

IPC: G10L15/08 , G10L15/04 , G10L17/06 , G10L15/00 , G10L15/16 , G10L17/18

Abstract: In an embodiment, the disclosed technologies include automatically recognizing speech content of an audio stream that may contain multiple different classes of speech content, by receiving, by an audio capture device, an audio stream; outputting, by one or more classifiers, in response to an inputting to the one or more classifiers of digital data that has been extracted from the audio stream, score data; where a score of the score data indicates a likelihood that a particular time segment of the audio stream contains speech of a particular class; where the one or more classifiers use one or more machine-learned models that have been trained to recognize audio of one or more particular classes to determine the score data; using a sliding time window process, selecting particular scores from the score data; using the selected particular scores, determining and outputting one or more decisions as to whether one or more particular time segments of the audio stream contain speech of one or more particular classes; where the one or more decisions are outputted within a real-time time interval of the receipt of the audio stream; where the one or more decisions are used by downstream processing of the audio stream to control any one or more of the following: labeling the audio stream, segmenting the audio stream, diarizing the audio stream.

10.

发明授权
Method and apparatus for classifying lexical stress 有权

公开(公告)号：US09928832B2

公开(公告)日：2018-03-27

申请号：US14320152

申请日：2014-06-30

Applicant: SRI International

Inventor： Horacio E. Franco , Luciana Ferrer , Harry Bratt , Colleen Richey , Kristin Precoda , Victor Abrash

IPC: G06F17/27 , G10L15/18 , G10L25/48 , G10L25/24

CPC classification number: G10L15/1807 , G10L25/24 , G10L25/48

Abstract: A method for classifying lexical stress in an utterance includes generating a feature vector representing stress characteristics of a syllable occurring in the utterance, wherein the feature vector includes a plurality of features based on prosodic information and spectral information, computing a plurality of scores, wherein each of the plurality of scores is related to a probability of a given class of lexical stress, and classifying the lexical stress of the syllable based on the plurality of scores.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification