-
公开(公告)号:US11217228B2
公开(公告)日:2022-01-04
申请号:US16085262
申请日:2017-03-22
Applicant: SRI International
Inventor: Vikramjit Mitra , Horacio E. Franco , Chris D. Bartels , Dimitra Vergyri , Julien van Hout , Martin Graciarena
Abstract: Systems and methods for speech recognition are provided. In some aspects, the method comprises receiving, using an input, an audio signal. The method further comprises splitting the audio signal into auditory test segments. The method further comprises extracting, from each of the auditory test segments, a set of acoustic features. The method further comprises applying the set of acoustic features to a deep neural network to produce a hypothesis for the corresponding auditory test segment. The method further comprises selectively performing one or more of: indirect adaptation of the deep neural network and direct adaptation of the deep neural network.
-
公开(公告)号:US10478111B2
公开(公告)日:2019-11-19
申请号:US15505577
申请日:2015-08-05
Applicant: SRI International
Inventor: Bruce Knoth , Dimitra Vergyri , Elizabeth Shriberg , Vikramjit Mitra , Mitchell McLaren , Andreas Kathol , Colleen Richey , Martin Graciarena
Abstract: A computer-implemented method can include a speech collection module collecting a speech pattern from a patient, a speech feature computation module computing at least one speech feature from the collected speech pattern, a mental health determination module determining a state-of-mind of the patient based at least in part on the at least one computed speech feature, and an output module providing an indication of a diagnosis with regard to a possibility that the patient is suffering from a certain condition such as depression or Post-Traumatic Stress Disorder (PTSD).
-
公开(公告)号:US20200311122A1
公开(公告)日:2020-10-01
申请号:US16365439
申请日:2019-03-26
Applicant: SRI International
Inventor: Bhaskar Ramamurthy , Rajan Singh , Dimitra Vergyri , Jagjit Singh Srawan , Rolf Joseph Rando
IPC: G06F16/483 , G06F16/438 , G06F16/332 , G06N20/20 , G06N3/08 , G06F16/432
Abstract: In general, the disclosure describes techniques for personalizing a meeting summary according to the relevance of different meeting items within a meeting to different users. In some examples, a computing system for automatically providing personalized summaries of meetings comprises a memory configured to store information describing a meeting; and processing circuitry configured to receive a plurality of meeting item summaries of respective meeting items included in the transcript of the meeting; determine, by applying a model of meeting item relevance to the meeting item summaries, a corresponding relevance to a user of each of the meeting item summaries; and output respective indications of relevance to the user for one or more of the meeting item summaries to provide a personalized summary of the meeting to the user.
-
公开(公告)号:US20180314689A1
公开(公告)日:2018-11-01
申请号:US16014593
申请日:2018-06-21
Applicant: SRI International
Inventor: Wen Wang , Dimitra Vergyri , Girish Acharya
CPC classification number: G06F17/289 , G06F16/90332 , G06F17/2785 , G10L15/07 , G10L15/1815 , G10L15/1822 , G10L15/22 , G10L25/63 , G10L2015/228
Abstract: Provided are systems, computer-implemented methods, and computer-program products for a multi-lingual device, capable of receiving verbal input in multiple languages, and further capable of providing conversational responses in multiple languages. In various implementations, the multi-lingual device includes an automatic speech recognition engine capable of receiving verbal input in a first natural language and providing a textual representation of the input and a confidence value for the recognition. The multi-lingual device can also include a machine translation engine, capable of translating textual input from the first natural language into a second natural language. The machine translation engine can output a confidence value for the translation. The multi-lingual device can further include natural language processing, capable of translating from the second natural language to a computer-based language. Input in the computer-based language can be processed, and the multi-lingual device can take an action based on the result of the processing.
-
公开(公告)号:US20250124352A1
公开(公告)日:2025-04-17
申请号:US18916442
申请日:2024-10-15
Applicant: SRI International
Inventor: Anirudh Som , Karan Sikka , Ajay Divakaran , Helen Gent , Andreas Kathol , Dimitra Vergyri
IPC: G06N20/00
Abstract: Techniques are described for a machine learning system configured to generate respective sample embeddings for a plurality of sample statements. The machine learning system may further be configured to generate a statement embedding for a statement. The machine learning system may further be configured to determine, based on the sample embedding and the statement embedding, respective similarity scores for the sample embeddings. The machine learning system may further be configured to select, based on the respective similarity scores for the sample embeddings, one or more sample statements from the plurality of sample statements. The machine learning system may further be configured to generate a prompt including the one or more sample statements, the statement, and at least one of respective ground-truth information or respective paraphrases for the selected one or more sample statements. The machine learning system may further be configured to provide the prompt to a machine learning model.
-
公开(公告)号:US20190332680A1
公开(公告)日:2019-10-31
申请号:US16509428
申请日:2019-07-11
Applicant: SRI International
Inventor: Wen Wang , Dimitra Vergyri , Girish Acharya
Abstract: Provided are systems, computer-implemented methods, and computer-program products for a multi-lingual device, capable of receiving verbal input in multiple languages, and further capable of providing conversational responses in multiple languages. In various implementations, the multi-lingual device includes an automatic speech recognition engine capable of receiving verbal input in a first natural language and providing a textual representation of the input and a confidence value for the recognition. The multi-lingual device can also include a machine translation engine, capable of translating textual input from the first natural language into a second natural language. The machine translation engine can output a confidence value for the translation. The multi-lingual device can further include natural language processing, capable of translating from the second natural language to a computer-based language. Input in the computer-based language can be processed, and the multi-lingual device can take an action based on the result of the processing.
-
公开(公告)号:US20190108841A1
公开(公告)日:2019-04-11
申请号:US16088737
申请日:2017-06-03
Applicant: SRI International
Inventor: Dimitra Vergyri , Diego Castan Lavilla , Girish Acharya , David Sahner , Elizabeth Shriberg , Joseph B Rogers , Bruce H Knoth
Abstract: An electronic device for providing health information or assistance includes an input configured to receive at least one type of signal selected from sound signals, verbal signals, non-verbal signals, and combinations thereof, a communication module configured to send information related to the at least one user and his/her environment to a remote device, including the sound signals, non-verbal signals, and verbal signals, the remote device being configured to analyze a condition of the at least one user and communicate condition signals to the electronic device, a processing module configured to receive the condition signals and to cause the electronic device to engage in a passive monitoring mode or an active engagement and monitoring mode, the active engagement and monitoring mode including, but not limited to, verbal communication with the at least one user, and an output configured to engage the at least one user in verbal communication.
-
公开(公告)号:US20220115001A1
公开(公告)日:2022-04-14
申请号:US17418193
申请日:2020-05-07
Applicant: SRI International
Inventor: Harry Bratt , Kristin Precoda , Dimitra Vergyri
IPC: G10L13/10 , G10L15/22 , G10L15/18 , G10L25/63 , G10L13/027
Abstract: A voice-based digital assistant (VDA) uses a conversation intelligence (CI) manager module having a rule-based engine on conversational intelligence to process information from one or more modules to make determinations on both i) understanding the human conversational cues and ii) generating the human conversational cues, including at least understanding and generating a backchannel utterance, in a flow and exchange of human communication in order to at least one of grab or yield a conversational floor between a user and the VDA. The CI manager module uses the rule-based engine to analyze and make a determination on a conversational cue of, at least, prosody in a user's flow of speech to generate the backchannel utterance to signal any of i) an understanding, ii) a correction, iii) a confirmation, and iv) a questioning of verbal communications conveyed by the user in the flow of speech during a time frame when the user still holds the conversational floor.
-
公开(公告)号:US10915570B2
公开(公告)日:2021-02-09
申请号:US16365439
申请日:2019-03-26
Applicant: SRI International
Inventor: Bhaskar Ramamurthy , Rajan Singh , Dimitra Vergyri , Jagjit Singh Srawan , Rolf Joseph Rando
IPC: H04L29/06 , G06F16/483 , G06F16/438 , G06F16/432 , G06N20/20 , G06N3/08 , G06F16/332
Abstract: In general, the disclosure describes techniques for personalizing a meeting summary according to the relevance of different meeting items within a meeting to different users. In some examples, a computing system for automatically providing personalized summaries of meetings comprises a memory configured to store information describing a meeting; and processing circuitry configured to receive a plurality of meeting item summaries of respective meeting items included in the transcript of the meeting; determine, by applying a model of meeting item relevance to the meeting item summaries, a corresponding relevance to a user of each of the meeting item summaries; and output respective indications of relevance to the user for one or more of the meeting item summaries to provide a personalized summary of the meeting to the user.
-
公开(公告)号:US10726846B2
公开(公告)日:2020-07-28
申请号:US16088737
申请日:2017-06-03
Applicant: SRI International
Inventor: Dimitra Vergyri , Diego Castan Lavilla , Girish Acharya , David Sahner , Elizabeth Shriberg , Joseph B Rogers , Bruce H Knoth
Abstract: An electronic device for providing health information or assistance includes an input configured to receive at least one type of signal selected from sound signals, verbal signals, non-verbal signals, and combinations thereof, a communication module configured to send information related to the at least one user and his/her environment to a remote device, including the sound signals, non-verbal signals, and verbal signals, the remote device being configured to analyze a condition of the at least one user and communicate condition signals to the electronic device, a processing module configured to receive the condition signals and to cause the electronic device to engage in a passive monitoring mode or an active engagement and monitoring mode, the active engagement and monitoring mode including, but not limited to, verbal communication with the at least one user, and an output configured to engage the at least one user in verbal communication.
-
-
-
-
-
-
-
-
-