-
Publication No.: US11664042B2
Publication date: 2023-05-30
Application No.: US17330011
Application date: 2021-05-25
Applicant: Plantronics, Inc.
Inventor: Shridhar K. Mukund , Pamornpol Jinachitra
IPC: G10L21/028 , G10L21/0232 , G10L25/84 , H04M1/60 , H04R1/40 , H04R3/00
CPC classification number: G10L21/028 , G10L21/0232 , G10L25/84 , H04M1/6058 , H04R1/406 , H04R3/005 , H04M2201/40
Abstract: A head-worn audio device is provided with a circuit for voice signal enhancement. The circuit comprises at least a plurality of microphones, arranged at predefined positions, where each microphone provides a microphone signal. The circuit further comprises a directivity pre-processor and a blind source separation processor. The directivity pre-processor is connected with the plurality of microphones to receive the microphone signals and is configured to provide at least a voice signal and a noise signal. Directivity pre-processing increases the mutual independence of the signals provided to the blind source separation processor and thus improves processing by blind source separation. The blind source separation processor receives at least the voice signal and the noise signal, and is configured to conduct blind source separation on at least the voice signal and the noise signal to provide at least an enhanced voice signal with reduced noise components.
-
Publication No.: US11664029B2
Publication date: 2023-05-30
Application No.: US16740574
Application date: 2020-01-13
Applicant: Ultratec, Inc.
Inventor: Robert M. Engelke , Kevin R. Colwell , Christopher Engelke
CPC classification number: G10L15/26 , G10L15/01 , H04M1/2475 , G10L15/1815 , G10L25/48 , G10L25/60 , H04M3/42391 , H04M2201/40 , H04M2201/60 , H04M2203/2061
Abstract: A method to transcribe communications includes the steps of obtaining a plurality of hypothesis transcriptions of a voice signal generated by a speech recognition system, determining consistent words that are included in at least first and second of the plurality of hypothesis transcriptions, in response to determining the consistent words, providing the consistent words to a device for presentation of the consistent words to an assisted user, and presenting the consistent words via a display screen on the device, wherein a rate of the presentation of the words on the display screen is variable.
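As a rough illustration of the consistency check described in this abstract, the Python sketch below keeps the leading words on which all hypothesis transcriptions agree. The function name `consistent_prefix` and the prefix-matching rule are assumptions made for illustration; the abstract does not say how consistency is determined.

```python
def consistent_prefix(hypotheses):
    """Return the leading words shared by every hypothesis (illustrative rule only)."""
    # Lowercase and split each hypothesis into words.
    tokenized = [h.lower().split() for h in hypotheses]
    consistent = []
    # Walk the hypotheses word by word and stop at the first disagreement.
    for words in zip(*tokenized):
        if all(w == words[0] for w in words):
            consistent.append(words[0])
        else:
            break
    return consistent


words_to_present = consistent_prefix(
    ["please call me back today", "please call me that today"]
)
```

Only the words in `words_to_present` would be forwarded for display under this assumed rule; later words would wait until the hypotheses agree.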
-
Publication No.: US20190244630A1
Publication date: 2019-08-08
Application No.: US16383861
Application date: 2019-04-15
Applicant: AT&T INTELLECTUAL PROPERTY I, L.P.
Inventor: David C. GIBBON , Andrea BASSO , Lee BEGEJA , Sumit KUMAR , Zhu LIU , Bernard S. RENGER , Behzad SHAHRARAY , Eric ZAVESKY
IPC: G10L21/10 , H04L29/06 , H04L29/08 , G06F16/61 , G11B27/28 , G10L15/26 , G06F16/683 , G06F3/16 , G06F16/68 , G11B27/34
CPC classification number: G10L21/10 , G06F3/167 , G06F16/61 , G06F16/683 , G06F16/686 , G06F16/9535 , G10L15/07 , G10L15/26 , G10L15/265 , G10L17/00 , G10L17/005 , G10L2015/0631 , G11B27/28 , G11B27/34 , H04L65/403 , H04L67/306 , H04M2201/40 , H04N5/76
Abstract: Speaker content generated in an audio conference is selectively and visually represented. A profile for each audience member who participates in the audio conference is obtained. Speaker content spoken during the audio conference is monitored. Different weights are applied to words included in the speaker content according to a parameter of the profile for each of the audience members. A relation between the speaker content and the profile for each of the audience members is determined. Visual representations of the speaker content are presented to selected members among the audience members based on the determined relation.
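The weighting step in this abstract can be sketched as follows. The per-profile weight dictionaries, the additive score, and the threshold are all hypothetical choices for illustration; the abstract does not specify how weights or the relation are computed.

```python
def relation_score(speaker_words, profile_weights):
    """Sum the profile-specific weights of the spoken words (assumed scoring rule)."""
    return sum(profile_weights.get(w.lower(), 0.0) for w in speaker_words)


def select_members(speaker_words, profiles, threshold=1.0):
    """Return the audience members whose score meets the (assumed) threshold."""
    return [
        name
        for name, weights in profiles.items()
        if relation_score(speaker_words, weights) >= threshold
    ]


# Hypothetical profiles: each member weights words according to their interests.
profiles = {"alice": {"budget": 1.5}, "bob": {"hiring": 2.0}}
recipients = select_members(["The", "budget", "review"], profiles)
```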
-
Publication No.: US20190215400A1
Publication date: 2019-07-11
Application No.: US16359745
Application date: 2019-03-20
Applicant: Sorenson IP Holdings, LLC
Inventor: Jasper Cheekeong Pan
CPC classification number: H04M3/42391 , G10L15/26 , G10L15/30 , H04M1/72591 , H04M3/4936 , H04M2201/40 , H04W4/12
Abstract: A method to transcribe communications is provided. The method may include obtaining first communication data during a communication session between a first communication device and a second communication device and transmitting the first communication data to the second communication device by way of a mobile device that is locally coupled with the first communication device. The method may also include receiving, at the first communication device, second communication data from the second communication device through the mobile device and transmitting the second communication data to a remote transcription system. The method may further include receiving, at the first communication device, transcription data from the remote transcription system, the transcription data corresponding to a transcription of the second communication data, the transcription generated by the remote transcription system, and presenting, by the first communication device, the transcription of the second communication data.
-
Publication No.: US20190199858A1
Publication date: 2019-06-27
Application No.: US16181388
Application date: 2018-11-06
Inventor: Yuko KANETSUKI , Takashi SUGIYAMA , Terumi SAITO
CPC classification number: H04M3/5175 , G10L15/22 , G10L15/28 , H04M3/42221 , H04M2201/40
Abstract: An evaluation criterion for a call performed between an operator and a customer is set without taking time and effort. A voice recognition system includes a call recording unit that records a call performed between a customer and an operator, a voice recognition unit that recognizes the call recorded by the call recording unit and a value of non-verbal information indicating a feature of a calling party in the call and accumulates a recognized result in a storage unit, and a voice recognition result managing unit that sets a reference value for evaluating the calling party on the basis of the value of the non-verbal information included in the recognized result.
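One way to picture the reference-value step is the Python sketch below, which derives a reference from accumulated non-verbal measurements. Using the arithmetic mean, and speech rate as the non-verbal feature, are assumptions for illustration only; the abstract names neither.

```python
from statistics import mean


def reference_value(nonverbal_values):
    """Derive a reference value from accumulated measurements (mean is an assumption)."""
    return mean(nonverbal_values)


def exceeds_reference(value, reference):
    """Compare one call's measurement with the reference value."""
    return value > reference


# Hypothetical speech-rate measurements (e.g. syllables per second) from past calls.
reference = reference_value([4.0, 5.0, 6.0])
```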
-
Publication No.: US20190028592A1
Publication date: 2019-01-24
Application No.: US16002732
Application date: 2018-06-07
Applicant: TOYOTA JIDOSHA KABUSHIKI KAISHA
Inventor: Koichi SUZUKI
CPC classification number: H04M3/58 , G10L15/22 , G10L17/22 , G10L2015/223 , H04M3/42221 , H04M3/4931 , H04M3/4933 , H04M3/4936 , H04M2201/18 , H04M2201/39 , H04M2201/40
Abstract: A voice recognition system includes a call connection control device that controls the call destination of a user, and a computer. The computer is configured to perform voice recognition of speech voice data of the user, determine an intention of a speech of the user based on a voice recognition result of the speech voice data, evaluate the reliability of a response generated by the computer for the user based on the determined intention of the speech of the user, and cause the call connection control device to switch the call destination of the user to an operator terminal in a case where the reliability of the response is equal to or less than a threshold value.
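The switching rule at the end of this abstract amounts to a threshold comparison, sketched below in Python. The function name, the 0-to-1 reliability scale, and the default threshold are hypothetical; only the "equal to or less than" comparison comes from the abstract.

```python
def route_call(response_reliability, threshold=0.6):
    """Pick the call destination from the reliability of the generated response."""
    # Per the abstract, the call switches to an operator when the
    # reliability is equal to or less than the threshold.
    if response_reliability <= threshold:
        return "operator"
    return "automated"
```

For instance, `route_call(0.6)` hands the call to the operator terminal, while `route_call(0.9)` keeps the automated response.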
-
Publication No.: US20180359205A1
Publication date: 2018-12-13
Application No.: US15624367
Application date: 2017-06-15
Applicant: Google Inc.
Inventor: Sriram Bhargav Karnati , Varun Soundararajan
CPC classification number: G06F9/451 , G06F3/01 , G06F3/167 , G06Q10/1093 , H04L12/1813 , H04L51/046 , H04L51/10 , H04L51/16 , H04L65/1069 , H04L67/36 , H04M2201/40 , H04M2203/251
Abstract: A method includes, for each of a plurality of web resources, receiving, at a communications server, data indicating characteristics of a respective web resource, detecting, based on the received data, that the respective web resource provides functionality for live assistance by a third party content provider through a chat user interface on the respective web resource, and storing, in a database, an entry that indicates that the respective web resource has the functionality. The method includes receiving, from a user, a request to access a particular web resource hosted by a particular third party content provider, determining, based on a stored entry in the database representing the particular web resource, that the web resource provides functionality for live assistance by the particular third party content provider through a particular chat user interface on the web resource, and initiating a chat session between the user and the third party content provider.
-
Publication No.: US20180338040A1
Publication date: 2018-11-22
Application No.: US15599537
Application date: 2017-05-19
Applicant: Avaya Inc.
Inventor: Gerard Carty , Thomas Moran
CPC classification number: H04M3/4936 , G06F17/2705 , G06F17/2735 , H04M3/5133 , H04M3/58 , H04M2201/40 , H04M2203/2038 , H04M2203/552
Abstract: A process for updating a second agent about a call in a contact center comprises receiving a call at the contact center and connecting the call to a device associated with a first agent of the contact center. A processor is used to configure a list of keywords to detect during the call, and when a keyword is detected (e.g., using a speech analyzer), a snippet of the call based on the detected keywords is identified. The snippets are ordered and presented to a second agent through a device associated with the second agent. The call is then connected to the device associated with the second agent.
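The keyword-to-snippet step can be sketched in Python as below, working on an already transcribed call. The function name, the fixed word window around each keyword, and ordering by position in the call are assumptions for illustration; the abstract does not define the snippet boundaries or the ordering rule.

```python
def find_snippets(transcript_words, keywords, context=2):
    """Return (position, snippet) pairs for each detected keyword, in call order."""
    keyset = {k.lower() for k in keywords}
    snippets = []
    for i, word in enumerate(transcript_words):
        if word.lower() in keyset:
            # Take a window of `context` words on each side of the keyword.
            start = max(0, i - context)
            snippets.append((i, " ".join(transcript_words[start:i + context + 1])))
    return snippets


call_words = "I want a refund for my last order".split()
snippets = find_snippets(call_words, ["refund"], context=1)
```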
-
Publication No.: US20180324295A1
Publication date: 2018-11-08
Application No.: US16036482
Application date: 2018-07-16
Applicant: NUANCE COMMUNICATIONS, INC.
Inventor: David ANDERSON
CPC classification number: H04M3/42221 , G10L15/26 , G10L15/265 , H04M1/656 , H04M3/51 , H04M3/5175 , H04M3/5183 , H04M2201/40 , H04M2203/301 , H04M2203/303 , H04M2203/5018
Abstract: Embodiments are provided for the automatic real-time recording and processing of media in a communications network based on the context of the media. In one embodiment, a media stream is received in an analysis module in a service platform in the communications network. The media stream may represent a communication session between a calling party and a call center in the network. The incoming media stream is analyzed to identify words comprising a context of the communication session. A determination is then made as to whether the context of the communication session is related to a set of business rules associated with the service platform which may automatically trigger the retention of a recording of the communication session. If the context of the communication session is related to the set of business rules, the retention of the communication session is automatically triggered in real-time at a recording module.
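A minimal Python sketch of the retention decision follows. It assumes the business rules reduce to a set of trigger keywords and that the context is simply the set of recognized words; both are simplifications introduced here, not details from the abstract.

```python
def should_retain(transcript_words, rule_keywords):
    """Return True when any recognized word matches a business-rule keyword."""
    # Normalize the recognized words: lowercase and strip trailing punctuation.
    context = {w.lower().strip(".,!?") for w in transcript_words}
    # Retention is triggered if the context shares any word with the rules.
    return not context.isdisjoint(k.lower() for k in rule_keywords)


retain = should_retain("I want to cancel my contract.".split(), ["contract"])
```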
-
Publication No.: US20180322186A1
Publication date: 2018-11-08
Application No.: US16035895
Application date: 2018-07-16
Applicant: NUANCE COMMUNICATIONS, INC.
Inventor: Julia HIRSCHBERG , Stephen WHITTAKER
CPC classification number: G06F17/30619 , G06F17/30719 , G10L15/26 , H04L51/08 , H04L51/36 , H04M3/533 , H04M2201/40 , H04M2203/301
Abstract: A system and method for speech file processing which provides users with differentially selectable speech file transcripts which can be sent to one or more other users. The speech files may be voicemail messages from which respective voicemail transcripts are created. The voicemail transcripts are provided in a user selectable format from which users may select non-contiguous portions of the transcript.