Methods for the encoding of participants in a conference

    公开(公告)号:US10237413B2

    公开(公告)日:2019-03-19

    申请号:US15917425

    申请日:2018-03-09

    Abstract: A system and method are presented for the encoding of participants in a conference setting. In an embodiment, audio from conference participants in a voice-over-IP setting may be received and processed by the system. In an embodiment, audio may be received in a compressed form and de-compressed for processing. For each participant, return audio is generated, compressed (if applicable) and transmitted to the participant. The system may recognize when participants are using the same audio encoding format and are thus receiving audio that may be similar or identical. The audio may only be encoded once instead of for each participant. Thus, redundant encodings are recognized and eliminated resulting in less CPU usage.

    Method and system for learning call analysis

    公开(公告)号:US10116793B2

    公开(公告)日:2018-10-30

    申请号:US15370120

    申请日:2016-12-06

    Abstract: A system and method are presented for learning call analysis. Audio fingerprinting may be employed to identify audio recordings that answer communications. In one embodiment, the system may generate a fingerprint of a candidate audio stream and compare it against known fingerprints within a database. The system may also search for a speech-like signal to determine if the endpoint contains a known audio recording. If a known audio recording is not encountered, a fingerprint may be computed for the contact and the communication routed to a human for handling. An indication may be made as to if the call is indeed an audio recording. The associated information may be saved and used for future identification purposes.

    System and Method to Correct for Packet Loss in ASR Systems
    5.
    发明申请
    System and Method to Correct for Packet Loss in ASR Systems 审中-公开
    系统和方法来纠正ASR系统中的丢包

    公开(公告)号:US20150255075A1

    公开(公告)日:2015-09-10

    申请号:US14638198

    申请日:2015-03-04

    Abstract: A system and method are presented for the correction of packet loss in audio in automatic speech recognition (ASR) systems. Packet loss correction, as presented herein, occurs at the recognition stage without modifying any of the acoustic models generated during training. The behavior of the ASR engine in the absence of packet loss is thus not altered. To accomplish this, the actual input signal may be rectified, the recognition scores may be normalized to account for signal errors, and a best-estimate method using information from previous frames and acoustic models may be used to replace the noisy signal.

    Abstract translation: 提出了一种用于在自动语音识别(ASR)系统中校正音频中的分组丢失的系统和方法。 如本文所示,分组丢失校正发生在识别阶段,而不修改在训练期间产生的任何声学模型。 因此,在没有丢包的情况下,ASR引擎的行为不会改变。 为了实现这一点,实际输入信号可以被纠正,识别分数可以被归一化以考虑信号误差,并且可以使用使用来自先前帧和声学模型的信息的最佳估计方法来代替噪声信号。

    System and method for fingerprinting datasets

    公开(公告)号:US10552457B2

    公开(公告)日:2020-02-04

    申请号:US15876050

    申请日:2018-01-19

    Abstract: Systems and methods for the matching of datasets, such as input audio segments, with known datasets in a database are disclosed. In an illustrative embodiment, the use of the presently disclosed systems and methods is described in conjunction with recognizing known network message recordings encountered during an outbound telephone call. The methodologies include creation of a ternary fingerprint bitmap to make the comparison process more efficient. Also disclosed are automated methodologies for creating the database of known datasets from a larger collection of datasets.

    System and method for optimization of audio fingerprint search

    公开(公告)号:US10303800B2

    公开(公告)日:2019-05-28

    申请号:US14636474

    申请日:2015-03-03

    Abstract: A system and method are presented for optimization of audio fingerprint search. In an embodiment, the audio fingerprints are organized into a recursive tree with different branches containing fingerprint sets that are dissimilar to each other. The tree is constructed using a clustering algorithm based on a similarity measure. The similarity measure may comprise a Hamming distance for a binary fingerprint or a Euclidean distance for continuous valued fingerprints. In another embodiment, each fingerprint is stored at a plurality of resolutions and clustering is performed hierarchically. The recognition of an incoming fingerprint begins from the root of the tree and proceeds down its branches until a match or mismatch is declared. In yet another embodiment, a fingerprint definition is generalized to include more detailed audio information than in the previous definition.

    SYSTEM AND METHOD FOR SYNTHESIS OF SPEECH FROM PROVIDED TEXT

    公开(公告)号:US20180144739A1

    公开(公告)日:2018-05-24

    申请号:US15874612

    申请日:2018-01-18

    Abstract: A system and method are presented for the synthesis of speech from provided text. Particularly, the generation of parameters within the system is performed as a continuous approximation in order to mimic the natural flow of speech as opposed to a step-wise approximation of the feature stream. Provided text may be partitioned and parameters generated using a speech model. The generated parameters from the speech model may then be used in a post-processing step to obtain a new set of parameters for application in speech synthesis.

Patent Agency Ranking