SYSTEM AND METHOD FOR OFFLINE SURVIVABILITY
    14.
    发明申请
    SYSTEM AND METHOD FOR OFFLINE SURVIVABILITY 审中-公开
    离线生存的系统和方法

    公开(公告)号:US20160294955A1

    公开(公告)日:2016-10-06

    申请号:US14674437

    申请日:2015-03-31

    CPC classification number: H04L43/0811 H04L63/0428 H04L67/2842 H04L69/40

    Abstract: A system and method are presented for on premise and offline survivability of an interactive voice response system in a cloud telephony system. Voice interaction control may be divided from the media resources. Survivability is invoked when the communication technology between the Cloud and the voice interaction's resource provider is degraded or disrupted. The system is capable of recovering after a disruption event such that a seamless transition between failure and non-failure states is provided for a limited impact to a user's experience. When communication paths or Cloud control is re-established, the user resumes normal processing and full functionality as if the failure had not occurred.

    Abstract translation: 提出了一种用于云电话系统中交互式语音应答系统的前提和离线生存性的系统和方法。 语音交互控制可能与媒体资源分开。 当Cloud与语音交互的资源提供者之间的通信技术降级或中断时,可以调用生存性。 该系统能够在中断事件之后恢复,从而提供故障和非故障状态之间的无缝转换,以对用户体验产生有限的影响。 当重新建立通信路径或Cloud控制时,用户恢复正常处理和完整功能,就像未发生故障一样。

    SYSTEM AND METHOD FOR OPTIMIZATION OF AUDIO FINGERPRINT SEARCH
    15.
    发明申请
    SYSTEM AND METHOD FOR OPTIMIZATION OF AUDIO FINGERPRINT SEARCH 审中-公开
    用于优化音频指纹搜索的系统和方法

    公开(公告)号:US20150254338A1

    公开(公告)日:2015-09-10

    申请号:US14636474

    申请日:2015-03-03

    CPC classification number: G06F17/30743 G10L25/51

    Abstract: A system and method are presented for optimization of audio fingerprint search. In an embodiment, the audio fingerprints are organized into a recursive tree with different branches containing fingerprint sets that are dissimilar to each other. The tree is constructed using a clustering algorithm based on a similarity measure. The similarity measure may comprise a Hamming distance for a binary fingerprint or a Euclidean distance for continuous valued fingerprints. In another embodiment, each fingerprint is stored at a plurality of resolutions and clustering is performed hierarchically. The recognition of an incoming fingerprint begins from the root of the tree and proceeds down its branches until a match or mismatch is declared. In yet another embodiment, a fingerprint definition is generalized to include more detailed audio information than in the previous definition.

    Abstract translation: 提出了一种用于音频指纹搜索优化的系统和方法。 在一个实施例中,音频指纹被组织成具有不同分支的递归树,该分支包含彼此不相似的指纹集。 使用基于相似性度量的聚类算法构建树。 相似性度量可以包括二进制指纹的汉明距离或连续值指纹的欧几里德距离。 在另一个实施例中,每个指纹以多个分辨率存储,并且分层地进行聚类。 输入指纹的识别从树的根开始,并向下延伸到其分支,直到声明匹配或不匹配。 在另一个实施例中,指纹定义被概括为包括比先前定义中更详细的音频信息。

    SYSTEM AND METHOD FOR SYNTHESIS OF SPEECH FROM PROVIDED TEXT
    16.
    发明申请
    SYSTEM AND METHOD FOR SYNTHESIS OF SPEECH FROM PROVIDED TEXT 有权
    从提供的文本合成语音的系统和方法

    公开(公告)号:US20150199956A1

    公开(公告)日:2015-07-16

    申请号:US14596628

    申请日:2015-01-14

    CPC classification number: G10L13/08

    Abstract: A system and method are presented for the synthesis of speech from provided text. Particularly, the generation of parameters within the system is performed as a continuous approximation in order to mimic the natural flow of speech as opposed to a step-wise approximation of the feature stream. Provided text may be partitioned and parameters generated using a speech model. The generated parameters from the speech model may then be used in a post-processing step to obtain a new set of parameters for application in speech synthesis.

    Abstract translation: 提供了一种用于从提供的文本合成语音的系统和方法。 特别地,系统内的参数的产生被执行为连续近似,以便模拟语音的自然流动而不是特征流的逐步近似。 所提供的文本可以被分割,并且使用语音模型生成参数。 然后可以在后处理步骤中使用来自语音模型的所生成的参数,以获得用于语音合成中的应用的一组新参数。

    System and Method for Learning Alternate Pronunciations for Speech Recognition
    17.
    发明申请
    System and Method for Learning Alternate Pronunciations for Speech Recognition 有权
    学习用于语音识别的替代发音的系统和方法

    公开(公告)号:US20150106082A1

    公开(公告)日:2015-04-16

    申请号:US14515607

    申请日:2014-10-16

    Abstract: A system and method for learning alternate pronunciations for speech recognition is disclosed. Alternative name pronunciations may be covered, through pronunciation learning, that have not been previously covered in a general pronunciation dictionary. In an embodiment, the detection of phone-level and syllable-level mispronunciations in words and sentences may be based on acoustic models trained by Hidden Markov Models. Mispronunciations may be detected by comparing the likelihood of the potential state of the targeting pronunciation unit with a pre-determined threshold through a series of tests. It is also within the scope of an embodiment to detect accents.

    Abstract translation: 公开了用于学习语音识别的替代发音的系统和方法。 通过发音学习可以覆盖另类名称发音,这些发音先前未被一般发音词典涵盖。 在一个实施例中,在单词和句子中检测电话级和音节级错误可以基于由隐马尔可夫模型训练的声学模型。 可以通过一系列测试来比较目标语音单元的潜在状态与预定阈值的可能性来检测微分。 检测重音也属于实施例的范围。

    Method and system for acoustic data selection for training the parameters of an acoustic model

    公开(公告)号:US10157610B2

    公开(公告)日:2018-12-18

    申请号:US15850106

    申请日:2017-12-21

    Abstract: A system and method are presented for acoustic data selection of a particular quality for training the parameters of an acoustic model, such as a Hidden Markov Model and Gaussian Mixture Model, for example, in automatic speech recognition systems in the speech analytics field. A raw acoustic model may be trained using a given speech corpus and maximum likelihood criteria. A series of operations are performed, such as a forced Viterbi-alignment, calculations of likelihood scores, and phoneme recognition, for example, to form a subset corpus of training data. During the process, audio files of a quality that does not meet a criterion, such as poor quality audio files, may be automatically rejected from the corpus. The subset may then be used to train a new acoustic model.

    SYSTEM AND METHOD FOR PARAMETERIZATION OF SPEECH RECOGNITION GRAMMAR SPECIFICATION (SRGS) GRAMMARS

    公开(公告)号:US20180122370A1

    公开(公告)日:2018-05-03

    申请号:US15802269

    申请日:2017-11-02

    CPC classification number: G10L15/193 G10L15/063 G10L2015/0631 H04M3/4938

    Abstract: A method includes: loading, by a processor, a grammar specification defining at least one parameterizable grammar including a plurality of rules; setting, by the processor, an initial state of a grammar processor as a current state, the current state including parameters supplied to the rules; selecting, by the processor, a rule of the plurality of rules matching the parameters of the current state of the grammar processor; applying, by the processor, the selected rule to the audio and updating the current state; determining, by the processor, whether termination conditions have been met; in response to determining the termination conditions are not met, selecting, by the processor, from the plurality of rules in accordance with parameters of the updated state; and in response to determining the termination conditions are met, outputting, by the processor, a recognizer result of the current state.

Patent Agency Ranking