Patent search ap:("INTERACTIVE INTELLIGENCE GROUP Page INC.") AND inv:"Felix Immanuel Wyss"

11.

发明申请
TECHNOLOGIES FOR AUTHENTICATING A SPEAKER USING VOICE BIOMETRICS 审中-公开

公开(公告)号：US20170352353A1

公开(公告)日：2017-12-07

申请号：US15612898

申请日：2017-06-02

Applicant: Interactive Intelligence Group, Inc.

Inventor： Rajesh Dachiraju , Aravind Ganapathiraju , Ananth Nagaraja Iyer , Felix Immanuel Wyss

IPC: G10L17/22 , G10L21/038 , G10L17/04 , G10L25/12 , G10L17/02

CPC classification number: G10L17/22 , G10L15/02 , G10L15/28 , G10L17/02 , G10L17/04 , G10L21/038 , G10L25/12

Abstract: Technologies for authenticating a speaker in a voice authentication system using voice biometrics include a speech collection computing device and a speech authentication computing device. The speech collection computing device is configured to collect a speech signal from a speaker and transmit the speech signal to the speech authentication computing device. The speech authentication computing device is configured to compute a speech signal feature vector for the received speech signal, retrieve a speech signal classifier associated with the speaker, and feed the speech signal feature vector to the retrieved speech signal classifier. Additionally, the speech authentication computing device is configured to determine whether the speaker is an authorized speaker based on an output of the retrieved speech signal classifier. Additional embodiments are described herein.

12.

发明申请
METHOD AND SYSTEM FOR LEARNING CALL ANALYSIS 审中-公开

公开(公告)号：US20170085710A1

公开(公告)日：2017-03-23

申请号：US15370120

申请日：2016-12-06

Applicant: Interactive Intelligence Group, Inc.

Inventor： Felix Immanuel Wyss , Matthew Alan Taylor , Kevin Charles Vlack

IPC: H04M3/51 , H04M3/42 , G10L25/51

CPC classification number: H04M3/5158 , G09B19/00 , G10L25/48 , G10L25/51 , H04M3/42102 , H04M3/5183 , H04M2203/2027 , H04M2203/558 , H04M2203/6054

Abstract: A system and method are presented for learning call analysis. Audio fingerprinting may be employed to identify audio recordings that answer communications. In one embodiment, the system may generate a fingerprint of a candidate audio stream and compare it against known fingerprints within a database. The system may also search for a speech-like signal to determine if the endpoint contains a known audio recording. If a known audio recording is not encountered, a fingerprint may be computed for the contact and the communication routed to a human for handling. An indication may be made as to if the call is indeed an audio recording. The associated information may be saved and used for future identification purposes.

13.

发明申请
System and Method for Learning Alternate Pronunciations for Speech Recognition 有权
Title translation: 学习用于语音识别的替代发音的系统和方法

公开(公告)号：US20170032780A1

公开(公告)日：2017-02-02

申请号：US15291353

申请日：2016-10-12

Applicant: Interactive Intelligence Group, Inc.

Inventor： Zhenhao Ge , Vivek Tyagi , Aravind Ganapathiraju , Ananth Nagaraja Iyer , Scott Allen Randal , Felix Immanuel Wyss

IPC: G10L15/06 , G09B19/06 , G06F17/27 , G09B19/04 , G10L15/187 , G10L15/14

CPC classification number: G10L15/063 , G06F17/2735 , G06F17/28 , G09B19/04 , G09B19/06 , G10L15/14 , G10L15/187 , G10L2015/081

Abstract: A system and method for learning alternate pronunciations for speech recognition is disclosed. Alternative name pronunciations may be covered, through pronunciation learning, that have not been previously covered in a general pronunciation dictionary. In an embodiment, the detection of phone-level and syllable-level mispronunciations in words and sentences may be based on acoustic models trained by Hidden Markov Models. Mispronunciations may be detected by comparing the likelihood of the potential state of the targeting pronunciation unit with a pre-determined threshold through a series of tests. It is also within the scope of an embodiment to detect accents.

Abstract translation: 公开了用于学习语音识别的替代发音的系统和方法。通过发音学习可以覆盖另类名称发音，这些发音先前未被一般发音词典涵盖。在一个实施例中，在单词和句子中检测电话级和音节级错误可以基于由隐马尔可夫模型训练的声学模型。可以通过一系列测试来比较目标语音单元的潜在状态与预定阈值的可能性来检测微分。检测重音也属于实施例的范围。

14.

发明申请
SYSTEM AND METHOD FOR OFFLINE SURVIVABILITY 审中-公开
Title translation: 离线生存的系统和方法

公开(公告)号：US20160294955A1

公开(公告)日：2016-10-06

申请号：US14674437

申请日：2015-03-31

Applicant: Interactive Intelligence Group, Inc.

Inventor： Richard M. Neidermyer , Kevin Elliott King , Felix Immanuel Wyss

IPC: H04L29/08 , H04L12/26

CPC classification number: H04L43/0811 , H04L63/0428 , H04L67/2842 , H04L69/40

Abstract: A system and method are presented for on premise and offline survivability of an interactive voice response system in a cloud telephony system. Voice interaction control may be divided from the media resources. Survivability is invoked when the communication technology between the Cloud and the voice interaction's resource provider is degraded or disrupted. The system is capable of recovering after a disruption event such that a seamless transition between failure and non-failure states is provided for a limited impact to a user's experience. When communication paths or Cloud control is re-established, the user resumes normal processing and full functionality as if the failure had not occurred.

Abstract translation: 提出了一种用于云电话系统中交互式语音应答系统的前提和离线生存性的系统和方法。语音交互控制可能与媒体资源分开。当Cloud与语音交互的资源提供者之间的通信技术降级或中断时，可以调用生存性。该系统能够在中断事件之后恢复，从而提供故障和非故障状态之间的无缝转换，以对用户体验产生有限的影响。当重新建立通信路径或Cloud控制时，用户恢复正常处理和完整功能，就像未发生故障一样。

15.

发明申请
SYSTEM AND METHOD FOR OPTIMIZATION OF AUDIO FINGERPRINT SEARCH 审中-公开
Title translation: 用于优化音频指纹搜索的系统和方法

公开(公告)号：US20150254338A1

公开(公告)日：2015-09-10

申请号：US14636474

申请日：2015-03-03

Applicant: Interactive Intelligence Group, Inc.

Inventor： Srinath Cheluvaraja , Ananth Nagaraja Iyer , Felix Immanuel Wyss

IPC: G06F17/30 , G10L19/018

CPC classification number: G06F17/30743 , G10L25/51

Abstract: A system and method are presented for optimization of audio fingerprint search. In an embodiment, the audio fingerprints are organized into a recursive tree with different branches containing fingerprint sets that are dissimilar to each other. The tree is constructed using a clustering algorithm based on a similarity measure. The similarity measure may comprise a Hamming distance for a binary fingerprint or a Euclidean distance for continuous valued fingerprints. In another embodiment, each fingerprint is stored at a plurality of resolutions and clustering is performed hierarchically. The recognition of an incoming fingerprint begins from the root of the tree and proceeds down its branches until a match or mismatch is declared. In yet another embodiment, a fingerprint definition is generalized to include more detailed audio information than in the previous definition.

Abstract translation: 提出了一种用于音频指纹搜索优化的系统和方法。在一个实施例中，音频指纹被组织成具有不同分支的递归树，该分支包含彼此不相似的指纹集。使用基于相似性度量的聚类算法构建树。相似性度量可以包括二进制指纹的汉明距离或连续值指纹的欧几里德距离。在另一个实施例中，每个指纹以多个分辨率存储，并且分层地进行聚类。输入指纹的识别从树的根开始，并向下延伸到其分支，直到声明匹配或不匹配。在另一个实施例中，指纹定义被概括为包括比先前定义中更详细的音频信息。

16.

发明申请
SYSTEM AND METHOD FOR SYNTHESIS OF SPEECH FROM PROVIDED TEXT 有权
Title translation: 从提供的文本合成语音的系统和方法

公开(公告)号：US20150199956A1

公开(公告)日：2015-07-16

申请号：US14596628

申请日：2015-01-14

Applicant: INTERACTIVE INTELLIGENCE GROUP, INC.

Inventor： Yingyi Tan , Aravind Ganapathiraju , Felix Immanuel Wyss

IPC: G10L13/02

CPC classification number: G10L13/08

Abstract: A system and method are presented for the synthesis of speech from provided text. Particularly, the generation of parameters within the system is performed as a continuous approximation in order to mimic the natural flow of speech as opposed to a step-wise approximation of the feature stream. Provided text may be partitioned and parameters generated using a speech model. The generated parameters from the speech model may then be used in a post-processing step to obtain a new set of parameters for application in speech synthesis.

Abstract translation: 提供了一种用于从提供的文本合成语音的系统和方法。特别地，系统内的参数的产生被执行为连续近似，以便模拟语音的自然流动而不是特征流的逐步近似。所提供的文本可以被分割，并且使用语音模型生成参数。然后可以在后处理步骤中使用来自语音模型的所生成的参数，以获得用于语音合成中的应用的一组新参数。

17.

发明申请
System and Method for Learning Alternate Pronunciations for Speech Recognition 有权
Title translation: 学习用于语音识别的替代发音的系统和方法

公开(公告)号：US20150106082A1

公开(公告)日：2015-04-16

申请号：US14515607

申请日：2014-10-16

Applicant: Interactive Intelligence Group, Inc.

Inventor： Zhenhao Ge , Vivek Tyagi , Aravind Ganapathiraju , Ananth Nagaraja Iyer , Scott Allen Randal , Felix Immanuel Wyss

IPC: G10L15/187 , G06F17/28 , G09B19/04 , G06F17/27

CPC classification number: G10L15/063 , G06F17/2735 , G06F17/28 , G09B19/04 , G09B19/06 , G10L15/14 , G10L15/187 , G10L2015/081

Abstract: A system and method for learning alternate pronunciations for speech recognition is disclosed. Alternative name pronunciations may be covered, through pronunciation learning, that have not been previously covered in a general pronunciation dictionary. In an embodiment, the detection of phone-level and syllable-level mispronunciations in words and sentences may be based on acoustic models trained by Hidden Markov Models. Mispronunciations may be detected by comparing the likelihood of the potential state of the targeting pronunciation unit with a pre-determined threshold through a series of tests. It is also within the scope of an embodiment to detect accents.

Abstract translation: 公开了用于学习语音识别的替代发音的系统和方法。通过发音学习可以覆盖另类名称发音，这些发音先前未被一般发音词典涵盖。在一个实施例中，在单词和句子中检测电话级和音节级错误可以基于由隐马尔可夫模型训练的声学模型。可以通过一系列测试来比较目标语音单元的潜在状态与预定阈值的可能性来检测微分。检测重音也属于实施例的范围。

18.

发明授权
System and method for recording and monitoring communications using a media server 有权

公开(公告)号：US10516716B2

公开(公告)日：2019-12-24

申请号：US15215069

申请日：2016-07-20

Applicant: Interactive Intelligence Group, Inc.

Inventor： Felix Immanuel Wyss , Michael D. Snyder , Kevin O'Connor

IPC: H04L29/06 , H04M3/22 , H04M3/42 , H04M3/51 , H04L12/26

Abstract: A communication system including a media server through which communication packets are exchanged for recording and monitoring purposes is disclosed. A tap is associated with each communication endpoint allowing for cradle to grave recording of communications despite their subsequent routing or branching. An incoming communication is routed to a first tap and upon selection of a receiving party; the first tap is routed to a second tap which forwards communication packets on to the receiving party. The taps may be used to forward communication packets to any number of other taps or destinations, such as a recording device, monitoring user, or other user in the form of a conference.

19.

发明授权
Method and system for acoustic data selection for training the parameters of an acoustic model 有权

公开(公告)号：US10157610B2

公开(公告)日：2018-12-18

申请号：US15850106

申请日：2017-12-21

Applicant: INTERACTIVE INTELLIGENCE GROUP, INC.

Inventor： Vivek Tyagi , Aravind Ganapathiraju , Felix Immanuel Wyss

IPC: G10L15/04 , G10L15/06 , G10L15/14 , G10L15/02

Abstract: A system and method are presented for acoustic data selection of a particular quality for training the parameters of an acoustic model, such as a Hidden Markov Model and Gaussian Mixture Model, for example, in automatic speech recognition systems in the speech analytics field. A raw acoustic model may be trained using a given speech corpus and maximum likelihood criteria. A series of operations are performed, such as a forced Viterbi-alignment, calculations of likelihood scores, and phoneme recognition, for example, to form a subset corpus of training data. During the process, audio files of a quality that does not meet a criterion, such as poor quality audio files, may be automatically rejected from the corpus. The subset may then be used to train a new acoustic model.

20.

发明申请
SYSTEM AND METHOD FOR PARAMETERIZATION OF SPEECH RECOGNITION GRAMMAR SPECIFICATION (SRGS) GRAMMARS 审中-公开

公开(公告)号：US20180122370A1

公开(公告)日：2018-05-03

申请号：US15802269

申请日：2017-11-02

Applicant: INTERACTIVE INTELLIGENCE GROUP, INC.

Inventor： Felix Immanuel Wyss

IPC: G10L15/193 , G10L15/06 , H04M3/493

CPC classification number: G10L15/193 , G10L15/063 , G10L2015/0631 , H04M3/4938

Abstract: A method includes: loading, by a processor, a grammar specification defining at least one parameterizable grammar including a plurality of rules; setting, by the processor, an initial state of a grammar processor as a current state, the current state including parameters supplied to the rules; selecting, by the processor, a rule of the plurality of rules matching the parameters of the current state of the grammar processor; applying, by the processor, the selected rule to the audio and updating the current state; determining, by the processor, whether termination conditions have been met; in response to determining the termination conditions are not met, selecting, by the processor, from the plurality of rules in accordance with parameters of the updated state; and in response to determining the termination conditions are met, outputting, by the processor, a recognizer result of the current state.

Search Results

Country/Region

Patent validity

Application date

Publication (announcement) day

applicant

The country/region where the applicant is located

Inventor

IPC

IPC Department

IPC class

IPC subclass

IPC group

IPC team

Appearance classification