-
公开(公告)号:US12125482B2
公开(公告)日:2024-10-22
申请号:US16692150
申请日:2019-11-22
Applicant: INTEL CORPORATION
CPC classification number: G10L15/22 , G06F16/638 , G10L15/02 , G10L15/16 , G10L17/00 , G10L25/78 , G10L2015/027 , G10L2015/088 , G10L2015/223
Abstract: An example apparatus for recognizing speech includes an audio receiver to receive a stream of audio. The apparatus also includes a key phrase detector to detect a key phrase in the stream of audio. The apparatus further includes a model adapter to dynamically adapt a model based on the detected key phrase. The apparatus also includes a query recognizer to detect a voice query following the key phrase in a stream of audio via the adapted model.
-
公开(公告)号:US11062703B2
公开(公告)日:2021-07-13
申请号:US16106852
申请日:2018-08-21
Applicant: Intel Corporation
Inventor: Josef Bauer , Tobias Bocklet , Joachim Hofer , Munir Georges
Abstract: An automatic speech recognition (ASR) system includes a memory configured to store a filler model. The filler model includes one or more phonetic strings corresponding to one or more portions of a wake up phrase. The ASR system also includes one or more processors operatively coupled to the memory and configured to analyze a speech signal with the filler model to determine whether the speech signal includes the wake up phrase or any portion of the wake up phrase. The one or more processors are also configured to generate, based on the analysis, a hypothesis of underlying speech included in the speech signal. The hypothesis excludes the wake up phrase or any portion of the wake up phrase included in the speech signal.
-
公开(公告)号:US20210082429A1
公开(公告)日:2021-03-18
申请号:US17092737
申请日:2020-11-09
Applicant: Intel Corporation
Inventor: Jacek Ossowski , Tobias Bocklet , Kuba Lopatka
Abstract: Techniques related to a method and system of audio false keyphrase rejection using speaker recognition are described herein. Such techniques use speaker recognition of a computer originated voice to omit actions triggered when a keyphrase is present in captured audio and omitted when speech of the captured audio was spoken by the computer originated voice.
-
公开(公告)号:US10714122B2
公开(公告)日:2020-07-14
申请号:US16001496
申请日:2018-06-06
Applicant: Intel Corporation
Inventor: Maciej Muchlinski , Tobias Bocklet
IPC: G10L25/78 , G10L25/84 , G10L15/02 , G10L15/22 , G10L25/87 , G10L15/06 , G10L15/08 , G10L15/14 , G10L15/16
Abstract: Speech or non-speech detection techniques are discussed and include updating a speech pattern model using probability scores from an acoustic model to generate a score for each state of the speech pattern model, such that the speech pattern model includes a first non-speech state having multiple self loops each associated with a non-speech probability score of the probability scores, a plurality of speech states following the first non-speech state, and a second non-speech state following the speech states, and detecting speech based on a comparison of a score of the first non-speech state and a score of the last speech state of the multiple speech states.
-
5.
公开(公告)号:US20190043477A1
公开(公告)日:2019-02-07
申请号:US16022376
申请日:2018-06-28
Applicant: Intel Corporation
Inventor: Suyoung Bang , Muhammad Khellah , Somnath Paul , Charles Augustine , Turbo Majumder , Wootaek Lim , Tobias Bocklet , David Pearce
IPC: G10L15/02
Abstract: A system, article, and method provide temporal-domain feature extraction for automatic speech recognition.
-
6.
公开(公告)号:US20160365096A1
公开(公告)日:2016-12-15
申请号:US15121004
申请日:2014-03-28
Applicant: INTEL CORPORATION
Inventor: Tobias Bocklet , Adam Marek
Abstract: Various systems, apparatuses, and methods for training classifiers using selected cohort sample subsets are disclosed herein, in an example, a set of target supervectors, representing a target class, is received, and a set of cohort supervectors, representing a cohort class, is received. A distance metric is calculated from a respective cohort supervector to a respective target supervector, and a proper subset of cohort supervectors are selected based on the calculated distance metrics. The set of target supervectors and the selected proper subset of cohort supervectors are used to train a classifier. Further examples described herein describe how training classifiers using selected cohort sample subsets may be used to increase performance and decrease resource consumption in voice biometric systems.
Abstract translation: 在本文中公开了使用所选择的队列样本子集来训练分类器的各种系统,装置和方法,在一个示例中,接收到表示目标类的一组目标超向量,并且代表队列类的一组队列超向量是 收到了 从相应的队列超向量到相应的目标超向量计算距离度量,并且基于所计算的距离度量来选择队列超级向量的适当子集。 目标超级队员和选定的队列超级队员的子集用于训练分类器。 本文描述的进一步示例描述如何使用选择的队列样本子集的训练分类器可以用于增加语音生物测定系统中的性能并降低资源消耗。
-
公开(公告)号:US20240185851A1
公开(公告)日:2024-06-06
申请号:US18367180
申请日:2023-09-12
Applicant: Intel Corporation
Inventor: Jacek Ossowski , Tobias Bocklet , Kuba Lopatka
CPC classification number: G10L15/22 , G10L15/08 , G10L2015/088 , G10L2015/223 , G10L2015/225 , G10L17/00
Abstract: Techniques related to a method and system of audio false keyphrase rejection using speaker recognition are described herein. Such techniques use speaker recognition of a computer originated voice to omit actions triggered when a keyphrase is present in captured audio and omitted when speech of the captured audio was spoken by the computer originated voice.
-
公开(公告)号:US11216724B2
公开(公告)日:2022-01-04
申请号:US15834838
申请日:2017-12-07
Applicant: INTEL CORPORATION
Inventor: Kuba Lopatka , Tobias Bocklet , Mateusz Kotarski
Abstract: Techniques are provided for acoustic event detection. A methodology implementing the techniques according to an embodiment includes extracting acoustic features from a received audio signal. The acoustic features may include, for example, one or more short-term Fourier transform frames, or other spectral energy characteristics, of the audio signal. The method also includes applying a trained classifier to the extracted acoustic features to identify and label acoustic event subparts of the audio signal and to generate scores associated with the subparts. The method further includes performing sequence decoding of the acoustic event subparts and associated scores to detect target acoustic events of interest based on the scores and temporal ordering sequence of the event subparts. The classifier is trained on acoustic event subparts that are generated through unsupervised subspace clustering techniques applied to training data that includes target acoustic events.
-
公开(公告)号:US20200294493A1
公开(公告)日:2020-09-17
申请号:US16892080
申请日:2020-06-03
Applicant: Intel Corporation
Inventor: Marcin Terpilowski , Tomasz Dorau , Tobias Bocklet
IPC: G10L15/16
Abstract: A method, system, and device are directed to audio input bit-size conversion for compatibility to audio processing systems with an expected input sample bit-size.
-
公开(公告)号:US10747231B2
公开(公告)日:2020-08-18
申请号:US15816835
申请日:2017-11-17
Applicant: Intel Corporation
Inventor: Sarang Akotkar , Mithil Ramteke , Tobias Bocklet , Sivasubramanian Sundaram
IPC: G05D1/02 , G10L25/30 , G10L25/51 , G10L21/038 , G05D1/00 , H04R3/00 , G06N3/08 , H04R1/40 , G10L25/24 , G06N3/04
Abstract: Embodiments include apparatuses, systems, and methods for a computer-aided or autonomous driving (CA/AD) system to identify and respond to an audio signal, e.g., an emergency alarm signal. In embodiments, the CA/AD driving system may include a plurality of microphones disposed to capture the audio signal included in surrounding sounds to a semi-autonomous or autonomous (SA/AD) vehicle. In embodiments, an audio analysis unit may receive the audio signal to extract audio features from the audio signal. In embodiments, a neural network such as a Deep Neural Network (DNN) may receive the extracted audio features from the audio analysis unit and to generate a probability score to allow identification of the audio signal. In embodiments, the CA/AD driving system may control driving elements of the SA/AD vehicle to autonomously or semi-autonomously drive the SA/AD vehicle in response to the identification. Other embodiments may also be described and claimed.
-
-
-
-
-
-
-
-
-