-
公开(公告)号:US20190115011A1
公开(公告)日:2019-04-18
申请号:US15786803
申请日:2017-10-18
Applicant: Intel Corporation
Inventor: Muhammad Khellah , Oren Arad , Binuraj Ravindran , Somnath Paul , Charles Augustine , Bruno Umbria Pedroni
CPC classification number: G10L15/02 , G06N3/049 , G10L15/063 , G10L15/16 , G10L25/12 , G10L25/24 , G10L25/30 , G10L2015/0635 , G10L2015/088
Abstract: An example apparatus for detecting keywords in audio includes an audio receiver to receive audio comprising a keyword to be detected. The apparatus also includes a spike transducer to convert the audio into a plurality of spikes. The apparatus further includes a spiking neural network to receive one or more of the spikes and generate a spike corresponding to a detected keyword.
-
公开(公告)号:US20190035404A1
公开(公告)日:2019-01-31
申请号:US15858763
申请日:2017-12-29
Applicant: Intel Corporation
Inventor: Douglas Gabel , Jonathan Huang , Sylvia J. Downing , Narayan Biswal , Binuraj Ravindran , Willem Beltman , Vered Bar Bracha , Ze'Ev Rivlin
Abstract: A system, method, apparatus and computer readable medium for hierarchical speech recognition resolution. The method of hierarchical speech recognition resolution on a platform includes receiving a speech stream from a microphone. The speech stream is resolved using a lowest possible level automatic speech recognition (ASR) engine of multi-level ASR engines. The selection of the lowest possible level ASR engine is based on policies defined for the platform. If resolution of the speech stream is rated less than a predetermined confidence level, the resolution of the speech stream is pushed to a next higher-level ASR engine of the multi-level ASR engines until the resolution of the speech stream meets the predetermined confidence level without violating one or more policies.
-
公开(公告)号:US10672393B2
公开(公告)日:2020-06-02
申请号:US15869899
申请日:2018-01-12
Applicant: Intel Corporation
Inventor: Ze'ev Rivlin , Vered Bar Bracha , Douglas Gabel , Jonathan Huang , Sylvia Downing , Binuraj Ravindran , Willem Beltman
IPC: G10L15/22 , G10L15/05 , G10L15/197 , G06F3/16 , G10L15/07 , G09B5/06 , G06F40/274 , G10L15/14 , G10L15/16 , G09B19/04
Abstract: A system, apparatus, method, and computer program product for a speaking aid. The system including network interface circuitry to receive speech input from a user. The speech input includes a partial sentence with a missing word or the partial sentence with a stuttered word. The system also includes a processor coupled to the network interface circuitry and one or more memory devices coupled to the processor. The one or more memory devices include instructions, that when executed by the processor, cause the system to detect a stutter or pause in the speech input, predict the stuttered word or the missing word, present a predicted word from an n-best list to the user; and if a prompt is received from the user, present a next word from the n-best list until the user speaks a correct word to replace the stutter or the pause.
-
公开(公告)号:US10403266B2
公开(公告)日:2019-09-03
申请号:US15786803
申请日:2017-10-18
Applicant: Intel Corporation
Inventor: Muhammad Khellah , Oren Arad , Binuraj Ravindran , Somnath Paul , Charles Augustine , Bruno Umbria Pedroni
Abstract: An example apparatus for detecting keywords in audio includes an audio receiver to receive audio comprising a keyword to be detected. The apparatus also includes a spike transducer to convert the audio into a plurality of spikes. The apparatus further includes a spiking neural network to receive one or more of the spikes and generate a spike corresponding to a detected keyword.
-
公开(公告)号:US20190043525A1
公开(公告)日:2019-02-07
申请号:US15869890
申请日:2018-01-12
Applicant: Intel Corporation
Inventor: Jonathan Huang , Willem Beltman , Vered Bar Bracha , Ze'ev Rivlin , Douglas Gabel , Sylvia Downing , Narayan Biswal , Binuraj Ravindran
Abstract: A system, apparatus, method, and computer readable medium for using an audio trigger for surveillance in a security system. The method including receiving an audio input stream via a microphone. Dividing the audio input stream into audio segments. Filtering high energy audio segments from the audio segments. If a high energy audio segment includes speech, then determining if the speech is recognized as the speech of users of the system. If the high energy audio segment does not include the speech, then classifying the high energy audio segment as an interesting sound or an uninteresting sound. Determining whether to turn video on based on classification of the high energy audio segment as the interesting sound, speech recognition of the speech as the speech of the users of the system, and contextual data.
-
公开(公告)号:US20190043490A1
公开(公告)日:2019-02-07
申请号:US15869899
申请日:2018-01-12
Applicant: Intel Corporation
Inventor: Ze'ev Rivlin , Vered Bar Bracha , Douglas Gabel , Jonathan Huang , Sylvia Downing , Binuraj Ravindran , Willem Beltman
IPC: G10L15/197 , G10L15/22 , G06F3/16 , G10L15/07
Abstract: A system, apparatus, method, and computer program product for a speaking aid. The system including network interface circuitry to receive speech input from a user. The speech input includes a partial sentence with a missing word or the partial sentence with a stuttered word. The system also includes a processor coupled to the network interface circuitry and one or more memory devices coupled to the processor. The one or more memory devices include instructions, that when executed by the processor, cause the system to detect a stutter or pause in the speech input, predict the stuttered word or the missing word, present a predicted word from an n-best list to the user; and if a prompt is received from the user, present a next word from the n-best list until the user speaks a correct word to replace the stutter or the pause.
-
-
-
-
-