Input-aware and input-unaware iterative speech recognition

    Publication number: US12230256B2

    Publication date: 2025-02-18

    Application number: US17886859

    Filing date: 2022-08-12

    Abstract: An interactive voice response (IVR) system including iterative speech recognition with semantic interpretation is deployed to process an audio input in a manner that optimizes and conserves computing resources and facilitates low-latency discovery of start-of-speech events that can be used to support external processes such as barge-in operations. The IVR system can repeatedly receive an audio input at a speech processing component and apply an input-aware recognition process to the audio input. In response to generating a start-of-speech event, the IVR system can apply an input-unaware recognition process to the remaining audio input and determine a semantic meaning in relation to the relevant portion of the audio input.

    ITERATIVE SPEECH RECOGNITION WITH SEMANTIC INTERPRETATION

    Publication number: US20240055018A1

    Publication date: 2024-02-15

    Application number: US17819354

    Filing date: 2022-08-12

    CPC classification number: G10L25/93 G10L25/78 G10L15/26 G06F40/30 G10L2025/783

    Abstract: An interactive voice response (IVR) system including iterative speech recognition with semantic interpretation is deployed to determine when a user is finished speaking, thus saving them time and frustration. The IVR system can repeatedly receive an audio input representing a portion of human speech, transcribe the speech into text, and determine a semantic meaning of the text. If the semantic meaning corresponds to a valid input or response to the IVR system, then the IVR system can determine that the user input is complete and respond to the user after the user is silent for a predetermined time period. If the semantic meaning does not correspond to a valid input to the IVR system, the IVR system can determine that the user input is not complete and can wait for a second predetermined time period before determining that the user has finished speaking.
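The end-of-input decision described in this abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. The names `EndpointConfig`, `required_silence`, and `detect_end_of_input` are hypothetical, the timeout values are placeholders, and the sketch assumes the second waiting period is longer than the first, which the abstract does not state. The `is_valid_input` callable stands in for whatever semantic interpreter the IVR system uses.

```python
from dataclasses import dataclass
from typing import Callable, Iterable, Optional, Tuple


@dataclass
class EndpointConfig:
    # Placeholder values; the abstract only says "predetermined time period".
    complete_silence_s: float = 0.5    # wait when the input is already valid
    incomplete_silence_s: float = 2.0  # assumed longer wait when it is not


def required_silence(transcript: str,
                     is_valid_input: Callable[[str], bool],
                     config: EndpointConfig) -> float:
    """Pick the silence threshold based on the semantic check."""
    if is_valid_input(transcript):
        return config.complete_silence_s
    return config.incomplete_silence_s


def detect_end_of_input(chunks: Iterable[Tuple[str, float]],
                        is_valid_input: Callable[[str], bool],
                        config: Optional[EndpointConfig] = None) -> Optional[str]:
    """Iterate over (transcribed_text, trailing_silence_seconds) pairs.

    Returns the accumulated transcript once the silence after a chunk
    reaches the threshold for the current transcript, or None if the
    stream ends before that happens.
    """
    config = config or EndpointConfig()
    transcript = ""
    for text, silence_s in chunks:
        # Re-evaluate the whole transcript each time new speech arrives.
        transcript = (transcript + " " + text).strip()
        if silence_s >= required_silence(transcript, is_valid_input, config):
            return transcript
    return None
```

With a hypothetical grammar containing only "check my balance", the chunks `[("check", 0.6), ("my balance", 0.6)]` do not end the turn after "check", because 0.6 s of silence is below the longer threshold for an incomplete input, but they do end it after "my balance", because the full transcript is valid and the shorter threshold applies.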

    INPUT-AWARE AND INPUT-UNAWARE ITERATIVE SPEECH RECOGNITION

    Publication number: US20240054995A1

    Publication date: 2024-02-15

    Application number: US17886859

    Filing date: 2022-08-12

    CPC classification number: G10L15/1815 G10L15/05

    Abstract: An interactive voice response (IVR) system including iterative speech recognition with semantic interpretation is deployed to process an audio input in a manner that optimizes and conserves computing resources and facilitates low-latency discovery of start-of-speech events that can be used to support external processes such as barge-in operations. The IVR system can repeatedly receive an audio input at a speech processing component and apply an input-aware recognition process to the audio input. In response to generating a start-of-speech event, the IVR system can apply an input-unaware recognition process to the remaining audio input and determine a semantic meaning in relation to the relevant portion of the audio input.
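The two-phase flow in this abstract can be sketched as a small state machine. The abstract does not define what the input-aware and input-unaware processes do internally, so the Python sketch below treats them as opaque callables. Every name here (`process_audio`, `detects_speech`, `transcribe`, `interpret`, `on_start_of_speech`) is hypothetical, and including the triggering chunk in the "remaining" audio is an assumption.

```python
from typing import Callable, Iterable, List, Optional


def process_audio(chunks: Iterable[str],
                  detects_speech: Callable[[str], bool],
                  transcribe: Callable[[List[str]], str],
                  interpret: Callable[[str], Optional[str]],
                  on_start_of_speech: Callable[[], None]) -> Optional[str]:
    """Run a two-phase recognition pass over a stream of audio chunks.

    Phase 1 (stand-in for the input-aware process): check each chunk
    until a start-of-speech event is generated.
    Phase 2 (stand-in for the input-unaware process): collect the
    remaining audio, transcribe it once, and interpret the result.
    """
    speech_started = False
    remaining: List[str] = []
    for chunk in chunks:
        if not speech_started:
            if detects_speech(chunk):
                speech_started = True
                # Notify external processes right away, e.g. barge-in
                # logic that stops prompt playback.
                on_start_of_speech()
                # Assumption: keep the triggering chunk so no speech is lost.
                remaining.append(chunk)
        else:
            remaining.append(chunk)
    if not speech_started:
        return None  # no speech detected in the stream
    return interpret(transcribe(remaining))
```

Firing the callback as soon as the first phase detects speech is what lets an external process such as barge-in react with low latency, while the second phase runs only on the audio that follows.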
