Invention Grant
- Patent Title: Audio event detection with window-based prediction
-
Application No.: US17647318Application Date: 2022-01-06
-
Publication No.: US11948599B2Publication Date: 2024-04-02
- Inventor: Lihi Ahuva Shiloh Perl , Ben Fishman , Gilad Pundak , Yonit Hoffman
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
- Current Assignee: MICROSOFT TECHNOLOGY LICENSING, LLC
- Current Assignee Address: US WA Redmond
- Agency: Newport IP, LLC
- Agent Jacob P. Rohwer
- Main IPC: G10L25/93
- IPC: G10L25/93 ; G06N3/048 ; G06N3/08 ; G10L25/45

Abstract:
A computing system for a plurality of classes of audio events is provided, including one or more processors configured to divide a run-time audio signal into a plurality of segments and process each segment of the run-time audio signal in a time domain to generate a normalized time domain representation of each segment. The processor is further configured to feed the normalized time domain representation of each segment to an input layer of a trained neural network. The processor is further configured to generate, by the neural network, a plurality of predicted classification scores and associated probabilities for each class of audio event contained in each segment of the run-time input audio signal. In post-processing, the processor is further configured to generate smoothed predicted classification scores, associated smoothed probabilities, and class window confidence values for each class for each of a plurality of candidate window sizes.
Public/Granted literature
- US20230215460A1 AUDIO EVENT DETECTION WITH WINDOW-BASED PREDICTION Public/Granted day:2023-07-06
Information query