Invention Grant
- Patent Title: Classifying audio scene using synthetic image features
- Application No.: US17452306
- Application Date: 2021-10-26
- Publication No.: US11657833B2
- Publication Date: 2023-05-23
- Inventor: Eric Chris Wolfgang Sommerlade, Yang Liu, Alexandros Neofytou, Sunando Sengupta
- Applicant: Microsoft Technology Licensing, LLC
- Applicant Address: US WA Redmond
- Assignee: Microsoft Technology Licensing, LLC
- Current Assignee: Microsoft Technology Licensing, LLC
- Current Assignee Address: US WA Redmond
- Agency: Alleman Hall Creasman & Tuttle LLP
- Main IPC: H04N5/272
- IPC: H04N5/272 ; H04N7/14 ; G06F18/214 ; G06F18/241 ; G06V10/764 ; G10L25/51 ; G06V10/82 ; G06V10/44 ; G06V20/00

Abstract:
A computing system includes an encoder that receives an input image and encodes the input image into real image features; a decoder that decodes the real image features into a reconstructed image; a generator that receives first audio data corresponding to the input image and generates first synthetic image features from the first audio data, and that receives second audio data and generates second synthetic image features from the second audio data; a discriminator that receives both the real and synthetic image features and determines whether a target feature is real or synthetic; and a classifier that classifies a scene of the second audio data based on the second synthetic image features.
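
The data flow between the five components named in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration: the vector sizes, the single random linear map standing in for each component, and the function names are hypothetical and are not taken from the patent. The weights are untrained, and no adversarial or reconstruction training is shown.

```python
import math
import random

random.seed(0)

# Hypothetical sizes, chosen only for illustration.
IMAGE_DIM, AUDIO_DIM, FEATURE_DIM, NUM_SCENES = 16, 8, 4, 3


def linear(n_in, n_out):
    """Return a random weight matrix with n_out rows and n_in columns."""
    return [[random.uniform(-0.1, 0.1) for _ in range(n_in)] for _ in range(n_out)]


def apply(weights, x):
    """Matrix-vector product."""
    return [sum(w * v for w, v in zip(row, x)) for row in weights]


# One untrained linear map per component (a stand-in, not the patented models).
encoder_w = linear(IMAGE_DIM, FEATURE_DIM)
decoder_w = linear(FEATURE_DIM, IMAGE_DIM)
generator_w = linear(AUDIO_DIM, FEATURE_DIM)
discriminator_w = linear(FEATURE_DIM, 1)
classifier_w = linear(FEATURE_DIM, NUM_SCENES)


def encode(image):
    """Input image -> real image features."""
    return apply(encoder_w, image)


def decode(features):
    """Real image features -> reconstructed image."""
    return apply(decoder_w, features)


def generate(audio):
    """Audio data -> synthetic image features."""
    return apply(generator_w, audio)


def discriminate(features):
    """Score in (0, 1) for how 'real' a feature vector looks."""
    return 1.0 / (1.0 + math.exp(-apply(discriminator_w, features)[0]))


def classify_scene(audio):
    """Classify the scene of audio data from its synthetic image features."""
    scores = apply(classifier_w, generate(audio))
    return max(range(NUM_SCENES), key=scores.__getitem__)


# Paired image and first audio data, then second audio data to classify.
image = [random.random() for _ in range(IMAGE_DIM)]
first_audio = [random.random() for _ in range(AUDIO_DIM)]
second_audio = [random.random() for _ in range(AUDIO_DIM)]

real_features = encode(image)
reconstructed = decode(real_features)
first_synthetic = generate(first_audio)
p_real = discriminate(real_features)
p_synthetic = discriminate(first_synthetic)
scene = classify_scene(second_audio)
```

In this sketch the discriminator sees features from both the encoder and the generator, which is what would let the generator learn to produce image-like features from audio; at classification time only the generator and classifier are used, so no image is needed for the second audio data.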