Classifying audio scene using synthetic image features

Invention Grant

US11657833B2 Classifying audio scene using synthetic image features 有权

Please log in to see more content

Patent Title: Classifying audio scene using synthetic image features
Application No.: US17452306

Application Date: 2021-10-26
Publication No.: US11657833B2

Publication Date: 2023-05-23
Inventor: Eric Chris Wolfgang Sommerlade , Yang Liu , Alexandros Neofytou , Sunando Sengupta
Applicant: Microsoft Technology Licensing, LLC
Applicant Address: US WA Redmond
Assignee: Microsoft Technology Licensing, LLC
Current Assignee: Microsoft Technology Licensing, LLC
Current Assignee Address: US WA Redmond
Agency: Alleman Hall Creasman & Tuttle LLP
Main IPC: H04N5/272
IPC: H04N5/272 ; H04N7/14 ; G06F18/214 ; G06F18/241 ; G06V10/764 ; G10L25/51 ; G06V10/82 ; G06V10/44 ; G06V20/00

Classifying audio scene using synthetic image features

Abstract:

A computing system includes an encoder that receives an input image and encodes the input image into real image features, a decoder that decodes the real image features into a reconstructed image, a generator that receives first audio data corresponding to the input image and generates first synthetic image features from the first audio data, and receives second audio data and generates second synthetic image features from the second audio data, a discriminator that receives both the real and synthetic image features and determines whether a target feature is real or synthetic, and a classifier that classifies a scene of the second audio data based on the second synthetic image features.

Information query

Espacenet

IPC分类:

H	电学
H04	电通信技术
H04N	图像通信，如电视
H04N5/00	电视系统的零部件（扫描部件或其与供电电压产生的组合入H04N3/00）
H04N5/222	.电视演播室线路；电视演播室装置；电视演播室设备
H04N5/262	..电视演播室线路，例如用于混合、开关、转换、改变图像特性及其他特殊效果
H04N5/272	...在背景图像中插入前景图像的方法，即镶入、删除