Apparatus for generating relations between feature amounts of audio and scene types and method therefor

Invention Grant

US11011187B2 Apparatus for generating relations between feature amounts of audio and scene types and method therefor 有权

Please log in to see more content

Patent Title: Apparatus for generating relations between feature amounts of audio and scene types and method therefor
Application No.: US16920002

Application Date: 2020-07-02
Publication No.: US11011187B2

Publication Date: 2021-05-18
Inventor: Yuta Yuyama , Keita Arimoto
Applicant: Yamaha Corporation
Applicant Address: JP Hamamatsu
Assignee: Yamaha Corporation
Current Assignee: Yamaha Corporation
Current Assignee Address: JP Hamamatsu
Agency: Crowell & Moring LLP
Priority: JPJP2017-035367 20170227
Main IPC: G10L25/48
IPC: G10L25/48 ; G06N20/00 ; G06F3/16 ; G06F16/60 ; G10L25/57 ; G06F16/61 ; G06N7/00 ; G10L25/27

Apparatus for generating relations between feature amounts of audio and scene types and method therefor

Abstract:

An apparatus for generating relations between feature amounts of audio and scene type includes at least one processor and a memory. The memory is operatively coupled to the at least one processor. The processor is configured to set one of the scene types to each of clusters classifying the feature amounts of audio in one or more pieces of content. The processor is also configured to generate a plurality of pieces of learning data, each representative of a feature amount, from among the feature amounts of the audio, that belongs to each cluster and the scene type set for each cluster. The processor is also configured to generate an identification model representative of relations between the feature amounts of audio and the scene types by performing machine learning using the plurality of pieces of learning data.

Public/Granted literature

US20200335127A1 Apparatus for Generating Relations Between Feature Amounts of Audio and Scene Types and Method Therefor Public/Granted day:2020-10-22

Information query

Espacenet

IPC分类:

G	物理
G10	乐器；声学
G10L	语音分析或合成；语音识别；语音或声音处理；语音或音频编码或解码
G10L25/00	不限于组G10L 15/00-G10L 21/00的语言或者声音分析技术(当利用语音检测器来感知一些信号特殊特征的基于半导体的静噪放大器，如无信号时的感知入H03G3/34)
G10L25/48	.专门适用于特定用途