Audio identification based on data structure

Invention Grant

US10713296B2 Audio identification based on data structure (Active)

Patent Title: Audio identification based on data structure
Application No.: US15698532

Application Date: 2017-09-07
Publication No.: US10713296B2

Publication Date: 2020-07-14
Inventor: Zafar Rafii, Prem Seetharaman
Applicant: Gracenote, Inc.
Applicant Address: US CA Emeryville
Assignee: GRACENOTE, INC.
Current Assignee: GRACENOTE, INC.
Current Assignee Address: US CA Emeryville
Agency: Hanley, Flight & Zimmerman, LLC
Main IPC: G06F16/68
IPC: G06F16/68; G10L25/27; G10L25/51; G06F16/61; G06F17/14

Abstract:

Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews. In particular, a special data structure provides a time-series representation of audio, and this time-series representation is robust to key changes, timbral changes, and small local tempo deviations. Accordingly, the systems and methods described herein analyze cross-similarity between these time-series representations. In some example embodiments, such systems and methods extract features from an audio fingerprint and calculate a distance measure that is robust and invariant to changes in musical structure.
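
The idea in the abstract can be illustrated with a minimal sketch. It is not the patented method: the input here is a random array standing in for a log-frequency time-frequency representation, and the function names (`sequence_of_2dfts`, `cross_similarity`), the patch length, the hop size, and the Euclidean distance are all assumptions chosen for illustration. The sketch relies on one well-known property: the magnitude of a 2D Fourier transform is unchanged by circular shifts of its input, which is why a sequence of 2DFT magnitudes tolerates shifts along the frequency axis.

```python
import numpy as np

def sequence_of_2dfts(tf, patch_len=20, hop=10):
    """Cut a (freq_bins, time_frames) array into overlapping time patches
    and return the flattened 2DFT magnitude of each patch.

    patch_len and hop are illustrative values, not taken from the patent.
    """
    n_bins, n_frames = tf.shape
    features = []
    for start in range(0, n_frames - patch_len + 1, hop):
        patch = tf[:, start:start + patch_len]
        # Keeping only the magnitude discards phase, which is where
        # circular shifts of the patch are encoded.
        magnitude = np.abs(np.fft.fft2(patch))
        features.append(magnitude.ravel())
    return np.array(features)

def cross_similarity(seq_a, seq_b):
    """Pairwise Euclidean distances between two feature sequences
    (an assumed distance; smaller means more similar)."""
    diff = seq_a[:, None, :] - seq_b[None, :, :]
    return np.sqrt((diff ** 2).sum(axis=-1))

# Synthetic stand-in for a time-frequency representation of a recording.
rng = np.random.default_rng(0)
query = rng.random((24, 60))

# Simulate a pitch skew as a circular shift along the frequency axis.
shifted = np.roll(query, 3, axis=0)

seq_q = sequence_of_2dfts(query)
seq_s = sequence_of_2dfts(shifted)
dist = cross_similarity(seq_q, seq_s)
```

In this toy setup, the diagonal of `dist` is numerically zero because each shifted patch has the same 2DFT magnitude as its unshifted counterpart, while entries comparing different patches are not. Real audio would involve non-circular shifts and tempo changes, so the invariance would only be approximate.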

Public/Granted literature

US20180075140A1 AUDIO IDENTIFICATION BASED ON DATA STRUCTURE Publication/Grant Date: 2018-03-15

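IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06F	Electric digital data processing (computer systems based on specific computational models: see G06N)
G06F16/00	Information retrieval; database structures therefor; file system structures therefor
G06F16/60	. of audio data
G06F16/68	.. Retrieval characterised by using metadata, e.g. metadata not derived from the content or metadata generated manually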