Invention Grant
- Patent Title: Audio identification based on data structure
-
Application No.: US15698532Application Date: 2017-09-07
-
Publication No.: US10713296B2Publication Date: 2020-07-14
- Inventor: Zafar Rafii , Prem Seetharaman
- Applicant: Gracenote, Inc.
- Applicant Address: US CA Emeryville
- Assignee: GRACENOTE, INC.
- Current Assignee: GRACENOTE, INC.
- Current Assignee Address: US CA Emeryville
- Agency: Hanley, Flight & Zimmerman, LLC
- Main IPC: G06F16/68
- IPC: G06F16/68 ; G10L25/27 ; G10L25/51 ; G06F16/61 ; G06F17/14

Abstract:
Example systems and methods represent audio using a sequence of two-dimensional (2D) Fourier transforms (2DFTs), and such a sequence may be used by a specially configured machine to perform audio identification, such as for cover song identification. Such systems and methods are robust to timbral changes, time skews, and pitch skews. In particular, a special data structure provides a time-series representation of audio, and this time-series representation is robust to key changes, timbral changes, and small local tempo deviations. Accordingly, the systems and methods described herein analyze cross-similarity between these time-series representations. In some example embodiments, such systems and methods extract features from an audio fingerprint and calculate a distance measure that is robust and invariant to changes in musical structure.
Public/Granted literature
- US20180075140A1 AUDIO IDENTIFICATION BASED ON DATA STRUCTURE Public/Granted day:2018-03-15
Information query