Invention Grant
- Patent Title: Method and system for learning and using latent-space representations of audio signals for audio content-based retrieval
-
Application No.: US16942410Application Date: 2020-07-29
-
Publication No.: US11670322B2Publication Date: 2023-06-06
- Inventor: Alejandro Koretzky , Naveen Sasalu Rajashekharappa
- Applicant: Distributed Creation Inc.
- Applicant Address: US NY New York
- Assignee: Distributed Creation Inc.
- Current Assignee: Distributed Creation Inc.
- Current Assignee Address: US NY New York
- Agency: Nicholson De Vos Webster & Elliott LLP
- Main IPC: G10L25/54
- IPC: G10L25/54 ; G06F16/65 ; G06F3/16 ; G06N3/08 ; G10L21/12 ; G10L21/14 ; G10L25/30 ; G06F18/214

Abstract:
A method and system are provided for extracting features from digital audio signals which exhibit variations in pitch, timbre, decay, reverberation, and other psychoacoustic attributes and learning, from the extracted features, an artificial neural network model for generating contextual latent-space representations of digital audio signals. A method and system are also provided for learning an artificial neural network model for generating consistent latent-space representations of digital audio signals in which the generated latent-space representations are comparable for the purposes of determining psychoacoustic similarity between digital audio signals. A method and system are also provided for extracting features from digital audio signals and learning, from the extracted features, an artificial neural network model for generating latent-space representations of digital audio signals which take care of selecting salient attributes of the signals that represent psychoacoustic differences between the signals.
Public/Granted literature
Information query