Patent search ap:("Dolby Laboratories Licensing Corporation") AND inv:"Roy M. Fejgin" Page 1

1.

Granted invention patent
Perceptually-based loss functions for audio encoding and decoding based on machine learning (status: in force)

Publication number: US11817111B2

Publication date: 2023-11-14

Application number: US17046284

Application date: 2019-04-10

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Roy M. Fejgin, Grant A. Davidson, Chih-Wei Wu, Vivek Kumar

IPC: G10L19/022, G06F3/16, G06N3/084, G06N3/048

CPC classification number: G10L19/022, G06F3/16, G06N3/048, G06N3/084

Abstract: Computer-implemented methods for training a neural network, as well as for implementing audio encoders and decoders via trained neural networks, are provided. The neural network may receive an input audio signal, generate an encoded audio signal and decode the encoded audio signal. A loss function generating module may receive the decoded audio signal and a ground truth audio signal, and may generate a loss function value corresponding to the decoded audio signal. Generating the loss function value may involve applying a psychoacoustic model. The neural network may be trained based on the loss function value. The training may involve updating at least one weight of the neural network.
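
Illustrative sketch (not taken from the patent): the abstract describes a training loop in which a loss value is computed from the decoded signal and a ground truth signal using a psychoacoustic model, and at least one weight is then updated. The toy below assumes a one-weight "network" (a single gain `w`) and a hypothetical per-band masking threshold standing in for the psychoacoustic model; the function names, the noise-to-mask weighting, and the learning rate are all assumptions made for illustration.

```python
def loss_and_grad(w, inputs, truth, thresholds):
    """Return (loss, dloss/dw) for a one-weight toy codec: decoded = w * x.

    The loss is the mean squared error per band divided by that band's
    masking threshold, so errors in well-masked bands count for less.
    The thresholds are a hypothetical stand-in for a psychoacoustic model.
    """
    n = len(inputs)
    loss = 0.0
    grad = 0.0
    for x, t, m in zip(inputs, truth, thresholds):
        err = w * x - t              # decoded value minus ground truth
        loss += err * err / m        # perceptually weighted squared error
        grad += 2.0 * err * x / m    # derivative of that term w.r.t. w
    return loss / n, grad / n


def train_step(w, inputs, truth, thresholds, lr=0.1):
    """One gradient-descent update of the single weight."""
    _, grad = loss_and_grad(w, inputs, truth, thresholds)
    return w - lr * grad


if __name__ == "__main__":
    inputs, truth, thresholds = [1.0, 2.0], [2.0, 4.0], [1.0, 4.0]
    w = 0.0
    for _ in range(5):
        w = train_step(w, inputs, truth, thresholds)
        print(w, loss_and_grad(w, inputs, truth, thresholds)[0])
```

A real system would replace the single gain with an encoder/decoder network and compute the thresholds from the ground truth signal; the sketch only shows how a perceptual weighting enters the loss and its gradient.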

2.

Granted invention patent
Audio segmentation based on spatial metadata (status: in force)

Publication number: US10068577B2

Publication date: 2018-09-04

Application number: US15306051

Application date: 2015-04-23

Applicant: Dolby Laboratories Licensing Corporation

Inventor: Vinay Melkote, Malcolm James Law, Roy M. Fejgin

IPC: G10L19/00, G10L19/008, G10L19/20, G10L19/16

Abstract: A method of encoding adaptive audio, comprising receiving N objects and associated spatial metadata that describes the continuing motion of these objects, and partitioning the audio into segments based on the spatial metadata. The method encodes adaptive audio having objects and channel beds by capturing a continuing motion of a number N objects in a time-varying matrix trajectory comprising a sequence of matrices, coding coefficients of the time-varying matrix trajectory in spatial metadata to be transmitted via a high-definition audio format for rendering the adaptive audio through a number M output channels, and segmenting the sequence of matrices into a plurality of sub-segments based on the spatial metadata, wherein the plurality of sub-segments are configured to facilitate coding of one or more characteristics of the adaptive audio.
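
Illustrative sketch (not taken from the patent): the abstract describes splitting a sequence of rendering matrices into sub-segments based on spatial metadata, without stating the splitting rule here. The toy below assumes one possible rule: start a new segment whenever any matrix coefficient changes by more than a hypothetical `max_step` between consecutive matrices. The function name and the threshold are assumptions.

```python
def segment_matrices(matrices, max_step):
    """Split a matrix sequence into (start, end) index ranges, end exclusive.

    A new segment starts at index i when the largest absolute coefficient
    change between matrices[i - 1] and matrices[i] exceeds max_step.
    This rule is an assumption; the abstract does not specify one.
    """
    if not matrices:
        return []
    segments = []
    start = 0
    for i in range(1, len(matrices)):
        prev, cur = matrices[i - 1], matrices[i]
        step = max(
            abs(a - b)
            for row_p, row_c in zip(prev, cur)
            for a, b in zip(row_p, row_c)
        )
        if step > max_step:
            segments.append((start, i))
            start = i
    segments.append((start, len(matrices)))
    return segments


if __name__ == "__main__":
    # One output channel (M = 1) mixed from two objects (N = 2).
    trajectory = [[[1.0, 0.0]], [[0.9, 0.1]], [[0.0, 1.0]], [[0.0, 1.0]]]
    print(segment_matrices(trajectory, max_step=0.5))
```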

3.

Invention patent application
METHODS AND SYSTEM FOR WAVEFORM CODING OF AUDIO SIGNALS WITH A GENERATIVE MODEL (status: in force)

Publication number: US20220392458A1

Publication date: 2022-12-08

Application number: US17770035

Application date: 2020-10-16

Applicant: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventor: Janusz Klejsa, Arijit Biswas, Lars Villemoes, Roy M. Fejgin, Cong Zhou

IPC: G10L19/00

Abstract: Described herein is a method of waveform decoding, the method including the steps of: (a) receiving, by a waveform decoder, a bitstream including a finite bitrate representation of a source signal; (b) waveform decoding the finite bitrate representation of the source signal to obtain a waveform approximation of the source signal; (c) providing the waveform approximation of the source signal to a generative model that implements a probability density function, to obtain a probability distribution for a reconstructed signal of the source signal; and (d) generating the reconstructed signal of the source signal based on the probability distribution. Described are further a method and system for waveform coding and a method of training a generative model.
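
Illustrative sketch (not taken from the patent): steps (a) to (d) of the abstract can be pictured as a three-stage pipeline. The toy below assumes a uniform scalar quantizer as the "finite bitrate representation" and a per-sample Gaussian centred on the waveform approximation as the generative model's probability density; both choices, and all function names, are assumptions made for illustration.

```python
import random


def waveform_decode(indices, step):
    """Steps (a)-(b): map received quantizer indices to a waveform approximation."""
    return [i * step for i in indices]


def generative_model(approximation, sigma):
    """Step (c): return a (mean, std) pair per sample, conditioned on the
    waveform approximation. A fixed-width Gaussian is a toy stand-in for a
    learned conditional density."""
    return [(x, sigma) for x in approximation]


def reconstruct(distribution, rng):
    """Step (d): draw the reconstructed signal from the per-sample distribution."""
    return [rng.gauss(mean, std) for mean, std in distribution]


if __name__ == "__main__":
    rng = random.Random(0)  # seeded for repeatability
    approx = waveform_decode([1, -2, 3], step=0.5)
    dist = generative_model(approx, sigma=0.05)
    print(approx)
    print(reconstruct(dist, rng))
```

With `sigma` set to zero the reconstruction collapses to the waveform approximation itself, which makes the role of the generative stage easy to isolate.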

4.

Granted invention patent
Audio discontinuity detection and correction (status: in force)

Publication number: US11183202B2

Publication date: 2021-11-23

Application number: US15745824

Application date: 2016-07-26

Applicant: DOLBY LABORATORIES LICENSING CORPORATION

Inventor: Roy M. Fejgin, Freddie Sanchez, Vinay Melkote, Michael Ward

IPC: G10L25/48, G11B27/034, G11B27/038, G11B27/10, G10L25/18

Abstract: Methods for detecting whether a rendered version of a specified seamless connection (“SSC”) at a connection point between two audio segment sequences results in an audible discontinuity, and methods for analyzing at least one SSC between audio segment sequences to determine whether a renderable version of each SSC would have an audible discontinuity at the connection point when rendered, and in appropriate cases, for a SSC having a renderable version which is determined to have an audible discontinuity when rendered, correcting at least one audio segment of at least one segment sequence to be connected in accordance with the SSC in an effort to ensure that rendering of the SSC will result in seamless connection without an audible discontinuity. Other aspects are editing systems configured to implement any of the methods, and storage media and rendering systems which store audio data generated in accordance with any of the methods.
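
Illustrative sketch (not taken from the patent): the abstract describes a detect-then-correct workflow at the connection point between two audio segment sequences, without stating the detection criterion here. The toy below assumes a crude stand-in for audibility (the sample jump across the join exceeding a hypothetical threshold) and corrects by ramping the start of the incoming segment from the last outgoing sample. Function names, the threshold, and the linear ramp are assumptions.

```python
def has_discontinuity(outgoing, incoming, threshold):
    """Flag the join if the jump between the last outgoing sample and the
    first incoming sample exceeds the threshold (a crude audibility proxy)."""
    return abs(incoming[0] - outgoing[-1]) > threshold


def smooth_join(outgoing, incoming, fade_len):
    """Return a corrected copy of `incoming` whose first samples ramp
    linearly from the last outgoing sample toward the original values."""
    anchor = outgoing[-1]
    corrected = list(incoming)
    n = min(fade_len, len(corrected))
    for i in range(n):
        weight = (i + 1) / (n + 1)  # share of the original sample, rising toward 1
        corrected[i] = (1.0 - weight) * anchor + weight * incoming[i]
    return corrected


if __name__ == "__main__":
    out_seg = [0.0, 0.1, 0.2]
    in_seg = [1.0, 1.0, 1.0, 1.0]
    if has_discontinuity(out_seg, in_seg, threshold=0.3):
        in_seg = smooth_join(out_seg, in_seg, fade_len=3)
    print(in_seg)
```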

5.

Granted invention patent
Method and system for inter-channel coding (status: in force)

Publication number: US10553224B2

Publication date: 2020-02-04

Application number: US16150112

Application date: 2018-10-02

Applicant: Dolby Laboratories Licensing Corporation, DOLBY INTERNATIONAL AB

Inventor: Janusz Klejsa, Roy M. Fejgin, Mark S. Vinton

IPC: G10L19/008, G10L19/00

Abstract: A method for performing inter-channel encoding of a multi-channel audio signal comprising channel signals for N channels, with N being an integer, with N>1, is described. The method comprises determining a basic graph comprising the N channels as nodes and comprising directed edges between at least some of the N channels. Furthermore, the method comprises determining an inter-channel coding graph from the basic graph, such that the inter-channel coding graph is a directed acyclic graph, and such that a cumulated cost of the signals of the nodes of the inter-channel coding graph is reduced.
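
Illustrative sketch (not taken from the patent): the abstract describes deriving a directed acyclic inter-channel coding graph from a basic graph so that a cumulated cost is reduced, without stating the selection procedure here. The toy below assumes a greedy procedure: each candidate edge carries a hypothetical `saving` (the cost reduction from predicting the destination channel from the source channel), edges are tried in order of decreasing saving, and an edge is dropped if it would close a cycle. Limiting each channel to one predictor is a further assumption.

```python
def build_coding_graph(n_channels, candidate_edges):
    """Greedily pick edges (src, dst, saving) that keep the graph acyclic.

    Returns the chosen edges in the order they were accepted. The greedy
    rule and the one-predictor-per-channel limit are assumptions.
    """
    adjacency = {c: set() for c in range(n_channels)}
    has_predictor = set()
    chosen = []

    def reaches(start, target):
        """Depth-first search: is `target` reachable from `start`?"""
        stack, seen = [start], set()
        while stack:
            node = stack.pop()
            if node == target:
                return True
            if node in seen:
                continue
            seen.add(node)
            stack.extend(adjacency[node])
        return False

    for src, dst, saving in sorted(candidate_edges, key=lambda e: -e[2]):
        if saving <= 0 or dst in has_predictor:
            continue  # no cost reduction, or channel already predicted
        if reaches(dst, src):
            continue  # adding src -> dst would close a cycle
        adjacency[src].add(dst)
        has_predictor.add(dst)
        chosen.append((src, dst, saving))
    return chosen


if __name__ == "__main__":
    basic_graph = [(0, 1, 5.0), (1, 0, 4.0), (1, 2, 3.0), (2, 0, 2.0), (0, 2, -1.0)]
    print(build_coding_graph(3, basic_graph))
```

A greedy pass does not guarantee the lowest possible cumulated cost; it only illustrates how the acyclicity constraint interacts with cost-driven edge selection.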
