Method for Learned Image Compression and Related Autoencoder

    公开(公告)号:US20250088648A1

    公开(公告)日:2025-03-13

    申请号:US18758390

    申请日:2024-06-28

    Abstract: A method for learned image compression implemented in an autoencoder including a learnable encoder and a decoder, the method including: a) extracting from an image a latent space by the learnable encoder; b) quantizing the latent space by a quantizer to obtain a quantized latent space; c) entropy coding the quantized latent space by an entropy encoder to obtain a bitstream, wherein an entropy model used to encode the latent space is represented by a probability distribution; d) entropy decoding the bitstream by an entropy decoder to obtain an entropy decoded bitstream; e) feeding the entropy decoded bitstream to the decoder; f) recover a reconstructed image by the decoder; g) training the autoencoder via standard gradient descent of the backpropagated error gradient by finding learnable parameters of the learnable encoder and of the decoder that minimize a rate distortion cost function, wherein the entropy encoder is based on a differentiable formulation of a soft frequency counter.

    Method and apparatus for encoding and decoding digital images or video streams

    公开(公告)号:US11432012B2

    公开(公告)日:2022-08-30

    申请号:US16488851

    申请日:2018-03-02

    Abstract: A method for encoding digital images or video streams, includes a receiving phase, wherein a portion of an image is received; a graph weights prediction phase, wherein the elements of a weights matrix associated to the graph related to the blocks of the image (predicted blocks) are predicted on the basis of reconstructed, de-quantized and inverse-transformed pixel values of at least one previously coded block (predictor block) of the image, the weights matrix being a matrix comprising elements denoting the level of similarity between a pair of pixels composing said image, a graph transform computation phase, wherein the graph Fourier transform of the blocks of the image is performed, obtaining for the blocks a set of coefficients determined on the basis of the predicted weights; a coefficients quantization phase, wherein the coefficients are quantized an output phase wherein a bitstream comprising the transformed and quantized coefficients is transmitted and/or stored.

    Apparatuses and methods for encoding and decoding images

    公开(公告)号:US11012692B2

    公开(公告)日:2021-05-18

    申请号:US15768450

    申请日:2016-10-13

    Abstract: A method and an apparatus are provided for encoding and/or decoding digital images or video streams. The encoding apparatus includes a processor configured for reading at least a portion of an image (f), determining a rotation angle (θ) on the basis of the portion of the image (f), determining rotated transform matrix (V′) which is the result of a rotation of at least one basis vector of a Discrete Cosine Transform matrix (V) by the rotation angle (θ), computing transformed coefficients (f{circumflex over ( )}) on the basis of the pixel values contained in the portion of the image (f) and the rotated transform information (V′), outputting the transformed coefficients (f{circumflex over ( )}) to the destination.

    Methods and apparatuses for encoding and decoding superpixel borders

    公开(公告)号:US10708601B2

    公开(公告)日:2020-07-07

    申请号:US16488879

    申请日:2018-03-02

    Abstract: The present invention relates to a method for encoding the borders of pixel regions of an image, wherein the borders contain a sequence of vertices subdividing the image into regions of pixels (superpixels), by generating a sequence of symbols from an alphabet including the step of: defining for each superpixel a first vertex for coding the borders of the superpixel according to a criterion common to all superpixels; defining for each superpixel the same coding order of the border vertices, either clockwise or counter-clockwise; defining the order for coding the superpixels on the base of a common rule depending on the relative positions of the first vertices; defining a set of vertices as a known border, wherein the following steps are performed for selecting a symbol of the alphabet, for encoding the borders of the superpixels: a) determining the first vertex of the next superpixel border individuated by the common criterion; b) determining the next vertex to be encoded on the basis of the coding direction; c) selecting a first symbol (“0”) for encoding the next vertex if the next vertex of a border pertains to the known border, d) selecting a symbol (“1”; “2”) different from the first symbol (“0”) if the next vertex is not in the known border; e) repeating steps b), c), d) and e) until all vertices of the superpixel border have been encoded; f) adding each vertex of the superpixel border that was not in the known border to the set; g) determining the next superpixel whose border is to be encoded according to the common rule, if any; i) repeating steps a)-g) until the borders of all the superpixels of the image have being added to the known border.

    Methods and Apparatuses for Encoding and Decoding Superpixel Borders

    公开(公告)号:US20200014937A1

    公开(公告)日:2020-01-09

    申请号:US16488879

    申请日:2018-03-02

    Abstract: The present invention relates to a method for encoding the borders of pixel regions of an image, wherein the borders contain a sequence of vertices subdividing the image into regions of pixels (superpixels), by generating a sequence of symbols from an alphabet including the step of: defining for each superpixel a first vertex for coding the borders of the superpixel according to a criterion common to all superpixels; defining for each superpixel the same coding order of the border vertices, either clockwise or counter-clockwise; defining the order for coding the superpixels on the base of a common rule depending on the relative positions of the first vertices; defining a set of vertices as a known border, wherein the following steps are performed for selecting a symbol of the alphabet, for encoding the borders of the superpixels: a) determining the first vertex of the next superpixel border individuated by the common criterion; b) determining the next vertex to be encoded on the basis of the coding direction; c) selecting a first symbol (“0”) for encoding the next vertex if the next vertex of a border pertains to the known border, d) selecting a symbol (“1”; “2”) different from the first symbol (“0”) if the next vertex is not in the known border; e) repeating steps b), c), d) and e) until all vertices of the superpixel border have been encoded; f) adding each vertex of the superpixel border that was not in the known border to the set; g) determining the next superpixel whose border is to be encoded according to the common rule, if any; i) repeating steps a)-g) until the borders of all the superpixels of the image have being added to the known border.

    Method for managing a map of a personal navigation device and device thereof

    公开(公告)号:US10175053B2

    公开(公告)日:2019-01-08

    申请号:US14346245

    申请日:2012-09-21

    Inventor: Saverio Celia

    Abstract: A method for managing a map (140) stored in a memory (130) associated with a personal navigation device (10), including a command and data entry unit (115) operable by a user, wherein the map (140) can be visualized on a display (115) and includes a road network, the method including the step of combining a first set of data about the road network, stored in the map (140), with a second set of data about at least one road not present in the road network or defined as not viable, in particular a new road, wherein the second data can be entered by the user by the command and data entry unit (115).

    Methods and apparatuses for encoding and decoding digital light field images

    公开(公告)号:US10887608B2

    公开(公告)日:2021-01-05

    申请号:US16610751

    申请日:2018-05-03

    Abstract: A method for encoding a raw lenselet image includes a receiving phase, wherein at least a portion of a raw lenselet image is received, the image including a plurality of macro-pixels, each macro-pixel having pixels corresponding to a specific view angle for the same point of a scene, and an output phase, wherein a bitstream having at least a portion of an encoded lenselet image is outputted. The method has an image transform phase, wherein the pixels of said raw lenselet image are spatially displaced in a transformed multi-color image having a larger number of columns and rows with respect to the received raw lenselet image, wherein dummy pixels having undefined value are inserted into the raw lenselet image and wherein the displacement is performed so as to put the estimated center location of each macro-pixel onto integer pixel locations. Moreover, the method includes a sub-view generation phase, wherein a sequence of sub-views is generated, said sub-views having pixels of the same angular coordinates extracted from different macro-pixels of the transformed raw lenselet image. Finally, the method has a graph coding phase, wherein a bitstream is generated by encoding a graph representation of at least one of the sub-views of the sequence according to a predefined graph signal processing technique.

Patent Agency Ranking