-
Publication number: US20240357104A1
Publication date: 2024-10-24
Application number: US18640520
Application date: 2024-04-19
Applicant: Nokia Technologies Oy
Inventor: Honglei ZHANG , Francesco CRICRÌ , Alireza AMINLOU , Miska Matias HANNUKSELA , Nam Hai LE , Jukka Ilari AHONEN , Hamed REZAZADEGAN TAVAKOLI
IPC: H04N19/119 , H04N19/167 , H04N19/176
CPC classification number: H04N19/119 , H04N19/167 , H04N19/176
Abstract: Various embodiments describe an apparatus, a method, and a computer program product. An example apparatus includes at least one processor; and at least one memory storing instructions that, when executed by the at least one processor, cause the apparatus at least to perform: encoding an input picture by using a first encoder or first encoding parameters; encoding the input picture by using a second encoder or second encoding parameters; generating a first reconstructed picture based on the encoding of the input picture by using the first encoder or the first encoding parameters; and generating a second reconstructed picture based on the encoding of the input picture by using the second encoder or the second encoding parameters.
-
Publication number: US20240249514A1
Publication date: 2024-07-25
Application number: US18560430
Application date: 2022-05-13
Applicant: Nokia Technologies Oy
Inventor: Jani LAINEMA , Francesco CRICRÌ , Honglei ZHANG , Hamed REZAZADEGAN TAVAKOLI , Yat Hong LAM , Miska Matias HANNUKSELA , Nannan ZOU
IPC: G06V10/82 , G06V10/771 , H04N19/117 , H04N19/159 , H04N19/172 , H04N19/70 , H04N19/82
CPC classification number: G06V10/82 , G06V10/771 , H04N19/117 , H04N19/159 , H04N19/172 , H04N19/70 , H04N19/82
Abstract: Various embodiments provide an apparatus, a method, and a computer program product. The apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: train or finetune one or more additional parameters of at least one neural network (NN) or a portion of the at least one NN, wherein the one or more additional parameters comprise one or more scaling parameters; and encode or decode one or more media elements based on the at least one neural network or a portion of the at least one NN comprising the trained or finetuned one or more additional parameters.
-
Publication number: US20230209092A1
Publication date: 2023-06-29
Application number: US17996040
Application date: 2021-04-13
Applicant: Nokia Technologies Oy
IPC: H04N19/70 , H04N19/124 , H04N19/42
CPC classification number: H04N19/70 , H04N19/42 , H04N19/124
Abstract: In example embodiments, an apparatus, a method, and a computer program product are provided. The apparatus includes at least one processor; and at least one non-transitory memory including computer program code; wherein the at least one memory and the computer program code are configured to, with the at least one processor, cause the apparatus at least to perform: encode or decode a high-level bitstream syntax for at least one neural network; wherein the high-level bitstream syntax comprises at least one information unit, wherein the at least one information unit comprises syntax definitions for the at least one neural network or a portion of the at least one neural network; and wherein a serialized bitstream comprises one or more of the at least one information units.
-
Publication number: US20230164336A1
Publication date: 2023-05-25
Application number: US17917153
Application date: 2021-03-30
Applicant: Nokia Technologies OY
Inventor: Francesco CRICRI , Nam LE , Hamed REZAZADEGAN TAVAKOLI , Honglei ZHANG , Miska Matias HANNUKSELA , Emre Baris AKSU
IPC: H04N19/42 , H04N19/192 , G06N3/0455 , G06N3/084
CPC classification number: H04N19/42 , H04N19/192 , G06N3/0455 , G06N3/084
Abstract: Example embodiments provide a system for training a data coding pipeline including a feature extractor neural network, an encoder neural network, and a decoder neural network configured to reconstruct input data based on encoded features. A plurality of losses corresponding to different tasks may be determined for the coding pipeline. Tasks may be performed based on an output of the coding pipeline. A weight update may be determined for at least a subset of the coding pipeline based on the plurality of losses. The weight update may be configured to reduce a number of iterations for fine-tuning the coding pipeline for one of the tasks. This enables faster adaptation of the coding pipeline for one of the tasks after deployment of the coding pipeline. Apparatuses, methods, and computer programs are disclosed.
-
Publication number: US20230072093A1
Publication date: 2023-03-09
Application number: US17795631
Application date: 2021-01-21
Applicant: Nokia Technologies Oy
Inventor: Ari HOURUNRANTA , Miska Matias HANNUKSELA , Emre Baris AKSU , Saba AHSAN
IPC: H04N21/218 , H04N21/472 , H04N21/81
Abstract: The embodiments relate to a method including determining a foreground area covering a viewport of 360-degree video and one or more other areas of 360-degree video, not containing the foreground area in its entirety; concluding a first set of tile streams among available tile streams of the 360-degree video to cover the foreground area; concluding a second set of tile streams among the available tile streams of the 360-degree video, to cover the one or more other areas; and requesting transmission of a first set of portions of the first set of tile streams and a second set of portions of the second set of tile streams, wherein the portions in the first set of portions have a shorter duration than portions in the second set of portions.
-
Publication number: US20220256227A1
Publication date: 2022-08-11
Application number: US17649915
Application date: 2022-02-03
Applicant: Nokia Technologies Oy
IPC: H04N21/435 , H04N21/44
Abstract: An example method is provided to include receiving a media bitstream comprising one or more media units and a first enhancement information message, wherein the first enhancement information message comprises at least two independently parsable structures, a first independently parsable structure comprising information about at least one purpose of one or more neural networks (NNs) to be applied to the one or more media units, and a second independently parsable structure comprising or identifying one or more neural networks; decoding the one or more media units; and using the one or more neural networks to enhance or filter one or more frames of the decoded one or more media units, based on the at least one purpose. Corresponding apparatuses and computer program products are also provided.
-
Publication number: US20220247990A1
Publication date: 2022-08-04
Application number: US17726194
Application date: 2022-04-21
Applicant: Nokia Technologies Oy
Inventor: Kashyap Kammachi Sreedhar , Igor Danilo Diego CURCIO , Miska Matias HANNUKSELA , Sujeet Shyamsundar MATE , Emre Baris AKSU
IPC: H04N13/172 , H04N13/161
Abstract: A method includes generating a bitstream defining a presentation, the presentation comprising an omnidirectional visual media content and a first visual media component and a second visual media component; indicating in the bitstream a first presentation timeline and a second presentation timeline; and indicating in the bitstream a switching mode with respect to the first presentation timeline associated with the first visual media component, or with respect to the second presentation timeline associated with the second visual media component, the switching mode being indicated dependent on a viewpoint of a user; wherein the switching mode provides an indication of switching to the first visual media component or to the second visual media component, the first visual media component corresponding to content captured from a first omnidirectional camera in a first location, and the second visual media component corresponding to content captured from a second omnidirectional camera in a second location.
-
Publication number: US20220086480A1
Publication date: 2022-03-17
Application number: US17532424
Application date: 2021-11-22
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Miska Matias HANNUKSELA
Abstract: There are disclosed various methods, apparatuses and computer program products for video encoding and decoding. In some embodiments a method comprises at least one of the following: encoding into a bitstream an indication that motion fields are stored, but only for inter-layer motion prediction; encoding into a bitstream an indication on a limited scope of motion field usage; encoding into a bitstream an indication whether or not to use the motion field for prediction; encoding into a bitstream an indication of storage parameters for storing motion information.
-
Publication number: US20190208222A1
Publication date: 2019-07-04
Application number: US16298600
Application date: 2019-03-11
Applicant: NOKIA TECHNOLOGIES OY
Inventor: Kemal UGUR , Mehmet Oguz BICI , Miska Matias HANNUKSELA
IPC: H04N19/52 , H04N19/159 , H04N19/46 , H04N19/593 , H04N19/172 , H04N19/105 , H04N19/107 , H04N19/30
Abstract: There are disclosed various methods, apparatuses and computer program products for video encoding and decoding. In other embodiments, there is provided a method, an apparatus, a computer readable storage medium stored with code thereon for use by an apparatus, and a video encoder, for encoding a scalable bitstream, to provide indicating an encoding configuration, where only samples and syntax from intra coded pictures of base layer is used for coding the enhancement layer pictures. In other embodiments, there is provided an apparatus, a computer readable storage medium stored with code thereon for use by an apparatus, and a video decoder, for decoding a scalable bitstream, to receive indications of an encoding configuration, where only samples and syntax from intra coded pictures of base layer is used for coding the enhancement
-
Publication number: US20180199051A1
Publication date: 2018-07-12
Application number: US15915446
Application date: 2018-03-08
Applicant: NOKIA TECHNOLOGIES OY
IPC: H04N19/46 , H04N19/58 , H04N19/463 , H04N19/52
Abstract: A reference picture marking process and a reference picture list management process are handled in a unified reference picture marking and reference picture list management process. A new idle reference picture list may be used for handling reference pictures that are not used for reference in the current picture. Differential coding of picture order count may be used to increase coding efficiency. The reference picture management syntax structure may be sent in the picture parameter set for improved coding efficiency, e.g., in regular GOP (group of pictures) arrangements.