SUBPICTURE ENTITY GROUP SIGNALING IN CODED VIDEO

    公开(公告)号:EP3972274A1

    公开(公告)日:2022-03-23

    申请号:EP21197202.1

    申请日:2021-09-16

    Applicant: Lemon Inc.

    Inventor: Wang, Ye-Kui

    Abstract: Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, wherein the format rule specifies a characteristic of a syntax element in the visual media file, and wherein the format rule specifies that the syntax element that has a value indicative of a level identification is coded in any one or both of a subpicture common group box or a subpicture multiple groups box using eight bits.

    CHROMA FORMAT AND BIT DEPTH INDICATION IN CODED VIDEO

    公开(公告)号:EP3972272A1

    公开(公告)日:2022-03-23

    申请号:EP21197131.2

    申请日:2021-09-16

    Applicant: Lemon Inc.

    Inventor: Wang, Ye-Kui

    Abstract: Systems, methods and apparatus for processing visual media data are described. One example method includes performing a conversion between visual media data and a visual media file including one or more tracks storing one or more bitstreams of the visual media data according to a format rule; wherein the format rule specifies whether a first element indicative of whether a track contains a bitstream corresponding to a specific output layer set controls whether a second element indicative of a chroma format of the track and/or a third element indictive of a bit depth information of the track is included in a configuration record of the track.

    OPERATING POINT ENTITY GROUP SIGNALING IN CODED VIDEO

    公开(公告)号:EP3972267A1

    公开(公告)日:2022-03-23

    申请号:EP21197219.5

    申请日:2021-09-16

    Applicant: Lemon Inc.

    Inventor: WANG, Ye-Kui

    Abstract: Systems, methods and apparatus for generating or parsing a visual media file according to a file format include performing a conversion between a visual media data and a visual media file that stores a bitstream of the visual media data according to a format rule. The visual media file stores multiple tracks that belong to an entity group of a specific type. The format rule specifies that, responsive to the multiple tracks having a track reference to a particular type to a group identifier, the multiple tracks (A) omit carrying a sample group of a specific type or (B) carry the sample group of the specific type such that information in the sample group of the specific type is consistent with that in the entity group of the specific type.

    DEPENDENCY INFORMATION SIGNALING IN CODED VIDEO

    公开(公告)号:EP3972266A1

    公开(公告)日:2022-03-23

    申请号:EP21197196.5

    申请日:2021-09-16

    Applicant: Lemon Inc.

    Inventor: WANG, Ye-Kui

    Abstract: Systems, methods and apparatus for storing or parsing a visual media file according a file format include performing a conversion between a visual media data and a visual media file that stores a bitstream of the visual media data according to a format rule. The visual media file stores one or more tracks comprising one or more video layers. The format rule specifies that whether a first set of syntax elements indicative of layer dependency information is stored in the visual media file is dependent on whether a second syntax element indicating that all layers in the visual media file are independent has a value 1.

    DECODER CONFIGURATION RECORD IN CODED VIDEO
    175.
    发明公开

    公开(公告)号:EP3972265A1

    公开(公告)日:2022-03-23

    申请号:EP21197127.0

    申请日:2021-09-16

    Applicant: Lemon Inc.

    Inventor: Wang, Ye-Kui

    Abstract: Systems, methods and apparatus for encoding or decoding a file format that stores one or more images are described. One example method includes performing a conversion between a visual media file and a bitstream of a visual media data according to a format rule, wherein the format rule specifies a characteristic of a syntax element in the visual media file, wherein the syntax element has a value that is indicative of a number of bytes used for indicating a constraint information associated with the bitstream.

    TRANSITION PERIOD FOR IMAGE TRANSITIONS IN A MEDIA FILE

    公开(公告)号:EP3965424A1

    公开(公告)日:2022-03-09

    申请号:EP21194361.8

    申请日:2021-09-01

    Applicant: Lemon Inc.

    Abstract: Systems, methods and apparatus for processing image data are described. One example method includes performing a conversion between a visual media file and a bitstream. The visual media file comprises image items each comprising a sequence of pictures according to a media file format, and the bitstream comprises access units each comprising one or more pictures each belonging to a layer according to a video coding format. The media file format specifies a suggested transition period of an image item to be applied for a transition between the image item and a next image item during displaying of the pictures as a slideshow.

    ASSOCIATION OF OPERATION POINT INFO PROPERTIES TO VVC IMAGE ITEMS

    公开(公告)号:EP3965421A1

    公开(公告)日:2022-03-09

    申请号:EP21194357.6

    申请日:2021-09-01

    Applicant: Lemon Inc.

    Abstract: Systems, methods and apparatus for processing image data are described. One example method includes performing a conversion between a visual media file and a bitstream. The visual media file comprises image items each comprising a sequence of one or more pictures according to a media file format, and the bitstream includes access units each consisting of one or more pictures each belonging to a layer according to a video coding format. The media file format specifies that image items comprising pictures originated from the bitstream are allowed to be associated with different instances of a property descriptor that indicates high-level characteristics of the bitstream.

    TEXT TRANSLATION METHOD AND APPARATUS, ELECTRONIC DEVICE AND MEDIUM

    公开(公告)号:EP4546211A1

    公开(公告)日:2025-04-30

    申请号:EP23899842.1

    申请日:2023-11-28

    Abstract: Embodiments of the present disclosure relate to a method, an apparatus, an electronic device, and a medium for text translation. The method includes determining a keyword set associated with a chapter-level monolingual corpus in a target language, where the keyword set includes a plurality of entity words and a plurality of pronouns, and masking the chapter-level monolingual corpus based on the keyword set. The method further includes generating a chapter-level text translation model based on the masked chapter-level monolingual corpus. According to the embodiments of the present disclosure, it is possible to enable translations of the same or associated words to have contextual consistency throughout a text, and to explicit a noun indicated by a pronoun, and further to supplement a missing pronoun, thereby improving accuracy of the text translation model.

    TRANSLATION METHOD AND APPARATUS, READABLE MEDIUM, AND ELECTRONIC DEVICE

    公开(公告)号:EP4546210A1

    公开(公告)日:2025-04-30

    申请号:EP23888031.4

    申请日:2023-11-08

    Abstract: Embodiments of the present disclosure relate to a translation method and apparatus, a readable medium, and an electronic device. The method includes: determining a source text to be translated and a source associated image corresponding to the source text; and inputting the source text and the source associated image into a pre-generated target multimodal translation model to obtain a target translation text output by the target multimodal translation model. The target multimodal translation model is a model generated by training an undetermined multimodal translation model according to sample data, and the sample data includes at least two of multimodal multilingual data, unimodal multilingual data, and multimodal monolingual data. The multimodal multilingual data includes a first source-language text, a first target-language text, and a first image corresponding to the first source-language text, the unimodal multilingual data includes a second source-language text and a second target-language text, and the multimodal monolingual data includes a third target-language text and a second image.

Patent Agency Ranking