Optimization of lip syncing in natural language translated video

    公开(公告)号:GB2625696A

    公开(公告)日:2024-06-26

    申请号:GB202405450

    申请日:2022-09-22

    Applicant: IBM

    Abstract: An approach for generating an optimized video of a speaker, translated from a source language into a target language with the speaker's lips synchronized to the translated speech, while balancing optimization of the translation into a target language. A source video may be fed into a neural machine translation model. The model may synthesize a plurality of potential translations.The translations may be received by a generative adversarial network which generates video for each translation and classifies the translations as in-sync or out of sync. A lip-syncing score may be for each of the generated videos that are classified as in-sync.

    Image encryption
    2.
    发明专利

    公开(公告)号:GB2630253B

    公开(公告)日:2025-04-30

    申请号:GB202413401

    申请日:2023-03-21

    Applicant: IBM

    Abstract: Image data encryption by receiving first image data corresponding to a first image having a first image size, compressing the first image data, yielding second image data corresponding to a second image having a second image size, augmenting the second image data yielding third image data corresponding to a third image having the first image size, determining coordinates of a location of the second image within the third image, encrypting the third image data according to the coordinates, providing the encrypted third image data to a decoder by a first communications channel, and providing the coordinates of the second image within the third image to the decoder by a second communications channel.

    Generative adversarial network implemented digital script modification

    公开(公告)号:GB2624614B

    公开(公告)日:2024-12-18

    申请号:GB202403627

    申请日:2022-08-25

    Applicant: IBM

    Abstract: A system, method, and computer program product for implementing digital script modification is provided. The method includes generating image sequences associated with textual content of a digital story. Multiple contextual dimensions are identified within the textual content and a group of dimensions are selected. The image sequences in combination with the group of dimensions are expanding or contracted and image sequences are altered based on detected interactions with the group of dimensions. Dimensions are extracted from the group of dimensions during presentation of the digital story and a scriptwriter is enabled to modify the dimensions. The image sequences are modified and a hardware interface device is enabled to interact with various image sequences and alter the multiple contextual dimensions. The textual content of the digital story is dynamically altered.

    Generative adversarial network implemented digital script modification

    公开(公告)号:GB2624614A

    公开(公告)日:2024-05-22

    申请号:GB202403627

    申请日:2022-08-25

    Applicant: IBM

    Abstract: A system, method, and computer program product for implementing digital script modification is provided. The method includes generating image sequences associated with textual content of a digital story. Multiple contextual dimensions are identified within the textual content and a group of dimensions are selected. The image sequences in combination with the group of dimensions are expanding or contracted and image sequences are altered based on detected interactions with the group of dimensions. Dimensions are extracted from the group of dimensions during presentation of the digital story and a scriptwriter is enabled to modify the dimensions. The image sequences are modified and a hardware interface device is enabled to interact with various image sequences and alter the multiple contextual dimensions. The textual content of the digital story is dynamically altered.

Patent Agency Ranking