SIMILARITY ANALYSES IN ANALYTICS WORKFLOWS

    Publication No.: US20220075794A1

    Publication date: 2022-03-10

    Application No.: US17530866

    Filing date: 2021-11-19

    Abstract: Examples include bypassing a portion of an analytics workflow. In some examples, execution of an analytics workflow may be monitored upon receipt of raw data, and the execution may be interrupted at an optimal bypass stage to obtain insights data from the raw data. A similarity analysis may be performed to compare the insights data to stored insights data in an insights data repository. Based, at least in part, on a determination of similarity, a bypass operation may be performed to bypass a remainder of the analytics workflow.
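
The bypass flow in this abstract can be sketched roughly as follows. This is a minimal illustration, not the patented method: insights data is assumed to be a numeric vector, cosine similarity stands in for the unspecified similarity analysis, and the bypass stage is passed in as a parameter because the abstract does not say how the optimal stage is chosen. The names `run_with_bypass`, `bypass_stage`, and `threshold` are hypothetical, as is the choice to store new insights when no match is found.

```python
from typing import Callable, List, Sequence, Tuple

Vector = List[float]
Stage = Callable[[Vector], Vector]


def cosine_similarity(a: Sequence[float], b: Sequence[float]) -> float:
    """Cosine similarity of two vectors; 0.0 if either has zero length."""
    dot = sum(x * y for x, y in zip(a, b))
    norm_a = sum(x * x for x in a) ** 0.5
    norm_b = sum(y * y for y in b) ** 0.5
    if norm_a == 0 or norm_b == 0:
        return 0.0
    return dot / (norm_a * norm_b)


def run_with_bypass(
    raw: Sequence[float],
    stages: Sequence[Stage],
    bypass_stage: int,          # assumed to be chosen elsewhere
    repository: List[Vector],   # stand-in for the insights data repository
    threshold: float = 0.95,    # assumed similarity cut-off
) -> Tuple[Vector, bool]:
    """Return (result, bypassed)."""
    # Execute the workflow up to the bypass stage to obtain insights data.
    data = list(raw)
    for stage in stages[:bypass_stage]:
        data = stage(data)
    insights = data

    # Similarity analysis against stored insights data.
    if any(cosine_similarity(insights, s) >= threshold for s in repository):
        return insights, True  # bypass the remainder of the workflow

    # No similar insights found: remember these and finish the workflow.
    repository.append(insights)
    for stage in stages[bypass_stage:]:
        data = stage(data)
    return data, False


stages = [lambda xs: [float(x) for x in xs], lambda xs: [x * 2 for x in xs]]
repo: List[Vector] = []
first_result, first_bypassed = run_with_bypass([1, 2, 3], stages, 1, repo)
second_result, second_bypassed = run_with_bypass([2, 4, 6], stages, 1, repo)
```

In this usage, the second input points in the same direction as the first, so its insights match the stored entry and the second stage is skipped.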

    TRANSFORMATION OF VIDEO STREAMS
    Patent application

    Publication No.: US20200005048A1

    Publication date: 2020-01-02

    Application No.: US16023848

    Filing date: 2018-06-29

    Abstract: Example aspects for transformation of video streams include searching for a first signature of a segment of a video stream in an index comprising a first level signature of each of a plurality of stored segments. In response to identifying a first set of similar segments from the stored segments, a second signature of the segment may be determined. In response to identifying a second set of similar segments from the first set of similar segments based on the second signature, a matching segment may be ascertained from the second set of similar segments. The matching segment may be provided for being stored in place of the segment in a storage medium.
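
The two-level lookup described here might look something like the sketch below. It assumes a segment is simply a list of integer samples, and both signature functions (a coarse quantized mean and a finer per-chunk quantized mean) are invented for illustration; the abstract does not say how either signature is computed or how the final match is ascertained. `SegmentStore`, `first_signature`, and `second_signature` are hypothetical names.

```python
from collections import defaultdict
from typing import Dict, List, Optional, Sequence, Tuple


def first_signature(segment: Sequence[int], bucket: int = 16) -> int:
    """Cheap, coarse signature: the quantized mean of the samples."""
    return int(sum(segment) / len(segment)) // bucket


def second_signature(segment: Sequence[int], parts: int = 4,
                     bucket: int = 8) -> Tuple[int, ...]:
    """Finer signature: the quantized mean of each chunk of the segment."""
    size = max(1, len(segment) // parts)
    chunks = [segment[i:i + size] for i in range(0, len(segment), size)]
    return tuple(int(sum(c) / len(c)) // bucket for c in chunks)


class SegmentStore:
    def __init__(self) -> None:
        self.segments: Dict[str, List[int]] = {}
        # Index from first-level signature to the ids of stored segments.
        self.index: Dict[int, List[str]] = defaultdict(list)

    def add(self, seg_id: str, segment: Sequence[int]) -> None:
        self.segments[seg_id] = list(segment)
        self.index[first_signature(segment)].append(seg_id)

    def find_match(self, segment: Sequence[int]) -> Optional[str]:
        # Level 1: search the index using the cheap signature.
        first_set = self.index.get(first_signature(segment), [])
        if not first_set:
            return None
        # Level 2: compute the second signature only when level 1 hits.
        sig2 = second_signature(segment)
        second_set = [i for i in first_set
                      if second_signature(self.segments[i]) == sig2]
        if not second_set:
            return None
        # Ascertain the match: smallest sample-wise absolute difference.
        return min(second_set, key=lambda i: sum(
            abs(a - b) for a, b in zip(self.segments[i], segment)))


store = SegmentStore()
store.add("a", [10, 10, 200, 200])
store.add("b", [100, 100, 110, 110])
match = store.find_match([11, 10, 201, 202])
```

Segments "a" and "b" share the same coarse signature, so only the second signature separates them; the returned id is what a caller would store in place of the new segment.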

    RELEVANCE OPTIMIZED REPRESENTATIVE CONTENT ASSOCIATED WITH A DATA STORAGE SYSTEM

    Publication No.: US20180276290A1

    Publication date: 2018-09-27

    Application No.: US15761991

    Filing date: 2016-03-10

    CPC classification number: G06F16/285 G06F16/345

    Abstract: Relevance optimized representative content associated with a data storage system is disclosed. One example is a system including a data summarization module, a clustering module, and a representative content selection module. The data summarization module associates, via a processor, each data object in a storage system with a derived data object. The clustering module determines clusters of similar data objects based on a similarity between associated derived data objects, and selects a representative data object for each determined cluster. The representative content selection module selects representative content associated with the storage system, where the representative content is based on the data objects, the derived data objects, and the representative data objects, and relevance optimizes the selected representative content to an analytics application.
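
As a rough illustration of the summarization and clustering modules, the sketch below derives a word set from each data object, groups objects greedily by Jaccard similarity, and picks the member most similar to the rest of its cluster as the representative. All of these choices are assumptions made for the example: the abstract does not name a summarization method, a similarity measure, or a selection rule, and the relevance optimization step is not covered here.

```python
from typing import Dict, FrozenSet, List, Tuple


def derive(text: str) -> FrozenSet[str]:
    """Hypothetical derived data object: the set of lowercase words."""
    return frozenset(text.lower().split())


def jaccard(a: FrozenSet[str], b: FrozenSet[str]) -> float:
    """Jaccard similarity of two sets; two empty sets count as identical."""
    if not a and not b:
        return 1.0
    return len(a & b) / len(a | b)


def cluster(objects: Dict[str, str], threshold: float = 0.5
            ) -> Tuple[List[List[str]], Dict[str, FrozenSet[str]]]:
    """Greedy clustering: join the first cluster whose seed is similar enough."""
    derived = {name: derive(text) for name, text in objects.items()}
    clusters: List[List[str]] = []
    for name in objects:
        for members in clusters:
            if jaccard(derived[name], derived[members[0]]) >= threshold:
                members.append(name)
                break
        else:
            clusters.append([name])  # no similar cluster: start a new one
    return clusters, derived


def representative(members: List[str],
                   derived: Dict[str, FrozenSet[str]]) -> str:
    """Pick the member with the highest total similarity to its cluster."""
    return max(members, key=lambda m: sum(
        jaccard(derived[m], derived[o]) for o in members))


objects = {
    "a": "disk failure on node one",
    "b": "disk failure on node two",
    "c": "quarterly sales report",
    "d": "disk failure on node",
}
clusters, derived = cluster(objects)
reps = [representative(members, derived) for members in clusters]
```

Here the three disk-failure objects fall into one cluster, and "d" is chosen as its representative because it overlaps most with the other two.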

    RE-EXECUTION OF AN ANALYTICAL PROCESS BASED ON LINEAGE METADATA

    Publication No.: US20180096079A1

    Publication date: 2018-04-05

    Application No.: US15281225

    Filing date: 2016-09-30

    CPC classification number: G06F16/907 G06F16/14

    Abstract: Examples disclosed herein relate to re-execution of an analytical process based on lineage metadata. In an example, a determination may be made on a hub device that an analytical process previously executed on a remote edge device is to be re-executed on the hub device, wherein the analytical process is part of an analytical workflow that is implemented at least in part on the hub device and the remote edge device. In response to the determination, a storage location of input data for re-executing the analytical process may be identified based on lineage metadata stored on the hub device, and input data may be acquired from the storage location.
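
The lookup step in this abstract can be pictured with a small sketch. It assumes lineage metadata is a set of records kept on the hub that map each analytical process to the device it ran on and the location of its input data. `LineageRecord`, `Hub`, the `fetch` callable, and the location string are all hypothetical; the abstract does not describe the metadata schema or the transport used to acquire the data.

```python
from dataclasses import dataclass
from typing import Any, Callable, Dict, Iterable


@dataclass(frozen=True)
class LineageRecord:
    process: str         # name of the analytical process
    executed_on: str     # device that originally ran it
    input_location: str  # where its input data is stored


class Hub:
    def __init__(self, lineage: Iterable[LineageRecord],
                 fetch: Callable[[str], Any]) -> None:
        # Lineage metadata stored on the hub, keyed by process name.
        self._lineage: Dict[str, LineageRecord] = {
            r.process: r for r in lineage}
        # Assumed helper that retrieves data from a storage location.
        self._fetch = fetch

    def re_execute(self, process: str, func: Callable[[Any], Any]) -> Any:
        """Re-run a process on the hub using input found via lineage."""
        record = self._lineage.get(process)
        if record is None:
            raise KeyError(f"no lineage metadata for process {process!r}")
        # Identify the storage location, then acquire the input data.
        data = self._fetch(record.input_location)
        return func(data)


# In-memory stand-in for remote storage, for illustration only.
storage = {"edge-7:/data/readings": [1, 2, 3]}
hub = Hub(
    [LineageRecord("sum_readings", "edge-7", "edge-7:/data/readings")],
    storage.__getitem__,
)
total = hub.re_execute("sum_readings", sum)
```

Keeping the fetch step behind a callable means the same lookup logic works whether the input data sits on the edge device, on the hub, or elsewhere.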
