DATA SELECTION BASED ON CONSUMPTION AND QUALITY METRICS FOR ATTRIBUTES AND RECORDS OF A DATASET

    公开(公告)号:US20230289839A1

    公开(公告)日:2023-09-14

    申请号:US17693799

    申请日:2022-03-14

    Applicant: ADOBE INC.

    CPC classification number: G06Q30/0204

    Abstract: Embodiments provide systems, methods, and computer storage media for management, assessment, navigation, and/or discovery of data based on data quality, consumption, and/or utility metrics. Data may be assessed using attribute-level and/or record-level metrics that quantify data: “quality”—the condition of data (e.g., presence of incorrect or incomplete values), its “consumption”—the tracked usage of data in downstream applications (e.g., utilization of attributes in dashboard widgets or customer segmentation rules), and/or its “utility”—a quantifiable impact resulting from the consumption of data (e.g., revenue or number of visits resulting from marketing campaigns that use particular datasets, storage costs of data). This data assessment may be performed at different stages of a data intake, preparation, and/or modeling lifecycle. For example, a data selection interface may filter based on consumption and/or quality metrics to facilitate discovery of more effective data for machine learning model training, data visualization, or marketing campaigns.

    PERSONALIZED VISUALIZATION RECOMMENDATION SYSTEM

    公开(公告)号:US20220147540A1

    公开(公告)日:2022-05-12

    申请号:US17091941

    申请日:2020-11-06

    Applicant: ADOBE INC.

    Abstract: Systems and methods for personalized visualization recommendation are described. Embodiments of the described systems and methods are configured to identify a first matrix representing user interactions with a plurality of data attributes corresponding to a plurality of datasets, a second matrix representing user interactions with a plurality of visualizations, and a third matrix representing a plurality of meta-features for each of the data attributes; compute low-dimensional embeddings representing user characteristics, the data attributes, visualization configurations, and the meta-features using joint factorization of the first matrix, the second matrix and the third matrix; generate a model for predicting visualization preference weights based on the low-dimensional embeddings; predict the visualization preference weights for a user corresponding to a plurality of candidate visualizations of dataset using the model; and generate a personalized visualization of the dataset for the user based on the predicted visualization preference weights.

    GENERATING HIERARCHICAL QUERIES FROM NATURAL LANGUAGE

    公开(公告)号:US20250124023A1

    公开(公告)日:2025-04-17

    申请号:US18486603

    申请日:2023-10-13

    Applicant: ADOBE INC.

    Abstract: Systems and methods for generating hierarchical queries from text queries are described. Embodiments are configured to encode a text query to obtain a text embedding. Then, embodiments select a field of a data schema by comparing the text embedding to a field embedding corresponding to the field. Subsequently, embodiments generate a hierarchical query including a value corresponding to the selected field. Some embodiments further include one or more formatting models configured to format values included in the text query.

Patent Agency Ranking