Abstract:
According to an aspect of the present principles, a method is provided for generating resource description framework benchmarks. The method includes deriving (350) a resultant benchmark dataset with a user specified size and a user specified coherence from and with respect to an input dataset of a given size and a given coherence by determining (340) which triples of subject-property-object to add to the input dataset or remove from the input dataset to derive the resultant benchmark dataset.
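The idea of choosing which triples to remove to reach a target coherence can be illustrated with a minimal Python sketch. The `coherence` function below is a simplified, assumed metric (the share of filled instance-property slots, averaged over types), not the measure defined in the patent. The names `coherence`, `lower_coherence`, and `type_of` are hypothetical. The sketch only lowers coherence by removing triples; adding triples and hitting a target size are left out.

```python
from collections import defaultdict

Triple = tuple[str, str, str]  # (subject, property, object)


def coherence(triples: set[Triple], type_of: dict[str, str]) -> float:
    """Assumed metric: fraction of (instance, property) slots filled, averaged over types."""
    props = defaultdict(set)      # type -> properties used by any instance of the type
    filled = defaultdict(set)     # type -> (subject, property) pairs that are present
    instances = defaultdict(set)  # type -> subjects declared with that type
    for subject, rdf_type in type_of.items():
        instances[rdf_type].add(subject)
    for subject, prop, _ in triples:
        rdf_type = type_of[subject]
        props[rdf_type].add(prop)
        filled[rdf_type].add((subject, prop))
    scores = [
        len(filled[t]) / (len(props[t]) * len(instances[t]))
        for t in instances
        if props[t]
    ]
    return sum(scores) / len(scores) if scores else 0.0


def lower_coherence(triples: set[Triple], type_of: dict[str, str], target: float) -> set[Triple]:
    """Greedily drop triples until the assumed coherence is at or below `target`."""
    result = set(triples)
    for triple in sorted(triples):  # sorted only to make the sketch deterministic
        current = coherence(result, type_of)
        if current <= target:
            break
        candidate = result - {triple}
        # Keep a removal only if it actually reduces coherence.
        if coherence(candidate, type_of) < current:
            result = candidate
    return result


if __name__ == "__main__":
    types = {"a": "Person", "b": "Person"}
    data = {
        ("a", "name", "A"), ("a", "age", "30"),
        ("b", "name", "B"), ("b", "age", "41"),
    }
    reduced = lower_coherence(data, types, target=0.75)
    print(len(reduced), coherence(reduced, types))
```

A greedy pass is used here only for brevity; it gives no guarantee of reaching the target exactly.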
Abstract:
Methods and systems for aggregating search query results include receiving (202) search query results and schema information for the query results from multiple heterogeneous sources (102), determining types (116) for elements of the query results based on the schema information, determining potential aggregations (204) for the query results based on the types, which are based on accumulated information from the multiple heterogeneous sources (102), and aggregating (220) the query results according to one or more of the potential aggregations.
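As a rough illustration of type-driven aggregation, the following Python sketch merges results from several sources, looks up candidate aggregations per element type, and applies one of them. The type names (`"number"`, `"category"`), the `AGGREGATIONS` table, and all function names are assumptions made for this example. The abstract does not specify them.

```python
from collections import Counter
from statistics import mean

# Hypothetical mapping from an element type to the aggregations that make sense for it.
AGGREGATIONS = {
    "number": {"sum": sum, "mean": mean},
    "category": {"count": Counter},
}


def merge_sources(sources):
    """Combine (schema, rows) pairs from several sources into one schema and one row list."""
    schema, rows = {}, []
    for source_schema, source_rows in sources:
        schema.update(source_schema)
        rows.extend(source_rows)
    return schema, rows


def potential_aggregations(schema):
    """List the candidate aggregation names for each field, based on its type."""
    return {field: sorted(AGGREGATIONS.get(ftype, {})) for field, ftype in schema.items()}


def aggregate(rows, schema, field, name):
    """Apply the named aggregation to all values of `field` present in `rows`."""
    func = AGGREGATIONS[schema[field]][name]
    values = [row[field] for row in rows if field in row]
    return func(values)


if __name__ == "__main__":
    sources = [
        ({"item": "category", "price": "number"},
         [{"item": "book", "price": 10}, {"item": "pen", "price": 2}]),
        ({"item": "category", "price": "number"},
         [{"item": "book", "price": 12}]),
    ]
    schema, rows = merge_sources(sources)
    print(potential_aggregations(schema))
    print(aggregate(rows, schema, "price", "sum"))
    print(aggregate(rows, schema, "item", "count"))
```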
Abstract:
Methods and systems for determining schema element types are shown that include pooling (208) potential annotations for an element of an unlabeled schema from a plurality of heterogeneous sources, scoring (404) the pool of potential annotations according to relevancy using instance information from the plurality of heterogeneous sources to produce a relevancy score, and annotating (406) the element of the unlabeled schema using the most relevant potential annotations.
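The pool, score, and annotate steps might look like the following Python sketch. It assumes each source is a dictionary from a candidate label to a set of known instance values, and it scores a label by the fraction of the element's values found in that set. This overlap score and the names `score_annotations` and `annotate` are illustrative assumptions, not the scoring defined in the patent.

```python
def score_annotations(values, sources):
    """Pool candidate labels from all sources and score each by instance overlap."""
    values = set(values)
    scores = {}
    if not values:
        return scores
    for source in sources:
        for label, known_instances in source.items():
            overlap = len(values & known_instances) / len(values)
            # Keep the best score seen for a label across sources.
            scores[label] = max(scores.get(label, 0.0), overlap)
    return scores


def annotate(values, sources, top_k=1):
    """Return the `top_k` highest-scoring labels, ignoring labels with no overlap."""
    scores = score_annotations(values, sources)
    ranked = sorted(scores.items(), key=lambda item: (-item[1], item[0]))
    return [label for label, score in ranked[:top_k] if score > 0]


if __name__ == "__main__":
    column_values = ["Paris", "Berlin", "Rome"]
    sources = [
        {"city": {"Paris", "Berlin"}, "name": {"Paris"}},
        {"capital": {"Paris", "Berlin", "Rome", "Madrid"}},
    ]
    print(annotate(column_values, sources, top_k=2))
```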
Abstract:
The present disclosure provides systems and methods that generate query templates that are expressed in a generic schema-agnostic language. The query templates can be generated "from scratch" or can be automatically generated from existing queries, a process which may be referred to as "templatizing" the existing queries. As one example, generation of query templates can be performed through an iterative process that generates candidate templates over time to optimize coverage over a set of existing queries. After generation of the schema-agnostic query templates, the systems and methods described herein can automatically translate/map the templatized queries into "concrete," schema-specific queries that can be evaluated over specific customer schemas/datasets. In this manner, a query template for a given semantic query (e.g., "return the names of all employees") needs to be written only once.
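A minimal Python sketch of the two directions, templatizing an existing query and translating a template into a schema-specific query, is given below. It uses the standard library's `string.Template` as a stand-in for the schema-agnostic language, which is an assumption of this example. The placeholder names, the sample schemas, and the functions `templatize` and `concretize` are hypothetical, and the plain string replacement would not be safe for real SQL.

```python
from string import Template


def templatize(query: str, schema_mapping: dict[str, str]) -> Template:
    """Replace concrete identifiers in `query` with schema-agnostic placeholders."""
    # Replace longer identifiers first so a short name cannot clobber part of a long one.
    for placeholder, concrete in sorted(
        schema_mapping.items(), key=lambda item: -len(item[1])
    ):
        query = query.replace(concrete, f"${placeholder}")
    return Template(query)


def concretize(template: Template, schema_mapping: dict[str, str]) -> str:
    """Map a schema-agnostic template onto one customer's schema."""
    return template.substitute(schema_mapping)


if __name__ == "__main__":
    # Hypothetical mappings from generic concepts to two customers' identifiers.
    customer_a = {"employee_name": "full_name", "employee_table": "staff"}
    customer_b = {"employee_name": "emp_nm", "employee_table": "hr.employees"}

    template = templatize("SELECT full_name FROM staff", customer_a)
    print(template.template)
    print(concretize(template, customer_b))
```

Here the query "return the names of all employees" is written once against customer A's schema and reused for customer B by swapping the mapping.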