-
公开(公告)号:US20250077959A1
公开(公告)日:2025-03-06
申请号:US18461703
申请日:2023-09-06
Applicant: AMPERITY, INC.
Inventor: Yan YAN , Pranav Behari LAL , Nicholas RESNICK , Joyce GORDON
IPC: G06N20/00
Abstract: In some implementations, the techniques described herein relate to a method including: loading a current and a new model, the new model including the most recent version of the current model; computing a migration duration based on computed properties, namely the jitter in predictions between the current and the new models based on imputing the same inference data to both models; blending outputs of the current model with outputs of the new model according to weights computed for a current time step in the migration process; and serving new predictions using the new model when the migration duration expires.
-
公开(公告)号:US12242514B2
公开(公告)日:2025-03-04
申请号:US17316293
申请日:2021-05-10
Applicant: AMPERITY, INC.
Inventor: Yan Yan , Stephen Keith Meyles , Graeme Andrew Kyle Roche , Jeffrey Allen Stokes , Carlos Minoru Sakoda , Dan Suciu
Abstract: The present disclosure relates clustering similar data records together in a hierarchical clustering scheme. Each tier in a cluster corresponds to a minimal match score, which reflects a degree of confidence. In this respect, a higher confidence may lead to smaller sized clusters while a lower confidence may lead to larger sized clusters. Ordinal classification may be used to generate hierarchical clusters. In some embodiments, hierarchical clustering with conflict resolution is used to resolve user-defined hard conflicts in each tier of the clustering results.
-
公开(公告)号:US12198072B2
公开(公告)日:2025-01-14
申请号:US18390803
申请日:2023-12-20
Applicant: AMPERITY, INC.
Inventor: Yan Yan , Aria Haghighi , Nicholas Resnick , Andrew Lim
IPC: G06N5/04 , G06F16/23 , G06F16/24 , G06N20/00 , G06Q30/0201 , G06Q30/01 , G06Q30/0202
Abstract: Disclosed are techniques for generating features to train a predictive model to predict a customer lifetime value or churn rate. In one embodiment, a method is disclosed comprising receiving a record that includes a plurality of fields and selecting a value associated with a selected field in the plurality of fields. The method then queries a lookup table comprising a mapping of values to aggregated statistics using the value and receives an aggregated statistic based on the querying. Next, the method generates a feature vector by annotating the record with the aggregated statistic. The method uses this feature vector as an input to a predictive model.
-
公开(公告)号:US11080043B1
公开(公告)日:2021-08-03
申请号:US16896844
申请日:2020-06-09
Applicant: Amperity, Inc.
Inventor: Gregory Kyle Look
Abstract: The present disclosure relates to methods and systems for applying version control of configurations to a software application, such as, a cloud-based application. Each version may be stored as a plurality of configuration nodes within a configuration tree structure. Version changes may lead to the creation or modification of configuration nodes. Configurations may be tested in a sandbox and undergo validation checks before being applied to the software application.
-
公开(公告)号:US20250148322A1
公开(公告)日:2025-05-08
申请号:US19017142
申请日:2025-01-10
Applicant: AMPERITY, INC.
Inventor: Yan YAN , Aria HAGHIGHI , Nicholas RESNICK , Andrew LIM
IPC: G06N5/04 , G06F16/23 , G06F16/24 , G06N20/00 , G06Q30/01 , G06Q30/0201 , G06Q30/0202
Abstract: Disclosed are techniques for generating features to train a predictive model to predict a customer lifetime value or churn rate. In one embodiment, a method is disclosed comprising receiving a record that includes a plurality of fields and selecting a value associated with a selected field in the plurality of fields. The method then queries a lookup table comprising a mapping of values to aggregated statistics using the value and receives an aggregated statistic based on the querying. Next, the method generates a feature vector by annotating the record with the aggregated statistic. The method uses this feature vector as an input to a predictive model.
-
公开(公告)号:US20250086180A1
公开(公告)日:2025-03-13
申请号:US18462762
申请日:2023-09-07
Applicant: AMPERITY, INC.
Inventor: Kailashnath Reddy KAVALAKUNTLA
IPC: G06F16/2453
Abstract: The present disclosure describes a system and method for optimizing SQL queries, specifically addressing challenges in handling and optimization of nested Common Table Expressions (CTEs). The system comprises a SQL optimization engine configured to receive SQL scripts from a SQL editor application and output optimized SQL to a query engine for execution on a database. The optimization engine utilizes three primary stages: a CTE normalization stage, a materialization stage, and a caching stage. The CTE normalization stage unnests nested CTEs into single-level CTEs. The materialization stage implements a materialized Create Table As Select (CTAS) strategy for materializing the base query. The caching stage enables reusability of the materialized base query across multiple queries, increasing efficiency and performance. This system provides technical solutions to enhance the capabilities of SQL engines that lack native support for nested CTEs, offering improved query performance and management of large datasets.
-
公开(公告)号:US12013855B2
公开(公告)日:2024-06-18
申请号:US18313753
申请日:2023-05-08
Applicant: AMPERITY, INC.
Inventor: Yan Yan , Aria Haghighi , Joseph Christianson
IPC: G06F16/2453 , G06F16/2457 , G06F16/28
CPC classification number: G06F16/24542 , G06F16/24578 , G06F16/285
Abstract: Disclosed are techniques for trimming large clusters of related records. In one embodiment, a method is disclosed comprising receiving a set of clusters, each cluster in the clusters including a plurality of records. The method extracts an oversized cluster in the set of clusters and performs a breadth-first search (BFS) on the oversized cluster to generate a list of visited records. The method terminates the BFS upon determining that the size of the list of visited records exceeds a maximum size and generates a new cluster from the list of visited records and adding the new cluster to the set of clusters. By recursively performing BFS traverse over the oversized cluster and extracting smaller new clusters from it, the oversized cluster is eventually partitioned into a set of sub-clusters with the size smaller than the predefined threshold.
-
公开(公告)号:US11797487B2
公开(公告)日:2023-10-24
申请号:US17715204
申请日:2022-04-07
Applicant: AMPERITY, INC.
Inventor: Stephen Meyles , Yan Yan , Dan Suciu , Michael P. Fikes
IPC: G06F7/02 , G06F16/00 , G06F16/174 , G06F16/28 , G06F16/22 , G06F40/197 , G06F17/16
CPC classification number: G06F16/1748 , G06F16/2272 , G06F16/285 , G06F40/197 , G06F16/288 , G06F17/16
Abstract: The present disclosure relates to optimizing one or more database tables that may include one or more redundant records. Records are clustered and assigned stable identifiers. In this manner, the underlying records within a cluster are not removed or deleted. As updates to the database are made, new clustering analyses are performed using the underlying records and any updates made. Newly identified clusters are reassigned stable identifiers.
-
公开(公告)号:US20230131884A1
公开(公告)日:2023-04-27
申请号:US17511946
申请日:2021-10-27
Applicant: AMPERITY, INC.
Inventor: Andrew LIM , Joseph CHRISTIANSON , Joyce GORDON , Nicholas RESNICK , Yan YAN
Abstract: The example embodiments are directed toward improvements in generating affinity groups. In an embodiment, a method is disclosed comprising generating probabilities of object interactions for a plurality of users, a given object recommendation ranking for a respective user comprising a ranked list of object attributes; calculating interaction probabilities for each user over a forecasting window; calculating affinity group rankings based on the probabilities of object interactions and the interaction probabilities for each user; and grouping the plurality of users based on the affinity group rankings.
-
公开(公告)号:US11003643B2
公开(公告)日:2021-05-11
申请号:US16399162
申请日:2019-04-30
Applicant: Amperity, Inc.
Inventor: Yan Yan , Stephen Keith Meyles , Graeme Andrew Kyle Roche , Jeffrey Allen Stokes , Carlos Minoru Sakoda , Dan Suciu
Abstract: The present disclosure relates clustering similar data records together in a hierarchical clustering scheme. Each tier in a cluster corresponds to a minimal match score, which reflects a degree of confidence. In this respect, a higher confidence may lead to smaller sized clusters while a lower confidence may lead to larger sized clusters. Ordinal classification may be used to generate hierarchical clusters. In some embodiments, hierarchical clustering with conflict resolution is used to resolve user-defined hard conflicts in each tier of the clustering results.
-
-
-
-
-
-
-
-
-