Systems and methods for data usage monitoring in multi-tenancy enabled HADOOP clusters
Abstract:
Systems and methods for data usage monitoring in multi-tenancy enabled HADOOP clusters are disclosed. According to one embodiment, a method for monitoring data usage in multi-tenancy enabled HADOOP clusters may include: (1) receiving metadata related to a dataset in one or more multi-tenant clusters; (2) receiving entitlement data for a plurality of users to the dataset; (3) receiving group membership data for the plurality of users; (4) receiving access permissions for the plurality of users to the dataset; (5) receiving audit logs comprising access history for the plurality of users to the dataset; (6) joining the metadata, entitlement data, group membership data, access permissions, and audit logs into a searchable database; (7) receiving a query comprising at least one of a date range, a file, a directory, a user, and a group of users; (8) applying the query to the searchable database; and (9) returning results to the query.
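Steps (6) through (9) can be illustrated with a minimal sketch. It is not the disclosed implementation: it assumes an in-memory SQLite database stands in for the searchable database, and every table name, column name, and function name (`build_usage_db`, `query_usage`) is hypothetical. Dates are assumed to be ISO 8601 strings so that string comparison matches date order.

```python
import sqlite3

# Hypothetical schema: table and column names are illustrative only.
SCHEMA = """
CREATE TABLE metadata(path TEXT, cluster TEXT, owner TEXT);
CREATE TABLE entitlements(username TEXT, path TEXT);
CREATE TABLE memberships(username TEXT, grp TEXT);
CREATE TABLE permissions(username TEXT, path TEXT, mode TEXT);
CREATE TABLE audit(username TEXT, path TEXT, day TEXT, action TEXT);
"""


def build_usage_db(metadata, entitlements, memberships, permissions, audit):
    """Steps (1)-(6): load the five inputs into one searchable database."""
    db = sqlite3.connect(":memory:")
    db.executescript(SCHEMA)
    db.executemany("INSERT INTO metadata VALUES (?, ?, ?)", metadata)
    db.executemany("INSERT INTO entitlements VALUES (?, ?)", entitlements)
    db.executemany("INSERT INTO memberships VALUES (?, ?)", memberships)
    db.executemany("INSERT INTO permissions VALUES (?, ?, ?)", permissions)
    db.executemany("INSERT INTO audit VALUES (?, ?, ?, ?)", audit)
    db.commit()
    return db


def query_usage(db, start=None, end=None, path=None, directory=None,
                user=None, group=None):
    """Steps (7)-(9): filter the joined access history and return rows.

    Each row is (day, username, grp, path, action, cluster, mode, entitled).
    A user who belongs to several groups yields one row per group.
    """
    if all(v is None for v in (start, end, path, directory, user, group)):
        raise ValueError("query needs at least one filter")

    sql = """
        SELECT a.day, a.username, m.grp, a.path, a.action,
               md.cluster, p.mode, (e.username IS NOT NULL) AS entitled
        FROM audit a
        JOIN metadata md ON md.path = a.path
        LEFT JOIN memberships m ON m.username = a.username
        LEFT JOIN permissions p
               ON p.username = a.username AND p.path = a.path
        LEFT JOIN entitlements e
               ON e.username = a.username AND e.path = a.path
        WHERE 1 = 1
    """
    params = []
    if start is not None:
        sql += " AND a.day >= ?"
        params.append(start)
    if end is not None:
        sql += " AND a.day <= ?"
        params.append(end)
    if path is not None:
        sql += " AND a.path = ?"
        params.append(path)
    if directory is not None:
        # Prefix match without LIKE, so '_' and '%' in paths stay literal.
        prefix = directory.rstrip("/") + "/"
        sql += " AND substr(a.path, 1, ?) = ?"
        params.extend([len(prefix), prefix])
    if user is not None:
        sql += " AND a.username = ?"
        params.append(user)
    if group is not None:
        sql += " AND m.grp = ?"
        params.append(group)
    sql += " ORDER BY a.day, a.username"
    return db.execute(sql, params).fetchall()
```

The `entitled` column is 1 when the accessing user has a matching entitlement record and 0 otherwise, so a query by group or directory also surfaces accesses made without a recorded entitlement. In a real deployment the inputs would come from cluster metadata, an entitlement system, a directory service, file system permissions, and audit logs; here they are plain lists of tuples.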