Abstract:
Embodiments described herein provide a technique to crowdsource labeling of training data for a machine learning model while maintaining the privacy of the data provided by crowdsourcing participants. Client devices can be used to generate proposed labels for a unit of data to be used in a training dataset. One or more privacy mechanisms are used to protect user data when transmitting the data to a server. The server can aggregate the proposed labels and use the most frequently proposed labels for an element as the label for the element when generating training data for the machine learning model. The machine learning model is then trained using the crowdsourced labels to improve the accuracy of the model.
Abstract:
The disclosed embodiments provide a system that manages access to a user account on electronic devices of a user. An identity service provides a device token for the electronic device and a set of handles associated with the user account to the electronic device (302). The identity service receives, from the electronic device, a handle registration containing selected handles from the set (304). The identity service transmits an identity certificate comprising an association between the selected handles and the electronic device to the electronic device (306). The identity certificate and the association are used to route data associated with the selected handles to and from the electronic device (308). Examples of handles are the user's e-mail addresses or telephone numbers whereby the electronic device is configured to receive communications for the user sent to selected ones of the user's communication addresses comprised in the user account.
Abstract:
One embodiment provides for a mobile electronic device comprising a non-transitory machine-readable medium to store instructions, the instructions to cause the mobile electronic device to receive a set of labeled data from a server; receive a unit of data from the server, the unit of data of a same type of data as the set of labeled data; determine a proposed label for the unit of data via a machine learning model on the mobile electronic device, the machine learning model to determine the proposed label for the unit of data based on the set of labeled data from the server and a set of unlabeled data associated with the mobile electronic device; encode the proposed label via a privacy algorithm to generate a privatized encoding of the proposed label; and transmit the privatized encoding of the proposed label to the server.
Abstract:
Embodiments described herein provide a privacy mechanism to protect user data when transmitting the data to a server that estimates a frequency of such data amongst a set of client devices. In one embodiment, a differential privacy mechanism is implemented using a count-mean-sketch technique that can reduce resource requirements required to enable privacy while providing provable guarantees regarding privacy and utility. For instance, the mechanism can provide the ability to tailor utility (e.g. accuracy of estimations) against the resource requirements (e.g. transmission bandwidth and computation complexity).
Abstract:
Embodiments described herein ensure differential privacy when transmitting data to a server that estimates a frequency of such data amongst a set of client devices. The differential privacy mechanism may provide a predictable degree of variance for frequency estimations of data. The system may use a multibit histogram model or Hadamard multibit model for the differential privacy mechanism, both of which provide a predictable degree of accuracy of frequency estimations while still providing mathematically provable levels of privacy.
Abstract:
Embodiments described herein provide a privacy mechanism to protect user data when transmitting the data to a server that estimates a frequency of such data amongst a set of client devices. In one embodiment, a differential privacy mechanism is implemented using a countmean- sketch technique that can reduce resource requirements required to enable privacy while providing provable guarantees regarding privacy and utility. For instance, the mechanism can provide the ability to tailor utility (e.g. accuracy of estimations) against the resource requirements (e.g. transmission bandwidth and computation complexity).
Abstract:
Systems and methods are disclosed for a server learning new words generated by user client devices in a crowdsourced manner while maintaining local differential privacy of client devices. A client device can determine that a word typed on the client device is a new word that is not contained in a dictionary or asset catalog on the client device. New words can be grouped in classifications such as entertainment, health, finance, etc. A differential privacy system on the client device can comprise a privacy budget for each classification of new words. If there is privacy budget available for the classification, then one or more new terms in a classification can be sent to new term learning server, and the privacy budget for the classification reduced. The privacy budget can be periodically replenished.
Abstract:
The disclosed embodiments provide a system that manages access to a user account on electronic devices of a user. An identity service provides a device token for the electronic device and a set of handles associated with the user account to the electronic device (302). The identity service receives, from the electronic device, a handle registration containing selected handles from the set (304). The identity service transmits an identity certificate comprising an association between the selected handles and the electronic device to the electronic device (306). The identity certificate and the association are used to route data associated with the selected handles to and from the electronic device (308). Examples of handles are the user's e-mail addresses or telephone numbers whereby the electronic device is configured to receive communications for the user sent to selected ones of the user's communication addresses comprised in the user account.
Abstract:
One embodiment provides for a mobile electronic device comprising a non-transitory machine-readable medium to store instructions, the instructions to cause the mobile electronic device to receive a set of labeled data from a server; receive a unit of data from the server, the unit of data of a same type of data as the set of labeled data; determine a proposed label for the unit of data via a machine learning model on the mobile electronic device, the machine learning model to determine the proposed label for the unit of data based on the set of labeled data from the server and a set of unlabeled data associated with the mobile electronic device; encode the proposed label via a privacy algorithm to generate a privatized encoding of the proposed label; and transmit the privatized encoding of the proposed label to the server.