Invention Grant
- Patent Title: Data-preserving text redaction for text utterance data
- Application No.: US16560839
- Application Date: 2019-09-04
- Publication No.: US11308945B1
- Publication Date: 2022-04-19
- Inventor: Thomas Drake, Oluwaseyi Feyisetan, Thomas Diethe
- Applicant: Amazon Technologies, Inc.
- Applicant Address: US WA Seattle
- Assignee: Amazon Technologies, Inc.
- Current Assignee: Amazon Technologies, Inc.
- Current Assignee Address: US WA Seattle
- Agency: Davis Wright Tremaine LLP
- Main IPC: G10L15/197
- IPC: G10L15/197; G10L15/30; G10L15/22

Abstract:
A hypernym of a word in utterance data may be probabilistically determined. The utterance data may correspond to a spoken query or command. A redacted utterance may be derived by replacing the word with the hypernym. The hypernym may be determined by applying noise to a position in a hierarchical embedding that corresponds to the word. The word may be identified as being potentially sensitive. The hierarchical embedding may be a Hyperbolic embedding that may indicate hierarchical relationships between individual words of a corpus of words, such as “red” is a “color” or “Austin” is in “Texas.” Noise may be applied by obtaining a first value in Euclidean space based on a second value in Hyperbolic space, and obtaining a third value in Hyperbolic space based on the first value in Euclidean space. The second value in Hyperbolic space may correspond to the word.
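
The noise step described in the abstract can be illustrated with a minimal sketch. Everything below is an assumption made for illustration and is not taken from the patent: the hierarchical embedding is modeled as a toy 2-D Poincare ball with hand-picked, hypothetical coordinates (more general words sit nearer the origin), the Hyperbolic-to-Euclidean and Euclidean-to-Hyperbolic steps are modeled as the logarithmic and exponential maps at the origin, the noise is Gaussian, and the replacement is chosen by nearest hyperbolic distance. The function names are hypothetical as well.

```python
import math
import random

# Hypothetical toy embedding in the 2-D Poincare ball (all norms < 1).
# Coordinates are invented for illustration; general terms sit near the origin.
EMBEDDING = {
    "color": (0.10, 0.00),
    "red": (0.60, 0.10),
    "blue": (0.60, -0.10),
    "place": (-0.10, 0.00),
    "Texas": (-0.45, 0.05),
    "Austin": (-0.75, 0.10),
}


def log_map_origin(x):
    """Hyperbolic point -> Euclidean tangent vector at the origin (assumed mapping)."""
    norm = math.sqrt(sum(c * c for c in x))
    if norm == 0.0:
        return list(x)
    scale = math.atanh(norm) / norm
    return [scale * c for c in x]


def exp_map_origin(v):
    """Euclidean tangent vector at the origin -> hyperbolic point (assumed mapping)."""
    norm = math.sqrt(sum(c * c for c in v))
    if norm == 0.0:
        return list(v)
    # Clamp the radius so the result stays strictly inside the unit ball.
    radius = min(math.tanh(norm), 1.0 - 1e-9)
    return [radius * c / norm for c in v]


def poincare_distance(u, v):
    """Geodesic distance between two points of the Poincare ball."""
    diff = sum((a - b) ** 2 for a, b in zip(u, v))
    nu = sum(a * a for a in u)
    nv = sum(b * b for b in v)
    return math.acosh(1.0 + 2.0 * diff / ((1.0 - nu) * (1.0 - nv)))


def redact_word(word, noise_scale, rng):
    """Perturb the word's position and return the nearest embedded word."""
    tangent = log_map_origin(EMBEDDING[word])                    # Hyperbolic -> Euclidean
    noisy = [c + rng.gauss(0.0, noise_scale) for c in tangent]   # apply noise
    point = exp_map_origin(noisy)                                # Euclidean -> Hyperbolic
    return min(EMBEDDING, key=lambda w: poincare_distance(point, EMBEDDING[w]))


def redact_utterance(tokens, sensitive, noise_scale, rng):
    """Replace each token flagged as sensitive; leave other tokens unchanged."""
    return [
        redact_word(t, noise_scale, rng) if t in sensitive and t in EMBEDDING else t
        for t in tokens
    ]


if __name__ == "__main__":
    rng = random.Random(7)
    print(redact_utterance(["weather", "in", "Austin"], {"Austin"}, 0.4, rng))
```

Because the replacement is sampled, repeated runs with different seeds may return the original word, a hypernym such as "Texas", or another nearby word; a larger `noise_scale` makes a replacement more likely. With `noise_scale` set to 0 the word is returned unchanged.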