Text de-obfuscation with image recognition of text
Abstract:
Techniques are described for a de-obfuscation framework that utilizes image recognition of text. A word input by a user is received by the de-obfuscation service. Visual feature data associated with an image corresponding to each character of the word is generated. Word embeddings are generated using the visual feature data and each character of the word using a character encoder layer. Feature vectors are generated from the word embedding by combining the generated word embeddings and a provided word embedding using a second neural network. The generated feature vector is classified. Potential text obfuscation is detected from the classified generated feature vector using a lexicon to determine de-obfuscated text closet to the user text.
Public/Granted literature
Information query
Patent Agency Ranking
0/0