- Patent Title: Identifying spam using near-duplicate detection for text and images
-
Application No.: US16510578Application Date: 2019-07-12
-
Publication No.: US11363064B2Publication Date: 2022-06-14
- Inventor: Spandan Thakur
- Applicant: ADOBE INC.
- Applicant Address: US CA San Jose
- Assignee: ADOBE INC.
- Current Assignee: ADOBE INC.
- Current Assignee Address: US CA San Jose
- Agency: Shook, Hardy & Bacon L.L.P.
- Main IPC: H04L9/30
- IPC: H04L9/30 ; H04L29/06 ; H04L9/40 ; G06F16/14

Abstract:
Embodiments described herein provide systems, methods, and computer storage media for detecting spam using by comparing hash values of content. In embodiments, hash values are generated based on the type of content and compared to other hash values in storage buckets. The similarity of content is determined by calculating the distance between two hash values and determining whether the distance exceeds a distance index. Counter values associated with hash values in storage are incremented when the distances between hash values exceed the distance index. Spam indications are communicated when the counter values for associated with hash values exceed a count threshold.
Public/Granted literature
- US20210014270A1 IDENTIFYING SPAM USING NEAR-DUPLICATE DETECTION FOR TEXT AND IMAGES Public/Granted day:2021-01-14
Information query