Invention Grant
- Patent Title: Filter for blocking image-based spam
- Patent Title (中): 过滤阻止基于图像的垃圾邮件
-
Application No.: US12039310Application Date: 2008-02-28
-
Publication No.: US08055078B2Publication Date: 2011-11-08
- Inventor: Jaesik Choi , Ke Wei , Vishwanth Tumkur Ramarao
- Applicant: Jaesik Choi , Ke Wei , Vishwanth Tumkur Ramarao
- Applicant Address: US CA Sunnyvale
- Assignee: Yahoo! Inc.
- Current Assignee: Yahoo! Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Frommer Lawrence & Haug LLP
- Agent John W. Branch
- Main IPC: G06K9/62
- IPC: G06K9/62 ; G06F15/16

Abstract:
A network device and method are directed towards detecting and blocking image spam within a message by employing a weighted min-hash to perform a near duplicate detection (NDD) of determined features within an image as compared to known spam images. The weighting for the min-hash is determined based on employing a machine learning algorithm, such as a perceptron, to identify an importance of each bit in a signature vector of the image. The signature vector is generated by extracting a shape of text in the image using a Discrete Cosine Transform, extracting low-frequency characteristics using a high-pass filter, and then performing various morphological operations to emphasize the shape of the text and reduce noise. Selected feature bits are extracted from the lowest frequency and intensity bits of the resulting signal to generate the signature vector used in the weighted min-hash NDD.
Public/Granted literature
- US20090220166A1 FILTER FOR BLOCKING IMAGE-BASED SPAM Public/Granted day:2009-09-03
Information query