Invention Grant
- Patent Title: Spam filtering based on statistics and token frequency modeling
- Application No.: US12328723
- Application Date: 2008-12-04
- Publication No.: US08364766B2
- Publication Date: 2013-01-29
- Inventor: Lei Zheng, Sharat Narayan, Mark E. Risher, Stanley Ke Wei, Vishwanath Tumkur Ramarao, Anirban Kundu
- Applicant: Lei Zheng, Sharat Narayan, Mark E. Risher, Stanley Ke Wei, Vishwanath Tumkur Ramarao, Anirban Kundu
- Applicant Address: US CA Sunnyvale
- Assignee: Yahoo! Inc.
- Current Assignee: Yahoo! Inc.
- Current Assignee Address: US CA Sunnyvale
- Agency: Frommer Lawrence & Haug LLP
- Agent: John W. Branch
- Main IPC: G06F15/16
- IPC: G06F15/16

Abstract:
Embodiments are directed towards classifying messages as spam using a two-phase approach. The first phase employs a statistical classifier to classify messages based on message content. The second phase targets specific message types to capture dynamic characteristics of the messages and identifies spam messages using a token-frequency-based approach. A client component receives messages and sends them to the statistical classifier, which determines a probability that a message belongs to a particular message class. The statistical classifier further provides other information about a message, including a token list and token thresholds. The message class, token list, and thresholds are provided to the second phase, where the number of spam tokens in a given message for a given message class is determined. Based on the threshold, the client component then determines whether the message is spam or non-spam.
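
The two-phase flow described in the abstract can be illustrated with a minimal Python sketch. This is not the patented implementation. The keyword-weight "classifier", the class names, the token lists, the thresholds, the `min_probability` cutoff, and the "count at or above threshold means spam" rule are all hypothetical stand-ins chosen only to show how a class probability, a token list, and a threshold might fit together.

```python
from dataclasses import dataclass


@dataclass
class ClassProfile:
    """Hypothetical phase-one output for a class: spam tokens and a count threshold."""
    spam_tokens: frozenset
    threshold: int


# Toy stand-in for a trained statistical model (assumption): keyword weights per class.
CLASS_KEYWORDS = {
    "pharmacy": {"pills": 2.0, "prescription": 1.5, "discount": 0.5},
    "finance": {"loan": 2.0, "credit": 1.5, "discount": 0.5},
}

# Hypothetical token lists and thresholds, one profile per message class.
CLASS_PROFILES = {
    "pharmacy": ClassProfile(frozenset({"pills", "cheap", "discount"}), threshold=2),
    "finance": ClassProfile(frozenset({"loan", "guaranteed", "approval"}), threshold=2),
}


def tokenize(message):
    """Lowercase and split on whitespace (a deliberately naive tokenizer)."""
    return message.lower().split()


def classify(message):
    """Phase one: return (class, probability, profile) for the best-scoring class."""
    tokens = tokenize(message)
    scores = {
        name: sum(weights.get(t, 0.0) for t in tokens)
        for name, weights in CLASS_KEYWORDS.items()
    }
    total = sum(scores.values())
    if total == 0:
        return None, 0.0, None  # no class matched at all
    best = max(scores, key=scores.get)
    return best, scores[best] / total, CLASS_PROFILES[best]


def count_spam_tokens(message, profile):
    """Phase two: count message tokens that appear in the class's token list."""
    return sum(1 for t in tokenize(message) if t in profile.spam_tokens)


def is_spam(message, min_probability=0.6):
    """Client-side decision: apply phase two only to confidently classified messages."""
    label, prob, profile = classify(message)
    if label is None or prob < min_probability:
        return False
    # Assumption: a count at or above the threshold marks the message as spam.
    return count_spam_tokens(message, profile) >= profile.threshold
```

In this sketch, `is_spam("cheap pills discount today")` is classified as "pharmacy" and contains three listed tokens, so it is flagged, while `is_spam("your prescription is ready")` falls in the same class but contains no listed tokens and is not flagged.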
Public/Granted literature
- US20100145900A1 SPAM FILTERING BASED ON STATISTICS AND TOKEN FREQUENCY MODELING (Public/Granted day: 2010-06-10)