Invention Grant
- Patent Title: Systems and methods for identifying spam messages using subject information
- Application No.: US15278512
- Application Date: 2016-09-28
- Publication No.: US09647975B1
- Publication Date: 2017-05-09
- Inventor: Roman A. Dedenok
- Applicant: AO KASPERSKY LAB
- Applicant Address: RU Moscow
- Assignee: AO KASPERSKY LAB
- Current Assignee: AO KASPERSKY LAB
- Current Assignee Address: RU Moscow
- Agency: Patterson Thuente Pedersen P.A.
- Priority: RU2016125278 20160624
- Main IPC: G06F17/30
- IPC: G06F17/30 ; G06F17/27 ; H04L12/58 ; G06F21/62

Abstract:
Systems and methods for identifying a spam email message. A system can include a rules database configured to store a plurality of ratio determination rules, a vectors database configured to store a plurality of known vectors, a message processing tool configured to receive an email message, a gram building tool configured to build a k-skip-n-gram set of word combinations according to the ratio determination rules, a vector building tool configured to receive the k-skip-n-gram set of word combinations and build a vector for each k-skip-n-gram word combination, and a spam identification tool configured to determine a spam presence threshold based on the cosine similarity for each k-skip-n-gram word combination and the plurality of known vectors for the particular email message subject field subject category, and determine that the email message contains spam when the spam presence threshold is exceeded.
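
The pipeline described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation: the abstract does not say how vectors are built, how the ratio determination rules select n and k, or how the threshold is derived. The sketch assumes that a k-skip-n-gram allows at most k skipped words in total, that each vector is a simple bag-of-words count, and that a subject is flagged when the best cosine similarity against any known vector exceeds a fixed threshold. The names `k_skip_n_grams`, `cosine_similarity`, and `is_spam`, and the default values of `n`, `k`, and `threshold`, are hypothetical.

```python
from collections import Counter
from itertools import combinations
from math import sqrt


def k_skip_n_grams(words, n, k):
    """Build word combinations of length n, skipping at most k words in total.

    Assumption: "k-skip" bounds the total number of skipped words per gram.
    """
    grams = []
    for start in range(len(words) - n + 1):
        # The remaining n-1 words may come from the next n-1+k positions.
        window = range(start + 1, min(start + n + k, len(words)))
        for rest in combinations(window, n - 1):
            grams.append((words[start],) + tuple(words[i] for i in rest))
    return grams


def cosine_similarity(a, b):
    """Cosine similarity between two sparse count vectors (Counter objects)."""
    dot = sum(a[t] * b[t] for t in a.keys() & b.keys())
    norm = sqrt(sum(v * v for v in a.values())) * sqrt(sum(v * v for v in b.values()))
    return dot / norm if norm else 0.0


def is_spam(subject, known_vectors, threshold=0.5, n=2, k=1):
    """Flag a subject line when any gram is close enough to a known vector.

    Assumption: vectors are bag-of-words counts and the decision uses the
    maximum similarity; the patent's actual rule may differ.
    """
    words = subject.lower().split()
    best = max(
        (cosine_similarity(Counter(gram), known)
         for gram in k_skip_n_grams(words, n, k)
         for known in known_vectors),
        default=0.0,
    )
    return best > threshold
```

For example, with a single known vector `Counter(("cheap", "watches"))`, the subject "Buy cheap watches now" is flagged because the gram `("cheap", "watches")` matches it, while "Meeting agenda for Monday" shares no words with it and is not flagged.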