Invention Grant
- Patent Title: Automated parsing of e-mail messages
- Patent Title (Chinese): Automatic parsing of e-mail
- Application No.: US12871879
- Application Date: 2010-08-30
- Publication No.: US08527436B2
- Publication Date: 2013-09-03
- Inventor: Vamsi Salaka, Joy Thomas
- Applicant: Vamsi Salaka, Joy Thomas
- Applicant Address: US CA Mountain View
- Assignee: Stratify, Inc.
- Current Assignee: Stratify, Inc.
- Current Assignee Address: US CA Mountain View
- Main IPC: G06N5/00
- IPC: G06N5/00

Abstract:
An automated parser for e-mail messages identifies component parts such as header, body, signature, and disclaimer. The parser uses a hidden Markov model (HMM) in which the lines making up an e-mail are treated as a sequence of observations of a system that evolves according to a Markov chain having states corresponding to the component parts. The HMM is trained using a manually annotated set of e-mail messages, then applied to parse other e-mail messages. HMM-based parsing can be further refined or expanded using heuristic post-processing techniques that exploit redundancy of some component parts (e.g., signatures, disclaimers) across a corpus of e-mail messages.
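As an illustration only, the line-level HMM described in the abstract can be sketched as follows. This is a minimal sketch under stated assumptions, not the patented implementation: the observation symbols produced by `line_feature`, the add-one smoothing parameter `alpha`, the toy training set `TRAIN`, and the function names `train` and `viterbi` are all hypothetical and do not come from this record. Training is done by counting labeled lines, and decoding uses the standard Viterbi algorithm.

```python
import math

# States correspond to the component parts named in the abstract.
STATES = ["header", "body", "signature", "disclaimer"]
# Hypothetical coarse observation symbols; the record does not list real features.
SYMBOLS = ["blank", "header_field", "sig_marker", "legal", "text"]


def line_feature(line):
    """Map one raw line to an observation symbol (assumed feature set)."""
    text = line.strip().lower()
    if not text:
        return "blank"
    if text.startswith(("from:", "to:", "cc:", "subject:", "date:")):
        return "header_field"
    if text.startswith(("--", "regards", "thanks", "best")):
        return "sig_marker"
    if "confidential" in text or "intended recipient" in text:
        return "legal"
    return "text"


def _normalize(counts):
    """Turn a dict of counts into log-probabilities."""
    total = sum(counts.values())
    return {key: math.log(value / total) for key, value in counts.items()}


def train(annotated_messages, alpha=1.0):
    """Estimate start, transition, and emission tables from labeled lines.

    Each message is a list of (line, state) pairs; alpha is additive smoothing.
    """
    start = {s: alpha for s in STATES}
    trans = {s: {t: alpha for t in STATES} for s in STATES}
    emit = {s: {o: alpha for o in SYMBOLS} for s in STATES}
    for message in annotated_messages:
        prev = None
        for line, state in message:
            emit[state][line_feature(line)] += 1
            if prev is None:
                start[state] += 1
            else:
                trans[prev][state] += 1
            prev = state
    return (_normalize(start),
            {s: _normalize(trans[s]) for s in STATES},
            {s: _normalize(emit[s]) for s in STATES})


def viterbi(lines, model):
    """Return the most likely state label for each line of a message."""
    start, trans, emit = model
    if not lines:
        return []
    obs = [line_feature(line) for line in lines]
    score = {s: start[s] + emit[s][obs[0]] for s in STATES}
    back = []  # back[i][t] = best predecessor of state t at position i + 1
    for symbol in obs[1:]:
        new_score, pointers = {}, {}
        for t in STATES:
            best = max(STATES, key=lambda s: score[s] + trans[s][t])
            pointers[t] = best
            new_score[t] = score[best] + trans[best][t] + emit[t][symbol]
        score = new_score
        back.append(pointers)
    state = max(STATES, key=lambda s: score[s])
    path = [state]
    for pointers in reversed(back):
        state = pointers[state]
        path.append(state)
    path.reverse()
    return path


# Toy manually annotated corpus (invented for this sketch).
TRAIN = [
    [("From: alice@example.com", "header"), ("Subject: Lunch", "header"),
     ("Are we still on for noon?", "body"), ("See you there.", "body"),
     ("-- ", "signature"), ("Alice", "signature"),
     ("This message is confidential.", "disclaimer")],
    [("From: bob@example.com", "header"), ("To: alice@example.com", "header"),
     ("Yes, noon works.", "body"), ("Regards,", "signature"),
     ("Bob", "signature"), ("This message is confidential.", "disclaimer")],
]
```

A typical use would be `model = train(TRAIN)` followed by `viterbi(message_lines, model)`, which returns one label per input line. Because each line is labeled in the context of its neighbors, a short line such as a sender's name can be assigned to the signature when it follows a sign-off, even though its own features look like ordinary text.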
Public/Granted literature
- US20120054135A1, AUTOMATED PARSING OF E-MAIL MESSAGES, Public/Granted day: 2012-03-01
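IPC classification:
G | Physics |
G06 | Computing; calculating or counting |
G06N | Computer systems based on specific computational models |
G06N5/00 | Computer systems using knowledge-based models |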