Invention Grant
- Patent Title: Identification of content in an electronic document
- Patent Title (中): 电子文件内容的识别
-
Application No.: US11956687Application Date: 2007-12-14
-
Publication No.: US08301998B2Publication Date: 2012-10-30
- Inventor: Jean-David Ruvini
- Applicant: Jean-David Ruvini
- Applicant Address: US CA San Jose
- Assignee: eBay Inc.
- Current Assignee: eBay Inc.
- Current Assignee Address: US CA San Jose
- Agency: Schwegman Lundberg & Woessner, P.A.
- Main IPC: G06N3/00
- IPC: G06N3/00

Abstract:
In some embodiments, a method includes receiving an electronic document that comprises a plurality of sections. The method includes marking the plurality of sections as a content section or a non-content section using an attribute of the sections that includes at least one of a width of the section, a density of the plurality of hyperlinks in the section, a size of a font of text in the section and whether a title of the electronic document overlaps with text in the section. The method also includes storing the marking of the plurality of sections of the electronic document in a machine-readable medium.
Public/Granted literature
- US20090158138A1 IDENTIFICATION OF CONTENT IN AN ELECTRONIC DOCUMENT Public/Granted day:2009-06-18
Information query
IPC分类:
G | 物理 |
G06 | 计算;推算或计数 |
G06N | 基于特定计算模型的计算机系统 |
G06N3/00 | 基于生物学模型的计算机系统 |