Invention Grant
- Patent Title: Document division method and system
- Patent Title (Chinese): 文件划分方法和制度
- Application No.: US13370981
- Application Date: 2012-02-10
- Publication No.: US09390077B2
- Publication Date: 2016-07-12
- Inventor: Shumeet Baluja
- Applicant: Shumeet Baluja
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06F17/00
- IPC: G06F17/00 ; G06F17/24 ; G06F17/21 ; G06F17/30

Abstract:
Computer-readable media store instructions that perform operations including receiving a first electronic document; determining a first information gain value associated with a first line that divides the first electronic document into a first portion and a second portion; determining a second information gain value associated with a second line that divides the first electronic document into a third portion and a fourth portion; and determining which of the first information gain value and the second information gain value is greater. Information gain values are determined by calculating a difference between an entropy value associated with a line and an entropy value associated with an electronic document. Entropy values associated with lines or electronic documents are determined based at least in part on document objects in the portions created by a line or in an electronic document.
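The comparison described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It makes several assumptions the abstract does not state: document objects are modeled as hypothetical (type, y-position) pairs, dividing lines are horizontal, entropy is Shannon entropy over object types, the entropy associated with a line is the size-weighted average of the entropies of its two portions, and information gain is the document entropy minus the line entropy (the usual decision-tree convention).

```python
import math
from collections import Counter

def entropy(labels):
    """Shannon entropy (in bits) of a list of object-type labels."""
    total = len(labels)
    if total == 0:
        return 0.0
    counts = Counter(labels)
    return -sum((n / total) * math.log2(n / total) for n in counts.values())

def line_entropy(objects, y_cut):
    """Size-weighted entropy of the two portions created by a horizontal
    line at y_cut (assumed definition of 'entropy associated with a line')."""
    total = len(objects)
    if total == 0:
        return 0.0
    above = [kind for kind, y in objects if y < y_cut]
    below = [kind for kind, y in objects if y >= y_cut]
    return (len(above) / total) * entropy(above) + (len(below) / total) * entropy(below)

def information_gain(objects, y_cut):
    """Document entropy minus line entropy (assumed sign convention)."""
    doc_entropy = entropy([kind for kind, _ in objects])
    return doc_entropy - line_entropy(objects, y_cut)

# Hypothetical document: two text blocks near the top, two images below.
objects = [("text", 10), ("text", 20), ("image", 60), ("image", 70)]

first_gain = information_gain(objects, 40)   # line between text and images
second_gain = information_gain(objects, 15)  # line through the text region

# Keep whichever candidate line yields the greater information gain.
best_cut = 40 if first_gain > second_gain else 15
```

In this toy document, the line at y = 40 separates the text blocks from the images cleanly, so it yields the greater gain and would be preferred over the line at y = 15.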
Public/Granted literature
- US20150193407A1 Document Division Method and System (Public/Granted day: 2015-07-09)