Template-free extraction of data from documents

Invention Grant

US10366123B1 Template-free extraction of data from documents 有权

Please log in to see more content

Patent Title: Template-free extraction of data from documents
Application No.: US15967375

Application Date: 2018-04-30
Publication No.: US10366123B1

Publication Date: 2019-07-30
Inventor: Sunil H. Madhani , Anu Sreepathy , Samir Revti Kakkar
Applicant: INTUIT INC.
Applicant Address: US CA Mountain View
Assignee: INTUIT INC.
Current Assignee: INTUIT INC.
Current Assignee Address: US CA Mountain View
Agency: Patterson + Sheridan, LLP
Main IPC: G06F17/30
IPC: G06F17/30 ; G06F16/90

Template-free extraction of data from documents

Abstract:

The disclosed embodiments provide a system that processes data. One example embodiment is a computer-implemented method for processing data. The computer-implemented method includes obtaining text from a document associated with a user, wherein the document was generated based on a template and, with the obtained text intact, applying a set of rules to each term in the obtained text to determine a broad category of a plurality of terms associated with the term. The computer-implemented method further includes applying an additional set of rules to refine the broad category associated with the term to a refined category of fewer terms based on a location in the document of at least one term in the broad category of the plurality of terms, extracting a term from the obtained text using template-independent code developed to process documents generated based on a plurality of templates and enabling use of the term with an application.

Information query

Espacenet