Invention Grant
- Patent Title: Document type identifying method and document type identifying apparatus
- Patent Title (中): 文件类型识别方法和文件类型识别装置
-
Application No.: US12585155Application Date: 2009-09-04
-
Publication No.: US08275792B2Publication Date: 2012-09-25
- Inventor: Akihiro Minagawa , Hiroaki Takebe , Katsuhito Fujimoto
- Applicant: Akihiro Minagawa , Hiroaki Takebe , Katsuhito Fujimoto
- Applicant Address: JP Kawasaki
- Assignee: Fujitsu Limited
- Current Assignee: Fujitsu Limited
- Current Assignee Address: JP Kawasaki
- Agency: Staas & Halsey LLP
- Main IPC: G06F17/30
- IPC: G06F17/30

Abstract:
A document type identifying apparatus includes in advance a database storing therein keywords used as keys that identify document types in association with each document type. The document type identifying apparatus aligns word strings written on a document and generates partial keyword strings for each keyword by using the keywords stored in the database. The partial keyword strings are to be checked for matching with the word strings written on the document. Then, the document type identifying apparatus checks matching of the grouped and aligned word strings with the partial keyword strings and obtains, for each keyword, each number of matched words with the highest matching rates between the grouped word strings that are successfully matched and the partial keyword strings. Then, each number of matched words is used to calculate each evaluation value to determine the document type.
Public/Granted literature
- US20100005096A1 Document type identifying method and document type identifying apparatus Public/Granted day:2010-01-07
Information query