Invention Grant
- Patent Title: Shape clustering in post optical character recognition processing
- Patent Title (中): 后光学字符识别处理中的形状聚类
-
Application No.: US12784359Application Date: 2010-05-20
-
Publication No.: US08111927B2Publication Date: 2012-02-07
- Inventor: Luc Vincent , Raymond W. Smith
- Applicant: Luc Vincent , Raymond W. Smith
- Applicant Address: US CA Mountain View
- Assignee: Google Inc.
- Current Assignee: Google Inc.
- Current Assignee Address: US CA Mountain View
- Agency: Fish & Richardson P.C.
- Main IPC: G06K9/62
- IPC: G06K9/62

Abstract:
Systems, methods and computer program products for shape clustering and applications in processing various documents, including an output of an optical character recognition (OCR) process. Clip images defined in a received OCR output are classified into a plurality of clusters of clip images. Clip images in each of the plurality of clusters are processed to generate a cluster image for each cluster. Shape differences between the cluster images of a first cluster and a second cluster and between the cluster images of the first cluster and a third cluster are used to determine a level of confidence in one or more first OCR character codes assigned to the first cluster.
Public/Granted literature
- US20100232719A1 Shape Clustering in Post Optical Character Recognition Processing Public/Granted day:2010-09-16
Information query