Efficient identification and correction of optical character recognition errors through learning in a multi-engine environment

Invention Grant

US09053350B1 Efficient identification and correction of optical character recognition errors through learning in a multi-engine environment (In force)

Patent Title: Efficient identification and correction of optical character recognition errors through learning in a multi-engine environment
Application No.: US13619853

Application Date: 2012-09-14
Publication No.: US09053350B1

Publication Date: 2015-06-09
Inventor: Ahmad E. Abdulkader, Matthew R. Casey
Applicant: Ahmad E. Abdulkader, Matthew R. Casey
Applicant Address: US CA Mountain View
Assignee: Google Inc.
Current Assignee: Google Inc.
Current Assignee Address: US CA Mountain View
Agency: Fenwick & West LLP
Main IPC: G06K9/00
IPC: G06K9/00; G06K9/62; G06K9/03

Abstract:

OCR errors are identified and corrected through learning. An error probability estimator is trained using ground truths to learn error probability estimation. Multiple OCR engines process a text image, and convert it into texts. The error probability estimator compares the outcomes of the multiple OCR engines for mismatches, and determines an error probability for each of the mismatches. If the error probability of a mismatch exceeds an error probability threshold, a suspect is generated and grouped together with similar suspects in a cluster. A question for the cluster is generated and rendered to a human operator for answering. The answer from the human operator is then applied to all suspects in the cluster to correct OCR errors in the resulting text. The answer is also used to further train the error probability estimator.
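
The flow described in the abstract can be illustrated with a minimal Python sketch. It is not the patented implementation. It assumes the engine outputs are already aligned as equal-length token lists, and it swaps the trained error probability estimator for a simple disagreement ratio. It also clusters suspects by their set of candidate readings, which is only one possible notion of "similar". All names (`Suspect`, `error_probability`, `find_suspects`, `cluster_suspects`, `correct`, `ask_operator`) are hypothetical.

```python
from collections import Counter, defaultdict
from dataclasses import dataclass


@dataclass(frozen=True)
class Suspect:
    position: int      # token index in the aligned engine outputs
    candidates: tuple  # distinct readings proposed by the engines, sorted


def error_probability(readings):
    """Stand-in estimator (assumption): share of engines that disagree
    with the most common reading. The patent trains this on ground truths."""
    top_count = Counter(readings).most_common(1)[0][1]
    return 1.0 - top_count / len(readings)


def find_suspects(engine_outputs, threshold):
    """Compare engine outputs position by position and flag mismatches
    whose estimated error probability exceeds the threshold."""
    suspects = []
    for pos, readings in enumerate(zip(*engine_outputs)):
        if len(set(readings)) > 1 and error_probability(readings) > threshold:
            suspects.append(Suspect(pos, tuple(sorted(set(readings)))))
    return suspects


def cluster_suspects(suspects):
    """Group suspects that share the same set of candidate readings."""
    clusters = defaultdict(list)
    for suspect in suspects:
        clusters[suspect.candidates].append(suspect)
    return dict(clusters)


def correct(engine_outputs, threshold, ask_operator):
    """Ask one question per cluster and apply the answer to every suspect in it."""
    # Start from the most common reading at each position.
    text = [Counter(r).most_common(1)[0][0] for r in zip(*engine_outputs)]
    clusters = cluster_suspects(find_suspects(engine_outputs, threshold))
    for candidates, members in clusters.items():
        answer = ask_operator(candidates)  # a single question for the cluster
        for suspect in members:
            text[suspect.position] = answer
    return text
```

Calling `correct` with three engine outputs that disagree on "modem" versus "modern" at two positions would put both positions in one cluster, so `ask_operator` is called once and its answer is applied to both. The final step of the abstract, feeding the operator's answer back into estimator training, is omitted here because the stand-in estimator has nothing to train.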

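IPC Classification:

G	Physics
G06	Computing; calculating or counting
G06K	Graphical data reading (image or video recognition or understanding G06V); presentation of data; record carriers; handling record carriers
G06K9/00	Methods or arrangements for recognising patterns (methods or arrangements for graph-reading or for converting the pattern of mechanical parameters, e.g. force or presence, into electrical signals G06K11/00) (image or video recognition or understanding G06V) (speech recognition G10L15/00)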