Invention Grant
- Patent Title: Systems and methods for text localization and recognition in an image of a document
- Application No.: US16457346
- Application Date: 2019-06-28
- Publication No.: US10671878B1
- Publication Date: 2020-06-02
- Inventor: Mohammad Reza Sarshogh, Keegan Hines
- Applicant: Capital One Services, LLC
- Applicant Address: US VA McLean
- Assignee: Capital One Services, LLC
- Current Assignee: Capital One Services, LLC
- Current Assignee Address: US VA McLean
- Agency: Bookoff McAndrews, PLLC
- Main IPC: G06K9/46
- IPC: G06K9/46; G06K9/62; G06K9/32

Abstract:
Disclosed are methods, systems, and non-transitory computer-readable media for localization and recognition of text from images. For instance, a first method may include: receiving an image; processing the image through a convolutional backbone to obtain feature map(s); processing the feature map(s) through a region of interest (RoI) network to obtain RoIs; filtering the RoIs through a filtering block to obtain final RoIs; and processing the final RoIs through a text recognition stack to obtain predicted character sequences for the final RoIs. A second method may include: constructing a text localization and recognition neural network (TLaRNN); obtaining training data; training the TLaRNN on the training data; and storing trained weights of the TLaRNN. Constructing the TLaRNN may include: connecting a convolutional backbone to a region of interest (RoI) network; connecting the RoI network to a filtering block; and connecting the filtering block to a text recognition network.
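
The order of stages in the first method can be illustrated with a minimal Python sketch. It shows only how the four stages connect (backbone, RoI network, filtering block, text recognition stack); it is not the patented implementation. All names here (`TextPipeline`, `RoI`, the `toy_*` functions) are hypothetical, and the stages are plain functions standing in for trained networks. In particular, filtering by a score threshold is an assumption, since the abstract does not state the filtering criterion.

```python
from dataclasses import dataclass
from typing import Callable, List, Tuple

Image = List[List[float]]          # grayscale image as rows of pixel values
FeatureMap = List[List[float]]     # stand-in for a convolutional feature map
Box = Tuple[int, int, int, int]    # (x0, y0, x1, y1), end-exclusive


@dataclass
class RoI:
    box: Box
    score: float                   # hypothetical confidence that the box holds text


@dataclass
class TextPipeline:
    """Wires the four stages together in the order given in the abstract."""
    backbone: Callable[[Image], List[FeatureMap]]
    roi_network: Callable[[List[FeatureMap]], List[RoI]]
    filtering_block: Callable[[List[RoI]], List[RoI]]
    recognizer: Callable[[List[FeatureMap], RoI], str]

    def __call__(self, image: Image) -> List[Tuple[Box, str]]:
        feature_maps = self.backbone(image)           # image -> feature map(s)
        rois = self.roi_network(feature_maps)         # feature map(s) -> RoIs
        final_rois = self.filtering_block(rois)       # RoIs -> final RoIs
        # final RoIs -> one predicted character sequence per RoI
        return [(r.box, self.recognizer(feature_maps, r)) for r in final_rois]


# Toy stand-ins (assumptions for illustration only, not trained networks).
def toy_backbone(image: Image) -> List[FeatureMap]:
    return [image]                                    # identity "feature map"


def toy_roi_network(feature_maps: List[FeatureMap]) -> List[RoI]:
    # Propose one RoI per row, scored by the row's mean intensity.
    fmap = feature_maps[0]
    return [RoI(box=(0, y, len(row), y + 1), score=sum(row) / len(row))
            for y, row in enumerate(fmap)]


def toy_filter(rois: List[RoI], threshold: float = 0.5) -> List[RoI]:
    # Assumed criterion: keep RoIs whose score reaches the threshold.
    return [r for r in rois if r.score >= threshold]


def toy_recognizer(feature_maps: List[FeatureMap], roi: RoI) -> str:
    # Map each value inside the RoI to a character; a real stack would decode text.
    x0, y0, x1, _ = roi.box
    return "".join("#" if v > 0.5 else "." for v in feature_maps[0][y0][x0:x1])


pipeline = TextPipeline(toy_backbone, toy_roi_network, toy_filter, toy_recognizer)
image = [[0.0, 0.0, 0.0],
         [1.0, 0.0, 1.0],
         [1.0, 1.0, 1.0]]
result = pipeline(image)
print(result)  # [((0, 1, 3, 2), '#.#'), ((0, 2, 3, 3), '###')]
```

Keeping each stage behind a callable mirrors the "connecting" steps of the second method: each block can be swapped for a trained component without changing how the stages are chained.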