Framework for identifying documents
Abstract:
Systems, methods, and other embodiments associated with identifying a document type of an unidentified document based on image features are described. In one embodiment, image pairs are formed by pairing the unidentified document with anchor images from a plurality of anchor images, wherein each anchor image is a known document type. For each image pair, first visual features are extracted from the unidentified document and second visual features are extracted from the paired anchor image. A similarity function is applied to compare the first visual features and the second visual features, and a similarity score is generated for each image pair based on the comparing. The most similar anchor image from the image pairs, which has a greatest similarity score, is identified. The document type of the unidentified document is then predicted as the known document type associated with the most similar anchor image.
Public/Granted literature
Information query
Patent Agency Ranking
0/0