Invention Grant
US08634644B2 System and method for identifying pictures in documents 有权
用于识别文档中的图片的系统和方法

System and method for identifying pictures in documents
Abstract:
A system and method to identify pictures in documents. An image representing a page of a document is received. The image is analyzed to identify text objects in the page. A masked image is generated by masking out regions of the image including the text objects in the page. Groups of pixels in the masked image are identified, wherein a respective group of pixels corresponds to at least one picture in the page. When there is one or more groups of pixels, regions for pictures are identified based on the one or more groups of pixels. Metadata tags for the pictures are stored, wherein a respective metadata tag for a respective picture includes information about a respective bounding box for the respective picture.
Public/Granted literature
Information query
Patent Agency Ranking
0/0