Abstract:
A visual query is received from a client system, along with location information for the client system, and processed by a server system. The server system sends the visual query and the location information to a visual query search system, and receives from the visual query search system enhanced location information based on the visual query and the location information. The server system then sends a search query, including the enhanced location information, to a location-based search system. The search system receives and provides to the client one or more search results to the client system.
Abstract:
A system and method of deep learning using deep networks to predict new views from existing images may generate and improve models and representations from large-scale data. This system and method of deep learning may employ a deep architecture performing new view synthesis directly from pixels, trained from large numbers of posed image sets. A system employing this type of deep network may produce pixels of an unseen view based on pixels of neighboring views, lending itself to applications in graphics generation.
Abstract:
Systems and methods for a dynamic visual search engine are provided. In one example method, a criteria used to partition a set of compressed image descriptors into multiple database shards may be determined. Additionally, a size of a dynamic index may be determined. The dynamic index may represent a dynamic number of images and may be configured to accept insertion of reference images into the dynamic index that can be search against immediately. According to the method, an instruction to merge the uncompressed image descriptors of the dynamic index into the database shards of the compressed image descriptors may be received, and the uncompressed image descriptors of the dynamic index may be responsively merged into the database shards of the compressed image descriptors based on the criteria.
Abstract:
Methods, systems, and apparatus are provided for determining location information for images. In one aspect, a method includes obtaining landmark location data from content depicted in an image and corresponding confidence scores. Also, the method includes obtaining caption location data from user input and corresponding confidence scores, and obtaining metadata location data from data provided by an image capturing device. Further, the method includes identifying location pairs from the landmark, caption, and metadata location data, and generating, for each location pair, a geographic consistency score. Additionally, the method includes selecting a location pair based on the geographic consistency scores, and selecting an image location for the image from the selected location pair. Moreover, the method includes determining an image location score based on a confidence score for one of the locations in the selected location pair, and associating the image location and image location score with the image.