Abstract:
A computing system for realizing visual content of an image collection executes feature detection algorithms and semantic reasoning techniques on the images in the collection to elicit a number of different types of visual features of the images. The computing system indexes the visual features and provides technologies for multi-dimensional content-based clustering, searching, and iterative exploration of the image collection using the visual features and/or the visual feature indices.
Abstract:
A computer vision service includes technologies to, among other things, analyze computer vision or learning tasks requested by computer applications, select computer vision or learning algorithms to execute the requested tasks based on one or more performance capabilities of the computer vision or learning algorithms, perform the computer vision or learning tasks for the computer applications using the selected algorithms, and expose the results of performing the computer vision or learning tasks for use by the computer applications.
Abstract:
A computing system for realizing visual content of an image collection executes feature detection algorithms and semantic reasoning techniques on the images in the collection to elicit a number of different types of visual features of the images. The computing system indexes the visual features and provides technologies for multi-dimensional content-based clustering, searching, and iterative exploration of the image collection using the visual features and/or the visual feature indices.
Abstract:
A computing system for realizing visual content of an image collection executes feature detection algorithms and semantic reasoning techniques on the images in the collection to elicit a number of different types of visual features of the images. The computing system indexes the visual features and provides technologies for multi-dimensional content-based clustering, searching, and iterative exploration of the image collection using the visual features and/or the visual feature indices.
Abstract:
A computing system includes a vision-based user interface platform to, among other things, analyze multi-modal user interactions, semantically correlate stored knowledge with visual features of a scene depicted in a video, determine relationships between different features of the scene, and selectively display virtual elements on the video depiction of the scene. The analysis of user interactions can be used to filter the information retrieval and correlating of the visual features with the stored knowledge.
Abstract:
A system for object detection and tracking includes technologies to, among other things, detect and track moving objects, such as pedestrians and/or vehicles, in a real-world environment, handle static and dynamic occlusions, and continue tracking moving objects across the fields of view of multiple different cameras.
Abstract:
A computing system and method for identifying related visual content of a collection of visual media files comprising one or more image files and video files includes monitoring inputs to the computing system, the inputs associated with a user interaction with electronic content using the computer system, identifying a visual media file in the collection of visual media files relevant to the electronic content based on a semantic label assigned to the visual media file by the computing system, creating a representative image of the identified visual media file, and displaying the representative image for selection. The computing system enables a selection of the displayed representative image for association of the identified visual media file with the electronic content.
Abstract:
Methods and apparatuses of the present invention generally relate to generating actionable data based on multimodal data from unsynchronized data sources. In an exemplary embodiment, the method comprises receiving multimodal data from one or more unsynchronized data sources, extracting concepts from the multimodal data, the concepts comprising at least one of objects, actions, scenes and emotions, indexing the concepts for searchability; and generating actionable data based on the concepts.
Abstract:
A computing system for realizing visual content of an image collection executes feature detection algorithms and semantic reasoning techniques on the images in the collection to elicit a number of different types of visual features of the images. The computing system indexes the visual features and provides technologies for multi-dimensional content-based clustering, searching, and iterative exploration of the image collection using the visual features and/or the visual feature indices.
Abstract:
Embodiments of the present invention are directed towards methods and apparatus for generating a common operating picture of an event based on the event-specific information extracted from data collected from a plurality of electronic information sources. In some embodiments, a method for generating a common operating picture of an event includes collecting data, comprising image data and textual data, from a plurality of electronic information sources, extracting information related to an event from the data, said extracted information comprising image descriptors, visual features, and categorization tags, by applying statistical analysis and semantic analysis, aligning the extracted information to generate aligned information, recognizing event-specific information for the event based on the aligned information, and generating a common operating picture of the event based on the event-specific information.