Abstract:
A system and method of localizing vascular patterns by receiving frames from a video camera, identifying and tracking an object within the frames, determining temporal features associated with the object; and localizing vascular patterns from the frames based on the temporal features associated with the object.
Abstract:
A computer-implemented method for gait analysis of a subject includes obtaining visual data from an image capture device positioned in front of or behind the subject, the visual data comprising at least two image frames of the subject over a period of time walking toward or away from the image capture device, the at least two image frames capturing at least a portion of the gait of the subject, detecting within the at least two images body parts as two-dimensional landmarks using a pose estimation algorithm on each of the at least two frames, generating a joint model depicting the location of the at least one joint in each of the at least two frames, using the joint model to segment a gait cycle for the at least one joint, and comparing the gait cycle to a threshold value to detect abnormal gait.
Abstract:
Methods, systems, and processor-readable media for detecting the side window of a vehicle. A spatial probability map can be calculated, which includes data indicative of likely side window locations of a vehicle in an image. A side window detector can be run with respect to the image of the vehicle to determine detection scores. The detection scores can be weighted based on the spatial probability map. A detected region of interest can be extracted from the image as extracted image patch. An image classification can then be performed with respect to the extracted patch to provide a classification that indicates whether or not a passenger is in the vehicle or no-passenger is in the vehicle.
Abstract:
A method and system for domain adaptation based on multi-layer fusion in a convolutional neural network architecture for feature extraction and a two-step training and fine-tuning scheme. The architecture concatenates features extracted at different depths of the network to form a fully connected layer before the classification step. First, the network is trained with a large set of images from a source domain as a feature extractor. Second, for each new domain (including the source domain), the classification step is fine-tuned with images collected from the corresponding site. The features from different depths are concatenated with and fine-tuned with weights adjusted for a specific task. The architecture is used for classifying high occupancy vehicle images.
Abstract:
A computer-based apparatus including a computer including a processor arranged to select a first video regarding a medical condition; create a second video including segments from the first video; transmit the second video for viewing by qualified medical personnel; receive input from the personnel; based on the input confirm accuracy of a first segment or modify a second segment or delete a third segment; create, from the second video, by at least including the first or second segment or deleting the third segment; transmit the third video for viewing by viewers; receive a respective response from each viewer identifying a respective fourth segment of the third video deemed relevant to the medical condition or enjoyable; create a fourth video including at least a portion of the respective fourth segments; and store the fourth video for inclusion in a video regarding the medical condition.
Abstract:
A system and method of providing annotated trajectories by receiving image frames from a video camera and determining a location based on the image frames from the video camera. The system and method can further include the steps of determining that the location is associated with a preexisting annotation and displaying the preexisting annotation. Additionally or alternatively, the system and method can further include the steps of generating a new annotation automatically or based on a user input and associating the new annotation with the current location.
Abstract:
A method for detecting sitting behavior includes acquiring a sequence of frames capturing a scene-of-interest at an overhead view. The method includes detecting at least one empty seat within the scene-of-interest and associating the seat as being unoccupied and the frame as a reference frame. The method includes extracting reference features describing a region of the unoccupied seat in the reference frame and quantifying the reference features to form a reference feature vector. The method includes extracting features describing the region in a given frame and quantifying the features to form a current feature vector. The method includes measuring a change in a feature vector over time using the reference feature vector and the current feature vector. The method includes and determining a status of the seat in the given frame as being one of occupied and unoccupied based on the change in the feature vector.
Abstract:
A system and method for detecting electronic device use by a driver of a vehicle including acquiring an image including a vehicle from an associated image capture device positioned to view oncoming traffic, locating a windshield region of the vehicle in the captured image, processing pixels of the windshield region of the image for computing a feature vector describing the windshield region of the vehicle, applying the feature vector to a classifier for classifying the image into respective classes including at least classes for candidate electronic device use and candidate electronic device non-use, and outputting the classification.
Abstract:
A method for reconstructing an image of a scene captured using a compressed sensing device. A mask is received which identifies at least one region of interest in an image of a scene. Measurements are then obtained of the scene using a compressed sensing device comprising, at least in part, a spatial light modulator configuring a plurality of spatial patterns according to a set of basis functions each having a different spatial resolution. A spatial resolution is adaptively modified according to the mask. Each pattern focuses incoming light of the scene onto a detector which samples sequential measurements of light. These measurements comprise a sequence of projection coefficients corresponding to the scene. Thereafter, an appearance of the scene is reconstructed utilizing a compressed sensing framework which reconstructs the image from the sequence of projection coefficients.
Abstract:
A system and method of providing annotated trajectories by receiving image frames from a video camera and determining a location based on the image frames from the video camera. The system and method can further include the steps of determining that the location is associated with a preexisting annotation and displaying the preexisting annotation. Additionally or alternatively, the system and method can further include the steps of generating a new annotation automatically or based on a user input and associating the new annotation with the current location.