Abstract:
A method, a system, and a computer readable recording medium are disclosed for performing object recognition. The method includes receiving image data from an image; performing a multilayer feature extraction on the image data; generating current feature maps from the multilayer feature extraction; generating a region of proposal network map from at least the current feature maps, the region of proposal network map having one or more regions of interest; inputting previously generated feature maps and the current feature maps into a classifier with the region of proposed network map; and classifying the one or more regions of interest in the region of proposal network map.
Abstract:
A method, system and computer readable medium for detecting creativity in real-time are disclosed. The method includes sensing electrical activity along a scalp of a subject during a learning phase, the learning phase including presenting to the subject one or more tasks configured to generate electrical activity corresponding to cortical events that are likely to correspond to a creative type experience and cortical events that are likely to correspond to a non-creative type experience. Features from the electrical activity obtained during the learning phase are extracted to create a brainwave profile for the subject. Real-time electrical activity along the scalp of the subject is sensed during a performance of one or more real-time tasks, and the electrical activity of the subject is compared to previously recorded electrical activity using the brainwave profile for the subject to classify the electrical activity obtained during the performance of the one or more real-time.
Abstract:
A method, computer readable medium, and system are disclosed of enhancing cell images for analysis. The method includes performing a multi-thresholding process on a cell image to generate a plurality of images of the cell image; smoothing each component within each of the plurality of images; merging the smoothed components into a merger layer; classifying each of the components of the merged layer into convex cell regions and concave cell regions; combining the concave cell regions with a cell boundary for each of the corresponding concave cell regions to generate a smoothed shape profile for each of the concave cell regions; and generating an output image by combining the convex cell regions with the concave cell regions with smoothed shape profiles.
Abstract:
A method is disclosed for detecting interaction between two or more participants in a meeting, which includes capturing at least one three-dimensional stream of data on the two or more participants; extracting a time-series of skeletal data from the at least one three-dimensional stream of data on the two or more participants; classifying the time-series of skeletal data for each of the two or more participants based on a plurality of body position classifiers; and calculating an engagement score for each of the two or more participants. In addition, a method is disclosed for improving a group interaction in a meeting, which includes calculating, for each of the two or more participants, an individual engagement state based on attitudes of the participant, wherein the individual engagement state is an engagement state of the participant to the meeting including an engaged state and a disengaged state.
Abstract:
A method, computer readable storage medium, and system are disclosed for improving communication productivity, comprising: capturing at least one three-dimensional (3D) stream of data on two or more subjects; extracting a time-series of skeletal data from the at least one 3D stream of data on the two or more subjects; and determining an engagement index between the two or more subjects by comparing the time-series of skeletal data on each of the two or more subjects over a time window.
Abstract:
A method for recognizing abnormal behavior is disclosed, the method includes: capturing at least one video stream of data on one or more subjects; extracting body skeleton data from the at least one video stream of data; classifying the extracted body skeleton data as normal behavior or abnormal behavior; and generating an alert, if the extracted skeleton data is classified as abnormal behavior.
Abstract:
A method, a system, and a non-transitory computer readable medium for recognizing an object are disclosed, the method including: emitting an array of infrared rays from an infrared emitter towards a projection region, the projection region including a first object; generating a reference infrared image by recording an intensity of ray reflection from the projection region without the first object; generating a target infrared image by recording the intensity of ray reflection from the projection region with the first object; comparing the target infrared image to the reference infrared image to generate a predetermined intensity threshold; and extracting the first object from the target infrared image, if the intensity of ray reflection of the target infrared image of the first object exceeds the predetermined intensity threshold.
Abstract:
A method, computer readable storage medium, and system are disclosed for improving communication productivity in a conference between two or more subjects, wherein at least one of the two or more subjects participates in the conference from a first location and one or more of the two or more subjects participate in the meeting from a second location. The method includes capturing, at least one first three-dimensional (3D) stream of data and at least one second three-dimensional (3D) stream of data on each of the two or more subjects participating in the conference; generating a synchrony score for the two or more subjects, wherein the synchrony score is calculated by comparing time series of skeletal data of each of the two or more subjects to one another for a defined period of time; and using the synchrony score to generate an engagement index between the two or more subjects.
Abstract:
A method and system for recognizing behavior is disclosed, the method includes: capturing at least one video stream of data on one or more subjects; extracting body skeleton data from the at least one video stream of data; computing feature extractions on the extracted body skeleton data to generate a plurality of 3 dimensional delta units for each frame of the extracted body skeleton data; generating a plurality of histogram sequences for each frame by projecting the plurality of 3 dimensional delta units for each frame to a spherical coordinate system having a plurality of spherical bins; generating an energy map for each of the plurality of histogram sequences by mapping the plurality of spherical bins versus time; applying a Histogram of Oriented Gradients (HOG) algorithm on the plurality of energy maps to generate a single column vector; and classifying the single column vector as a behavior and/or emotion.
Abstract:
A method, system and non-transitory computer readable medium for recognizing gestures are disclosed, the method includes capturing at least one three-dimensional (3D) video stream of data on a subject; extracting a time-series of skeletal data from the at least one 3D video stream of data; isolating a plurality of points of abrupt content change called temporal cuts, the plurality of temporal cuts defining a set of non-overlapping adjacent segments partitioning the time-series of skeletal data; identifying among the plurality of temporal cuts, temporal cuts of the time-series of skeletal data having a positive acceleration; and classifying each of the one or more pair of consecutive cuts with the positive acceleration as a gesture boundary.