Abstract:
A method and apparatus with emotion recognition acquires a plurality of pieces of data corresponding a plurality of inputs for each modality and corresponding to a plurality of modalities; determines a dynamics representation vector corresponding to each of the plurality of modalities based on a plurality of features for each modality extracted from the plurality of pieces of data; determines a fused representation vector based on the plurality of dynamics representation vectors corresponding to the plurality of modalities; and recognizes an emotion of a user based on the fused representation vector.
Abstract:
A method and apparatus for estimating a pose that estimates a pose of a user using a depth image is provided, the method including, recognizing a pose of a user from a depth image, and tracking the pose of the user using a user model exclusively of one another to enhance precision of estimating the pose.
Abstract:
An object recognition system is provided. The object recognition system for recognizing an object may include an input unit to receive, as an input, a depth image representing an object to be analyzed, and a processing unit to recognize a visible object part and a hidden object part of the object, from the depth image, by using a classification tree. The object recognition system may include a classification tree learning apparatus to generate the classification tree.
Abstract:
An apparatus for processing a depth image using a relative angle between an image sensor and a target object includes an object image extractor to extract an object image from the depth image, a relative angle calculator to calculate a relative angle between an image sensor used to photograph the depth image and a target object corresponding to the object image, and an object image rotator to rotate the object image based on the relative angle and a reference angle.
Abstract:
A processor-implemented method with liveness detection includes: receiving a plurality of phase images of different phases; generating a plurality of preprocessed phase images by performing preprocessing, including edge enhancement processing, on the plurality of phase images of different phases; generating a plurality of differential images based on the preprocessed phase images; generating a plurality of low-resolution differential images having lower resolutions than the differential images, based on the differential images; generating a minimum map image based on the low-resolution differential images; and performing a liveness detection on an object in the phase images based on the minimum map image.
Abstract:
A method and apparatus for detecting a liveness based on a phase difference are provided. The method includes generating a first phase image based on first visual information of a first phase, generating a second phase image based on second visual information of a second phase, generating a minimum map based on a disparity between the first phase image and the second phase image, and detecting a liveness based on the minimum map.
Abstract:
An image sensor includes: a motion detection circuit configured to detect a motion in image frames; and a micro control unit (MCU) configured to adjust at least a portion of a target frame among the image frames based on whether the motion is detected, and detect whether a target object is present based on the adjusted portion of the target frame.
Abstract:
An object classification method and apparatus are disclosed. The object classification method includes receiving an input image, storing first feature data extracted by a first feature extraction layer of a neural network configured to extract features of the input image, receiving second feature data from a second feature extraction layer which is an upper layer of the first feature extraction layer, generating merged feature data by merging the first feature data and the second feature data, and classifying an object in the input image based on the merged feature data.
Abstract:
A processor-implemented method includes: generating a preprocessed infrared (IR) image by performing first preprocessing based on an IR image including an object; generating a preprocessed depth image by performing second preprocessing based on a depth image including the object; and determining whether the object is a genuine object based on the preprocessed IR image and the preprocessed depth image.
Abstract:
A processor-implemented method of tracking a target object includes: extracting a feature from frames of an input image; selecting one a neural network model from among a plurality of neural network models that is provided in advance based on a feature value range, based on a feature value of a target object that is included in the feature of a previous frame among the frames; and generating a bounding box of the target object included in a current frame among the frames, based on the selected neural network model.