UNCERTAINTY-AWARE FUSION TOWARDS LARGE-SCALE NERF

    公开(公告)号:WO2023086192A1

    公开(公告)日:2023-05-19

    申请号:PCT/US2022/047232

    申请日:2022-10-20

    Abstract: A method for achieving high-fidelity novel view synthesis and 3D reconstruction for large-scale scenes is presented. The method includes obtaining images from a video stream received from a plurality of video image capturing devices, grouping the images into different image clusters representing a large-scale 3D scene, training a neural radiance field (NeRF) and an uncertainty multilayer perceptron (MLP) for each of the image clusters to generate a plurality of NeRFs and a plurality of uncertainty MLPs for the large-scale 3D scene, applying a rendering loss and an entropy loss to the plurality of NeRFs, performing uncertainty-based fusion to the plurality of NeRFs to define a fused NeRF, and jointly fine-tuning the plurality of NeRFs and the plurality of uncertainty MLPs, and during inference, applying the fused NeRF for novel view synthesis of the large-scale 3D scene.

    PRIVACY-PRESERVING VISUAL RECOGNITION VIA ADVERSARIAL LEARNING

    公开(公告)号:WO2020097182A1

    公开(公告)日:2020-05-14

    申请号:PCT/US2019/060037

    申请日:2019-11-06

    Abstract: A method for protecting visual private data by preventing data reconstruction from latent representations of deep networks is presented. The method includes obtaining latent features (316) from an input image (312) and learning, via an adversarial reconstruction learning framework (318), privacy -preserving feature representations to maintain utility performance and prevent the data reconstruction by: simulating a black-box model inversion attack by training a decoder (332) to reconstruct the input image from the latent features and training an encoder (314) to maximize a reconstruction error to prevent the decoder from inverting the latent features while minimizing the task loss.

    FACE-AWARE PERSON RE-IDENTIFICATION SYSTEM
    4.
    发明申请

    公开(公告)号:WO2022103684A1

    公开(公告)日:2022-05-19

    申请号:PCT/US2021/058407

    申请日:2021-11-08

    Abstract: A method for employing facial information in unsupervised person re-identification is presented. The method includes extracting (1001), by a body feature extractor, body features from a first data stream, extracting (1003), by a head feature extractor, head features from a second data stream, outputting (1005) a body descriptor vector from the body feature extractor, outputting (1007) a head descriptor vector from the head feature extractor, and concatenating (1009) the body descriptor vector and the head descriptor vector to enable a model to generate a descriptor vector.

    VOTING-BASED APPROACH FOR DIFFERENTIALLY PRIVATE FEDERATED LEARNING

    公开(公告)号:WO2022072776A1

    公开(公告)日:2022-04-07

    申请号:PCT/US2021/053086

    申请日:2021-10-01

    Abstract: A method for employing a general label space voting-based differentially private federated learning (DPFL) framework is presented. The method includes labeling (1010) a first subset of unlabeled data from a first global server, to generate first pseudo-labeled data, by employing a first voting-based DPFL computation where each agent trains a local agent model by using private local data associated with the agent, labeling (1020) a second subset of unlabeled data from a second global server, to generate second pseudo-labeled data, by employing a second voting-based DPFL computation where each agent maintains a data- independent feature extractor, and training (1030) a global model by using the first and second pseudo-labeled data to provide provable differential privacy (DP) guarantees for both instance-level and agent- level privacy regimes.

    DOMAIN GENERALIZABLE CONTINUAL LEARNING USING COVARIANCES

    公开(公告)号:WO2023086196A1

    公开(公告)日:2023-05-19

    申请号:PCT/US2022/047514

    申请日:2022-10-24

    Abstract: A computer-implemented method for model training is provided. The method includes receiving, by a hardware processor, sets of images, each set corresponding to a respective task. The method further includes training, by the hardware processor, a task-based neural network classifier having a center and a covariance matrix for each of a plurality of classes in a last layer of the task-based neural network classifier and a plurality of convolutional layers preceding the last layer, by using a similarity between an image feature of a last convolutional layer from among the plurality of convolutional layers and the center and the covariance matrix for a given one of the plurality of classes, the similarity minimizing an impact of a data model forgetting problem.

    DOMAIN GENERALIZED MARGIN VIA META-LEARNING FOR DEEP FACE RECOGNITION

    公开(公告)号:WO2022103748A1

    公开(公告)日:2022-05-19

    申请号:PCT/US2021/058612

    申请日:2021-11-09

    Abstract: A method for training a model for face recognition is provided. The method forward trains (610) a training batch of samples to form a face recognition model w(t), and calculates (620) sample weights for the batch. The method obtains (630) a training batch gradient with respect to model weights thereof and updates, using the gradient, the model w(t) to a face recognition model what(t). The method forwards (640) a validation batch of samples to the face recognition model what(t). The method obtains (650) a validation batch gradient, and updates, using the validation batch gradient and what(t), a sample-level importance weight of samples in the training batch to obtain an updated sample-level importance weight. The method obtains (660) a training batch upgraded gradient based on the updated sample-level importance weight of the training batch samples, and updates (660), using the upgraded gradient, the model w(t) to a trained model w(t+l) corresponding to a next iteration.

    FACE RECOGNITION FROM UNSEEN DOMAINS VIA LEARNING OF SEMANTIC FEATURES

    公开(公告)号:WO2022103682A1

    公开(公告)日:2022-05-19

    申请号:PCT/US2021/058404

    申请日:2021-11-08

    Abstract: A method for improving face recognition from unseen domains including obtaining (1001) face images with associated identities from a plurality of datasets, randomly (1003) selecting two datasets of the plurality of datasets to train a model, sampling (1005) batch face images and their corresponding labels, sampling (1007) triplet samples including one anchor face image, a sample face image from a same identity, and a sample face image from a different identity than that of the one anchor face image, performing (1009) a forward pass by using the samples of the selected two datasets, finding (1011) representations of the face images by using a backbone convolutional neural network (CNN), generating (1013) covariances from the representations of the face images and the backbone CNN, the covariances made in different spaces by using positive pairs and negative pairs, and employing (1015) the covariances to compute a cross-domain similarity loss function.

    DOMAIN ADAPTATION FOR SEMANTIC SEGMENTATION VIA EXPLOITING WEAK LABELS

    公开(公告)号:WO2021097055A1

    公开(公告)日:2021-05-20

    申请号:PCT/US2020/060178

    申请日:2020-11-12

    Abstract: Systems and methods for adapting semantic segmentation across domains is provided. The method includes inputting (720) a source image into a segmentation network, and inputting (710) a target image into the segmentation network. The method further includes identifying (760) category wise features for the source image and the target image using category wise pooling, and discriminating (780) between the category wise features for the source image and the target image. The method further includes training (730) the segmentation network with a pixel-wise cross-entropy loss on the source image, and a weak image classification loss and an adversarial loss on the target image, and outputting a semantically segmented target image.

Patent Agency Ranking