MULTI-MODALITY DATA ANALYSIS ENGINE FOR DEFECT DETECTION

    公开(公告)号:WO2023086533A1

    公开(公告)日:2023-05-19

    申请号:PCT/US2022/049646

    申请日:2022-11-11

    Abstract: Systems and methods for defect detection for vehicle operations, including collecting a multiple modality input data stream from a plurality of different types of vehicle sensors, extracting one or more features from the input data stream using a grid-based feature extractor, and retrieving spatial attributes of objects positioned in any of a plurality of cells of the grid-based feature extractor. One or more anomalies are detected based on residual scores generated by each of cross attention-based anomaly detection and time-series-based anomaly detection. One or more defects are identified based on a generated overall defect score determined by integrating the residual scores for the cross attention-based anomaly detection and the time-series based anomaly detection being above a predetermined defect score threshold. Operation of the vehicle is controlled based on the one or more defects identified.

    SEQUENCE MODELS FOR AUDIO SCENE RECOGNITION
    5.
    发明申请

    公开(公告)号:WO2021041144A1

    公开(公告)日:2021-03-04

    申请号:PCT/US2020/047171

    申请日:2020-08-20

    Abstract: A method is provided. Intermediate audio features are generated (610) from an input acoustic sequence. Using a nearest neighbor search, segments of the input acoustic sequence are classified (620) based on the intermediate audio features to generate a final intermediate feature as a classification for the input acoustic sequence. Each segment corresponds to a respective different acoustic window. The generating step includes learning (610A) the intermediate audio features from Multi-Frequency Cepstral Component (MFCC) features extracted from the input acoustic sequence. The generating step includes dividing (610B) the same scene into the different acoustic windows having varying MFCC features. The generating step includes feeding (610E) the MFCC features of each of the different acoustic windows into respective LSTM units such that a hidden state of each respective LSTM unit is passed through an attention layer to identify feature correlations between hidden states at different time steps corresponding to different ones of the different acoustic windows.

    WORD-OVERLAP-BASED CLUSTERING CROSS-MODAL RETRIEVAL

    公开(公告)号:WO2021015936A1

    公开(公告)日:2021-01-28

    申请号:PCT/US2020/040649

    申请日:2020-07-02

    Abstract: A system (200) for cross-modal data retrieval is provided that includes a neural network having a time series encoder (211) and text encoder (212) which are jointly trained using an unsupervised training method which is based on a loss function. The loss function jointly evaluates a similarity of feature vectors of training sets of two different modalities of time series and free-form text comments and a compatibility of the time series and the free-form text comments with a word-overlap-based spectral clustering method configured to compute pseudo labels for the unsupervised training method. The computer processing system further includes a database (205) for storing the training sets with feature vectors extracted from encodings of the training sets. The encodings are obtained by encoding a training set of the time series using the time series encoder and encoding a training set of the free-form text comments using the text encoder.

    ORDINAL TIME SERIES CLASSIFICATION WITH MISSING INFORMATION

    公开(公告)号:WO2022055698A1

    公开(公告)日:2022-03-17

    申请号:PCT/US2021/047286

    申请日:2021-08-24

    Abstract: A method classifies missing labels. The method computes (320), using a neural network model trained on training data, rank-based statistics of a feature of a time series segment to attempt to select two candidate labels from the training data that the segment most likely belongs to. The method classifies(350) the segment using k-NN-based classification applied to the training data, responsive to the two candidate labels being present in the training data. The method classifies (335) the segment by hypothesis testing, responsive to only one candidate label being present in the training data. The method classifies (345) the segment into a class with higher values of the rank-based statistics from among a plurality of classes with different values of the rank-based statistics, responsive to no candidate labels being present in the training data. The method corrects (340) a prediction by an applicable one of the classifying steps by majority voting with time windows.

Patent Agency Ranking