-
公开(公告)号:US20230274555A1
公开(公告)日:2023-08-31
申请号:US17682610
申请日:2022-02-28
Applicant: Verizon Patent and Licensing Inc.
Inventor: Matteo SIMONCINI , Douglas COIMBRA DE ANDRADE , Leonardo TACCARI , Leonardo SARTI , Francesco SAMBO , Fabio SCHOEN , Niccolo BELLACCINI
CPC classification number: G06V20/58 , G06V10/40 , G06V10/82 , G06N3/0454 , G06N3/0445
Abstract: A device may receive a video and corresponding sensor information associated with a vehicle, and may extract feature vectors associated with the corresponding sensor information and an appearance and a geometry of another vehicle captured in the video. The device may generate a tensor based on the feature vectors, and may process the tensor, with a convolutional neural network model, to generate a modified tensor. The device may select a decoder model from a plurality of decoder models, and may process the modified tensor, with the decoder model, to generate a caption for the video based on attributes associated with the video. The device may perform one or more actions based on the caption for the video.
-
2.
公开(公告)号:US20240290118A1
公开(公告)日:2024-08-29
申请号:US18175993
申请日:2023-02-28
Applicant: Verizon Patent and Licensing Inc.
Inventor: Niccolo BELLACCINI , Matteo SIMONCINI , Douglas COIMBRA DE ANDRADE , Francesco SAMBO
IPC: G06V20/70 , G06F16/51 , G06F16/53 , G06F16/58 , G06F40/186 , G06F40/279 , G06V10/40 , G06V10/764 , G06V10/82
CPC classification number: G06V20/70 , G06F16/51 , G06F16/53 , G06F16/5866 , G06F40/186 , G06F40/279 , G06V10/40 , G06V10/764 , G06V10/82
Abstract: A device may receive a plurality of narratives associated with a plurality of scenes and an image identifying a scene not included in the plurality of scenes, and may process the image, with a classifier model, to detect a plurality of features in the image. The device may replace keywords in the plurality of narratives, with tags, to generate a plurality of sentences, and may group similar sentences of the plurality of sentences, based on a defined measure of dissimilarity, into clusters of templates. The device may select a candidate template from each of the clusters to generate a set of candidate templates, and may select a template from the set of candidate templates. The device may populate tags of the template with the plurality of features detected in the image to generate an image caption, and may provide the image and the image caption for display.
-
3.
公开(公告)号:US20240242510A1
公开(公告)日:2024-07-18
申请号:US18155435
申请日:2023-01-17
Applicant: Verizon Patent and Licensing Inc.
Inventor: Niccolo BELLACCINI , Matteo SIMONCINI , Andrea BENERICETTI , Henrique Pineiro MONTEAGUDO , Francesco SAMBO
CPC classification number: G06V20/588 , B60W50/14 , G06T3/40 , G06T7/13 , G06T7/20 , G06T2207/20084 , G06T2207/30256
Abstract: In some implementations, a video system may receive, from a camera mounted to a vehicle, a video of a portion of a road on which the vehicle is traveling. The video system may extract, from each frame of a plurality of frames associated with the video of the road, a frame strip to form a plurality of frame strips, wherein each frame strip extends a predetermined width in a horizontal direction and a predetermined height in a vertical direction. The video system may form, from each frame strip, a single-pixel strip, to form a plurality of single-pixel strips. The video system may compile the plurality of single-pixel strips to form a motion profile. The video system may determine, using machine learning, one of: at least one driving maneuver associated with the vehicle based on the motion profile, or that no driving maneuvers are present in the motion profile.
-
-