-
公开(公告)号:US20220292811A1
公开(公告)日:2022-09-15
申请号:US17619615
申请日:2019-07-12
Applicant: SONY INTERACTIVE ENTERTAINMENT INC.
Inventor: Tsutomu HORIKAWA , Daichi ONO , Hiroyuki YABE
IPC: G06V10/764 , G06V20/64 , G06V20/40 , G06V10/26 , G06T7/73
Abstract: Provided are an image processing device, an image processing method, and a program for recognizing an object in a three-dimensional map, which do not require collecting learning data of the three-dimensional map and can perform high-speed processing with a small load. The image processing device includes an image acquiring section that sequentially acquires a two-dimensional input image for each frame; an object type recognition executing section that attaches, to each of pixels of the input image acquired for each frame, a label indicating a type of an object represented by the pixels; and a labeling section that executes three-dimensional position recognition of a subject represented in the input image to create a three-dimensional map, based on the input image sequentially input, and attaches, to each voxel included in the three-dimensional map, the label of the pixel corresponding to the voxel.
-
公开(公告)号:US20220335681A1
公开(公告)日:2022-10-20
申请号:US17635951
申请日:2019-09-12
Applicant: SONY INTERACTIVE ENTERTAINMENT INC.
Inventor: Tsutomu HORIKAWA , Daichi ONO , Hiroyuki YABE
Abstract: Provided are an image processing device, an image processing method, and a program capable of forming an accurate three-dimensional map even in a case where a moving object is included in an input image. The image processing device includes an image acquiring section that sequentially acquires two-dimensional captured images, an object type recognition executing section that attaches, to each pixel in the sequentially acquired captured images, a label indicating the type of an object represented by the pixel, and a three-dimensional map creating section that executes three-dimensional position recognition of each pixel of the captured images to create a three-dimensional map, on the basis of the sequentially acquired captured images, and the three-dimensional map creating section restricts the three-dimensional position recognition of each pixel of the captured images according to the label attached to the pixel.
-