Multi-task learning for real-time semantic and/or depth aware instance segmentation and/or three-dimensional object bounding
Abstract:
A machine-learning (ML) architecture for determining three or more outputs, such as a two- and/or three-dimensional region of interest, semantic segmentation, direction logits, depth data, and/or instance segmentation associated with an object in an image. The ML architecture may produce these outputs at a rate of 30 or more frames per second on consumer-grade hardware.
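To make the throughput figure concrete: a rate of 30 frames per second leaves about 33.3 ms per frame for all outputs combined. The following is a minimal sketch of that arithmetic, not part of the patent; the helper names `frame_budget_ms` and `meets_rate` are hypothetical, and using the mean latency as the pass criterion is an assumption made for illustration.

```python
def frame_budget_ms(fps: float) -> float:
    """Per-frame time budget in milliseconds for a target frame rate."""
    if fps <= 0:
        raise ValueError("fps must be positive")
    return 1000.0 / fps


def meets_rate(latencies_ms, fps: float = 30.0) -> bool:
    """Return True if the mean per-frame latency fits the budget for `fps`.

    Assumption: the mean is used as the criterion; a stricter check
    could use the worst-case or a high-percentile latency instead.
    """
    if not latencies_ms:
        raise ValueError("need at least one latency sample")
    mean_latency = sum(latencies_ms) / len(latencies_ms)
    return mean_latency <= frame_budget_ms(fps)
```

For example, measured per-frame latencies of 30, 32, and 35 ms average below the 30 frames-per-second budget, while latencies of 40 and 41 ms do not.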