Abstract:
Autonomous driving systems described herein provide an efficient way to manage camera-based perception by considering the characteristics of captured images. In one example, a camera sensor may capture an image and a processor may determine a first region of interest (ROI) within the image and a second ROI within the image. The processor may generate a first image of the first ROI and a second image of the second ROI. The processor may transmit a control signal based on one or more objects detected in the first ROI and/or one or more objects detected in the second ROI to cause the vehicle to perform an autonomous driving operation.
Abstract:
Certain aspects of the present disclosure provide a method for lane marker detection, including: receiving an input image; providing the input image to a lane marker detection model; processing the input image with a shared lane marker portion of the lane marker detection model; processing output of the shared lane marker portion of the lane marker detection model with a plurality of lane marker-specific representation layers of the lane marker detection model to generate a plurality of lane marker representations; and outputting a plurality of lane markers based on the plurality of lane marker representations.
Abstract:
A method, which is performed by an electronic device, of automatically activating a flash for an image sensor of the electronic device is disclosed. The method may include receiving a first image including at least one text region and determining feature data characterizing the at least one text region in the first image. The method may also identify at least one candidate specular reflection region in the first image. Based on the feature data and the at least one candidate specular reflection region, the flash may be activated. Upon activating the flash, a second image including the at least one text region may be captured.
Abstract:
A method, a computer-readable medium, and an apparatus for object detection are provided. The apparatus may determine a regression vector using a neural network based on an input image that contains an object. The object may have a planar surface with a known shape. The apparatus may derive a transform matrix based on the regression vector. The apparatus may identify a precise boundary of the object based on the transform matrix. The precise boundary of the object may include a plurality of vertices of the object. To identify the boundary of the object, the apparatus may apply the transform matrix to a determined shape of the object.
Abstract:
A method for capturing an image of a traffic sign by an image sensor is disclosed. The method may include capturing, by the image sensor, at least one image including the traffic sign, wherein the image sensor is mounted in a vehicle. The method may include detecting, by a processor, the traffic sign in the at least one image. The method may also include in response to detecting the traffic sign, determining, by the processor, at least one direction of the image sensor based on the at least one image and motion of the vehicle. In addition, the method may include adjusting, by the processor, the image sensor to the at least one direction.
Abstract:
A method, which is performed by an electronic device, for resizing an image having text is disclosed. The method may include determining layout information of at least one text region in the image. The layout information may include at least one of a number, a size, a location, a shape, or a text density of the at least one text region in the image. The method may also select a seam carving operation, a cropping operation, or a scaling operation for the image based on the layout information, a size of the image, and a target image size. The selected operation may be performed to resize the image to the target image size based at least on one of the layout information, the size of the image, or the target image size. The resized image may include the at least one text region.
Abstract:
A method, which is performed by an electronic device, for resizing an image having text is disclosed. The method may include determining layout information of at least one text region in the image. The layout information may include at least one of a number, a size, a location, a shape, or a text density of the at least one text region in the image. The method may also select a seam carving operation, a cropping operation, or a scaling operation for the image based on the layout information, a size of the image, and a target image size. The selected operation may be performed to resize the image to the target image size based at least on one of the layout information, the size of the image, or the target image size. The resized image may include the at least one text region.
Abstract:
A method, which is performed by an electronic device, of automatically activating a flash for an image sensor of the electronic device is disclosed. The method may include receiving a first image including at least one text region and determining feature data characterizing the at least one text region in the first image. The method may also identify at least one candidate specular reflection region in the first image. Based on the feature data and the at least one candidate specular reflection region, the flash may be activated. Upon activating the flash, a second image including the at least one text region may be captured.