LOW-POWER CHANGE-BASED NEURAL NETWORK INFERENCE FOR IMAGE PROCESSING

    公开(公告)号:US20230377321A1

    公开(公告)日:2023-11-23

    申请号:US17664262

    申请日:2022-05-20

    CPC classification number: G06V10/82 G06V20/46 G06V20/70 G06V10/42

    Abstract: One or more aspects of the present disclosure enable high accuracy computer vision and image processing techniques with decreased system resource requirements (e.g., with decreased computational load, shallower neural network designs, etc.). As described in more detail herein, one or more aspects of the described techniques may leverage key layers (e.g., certain key layers of a neural network) and compressed tensor comparisons to efficiently exploit temporal redundancy in videos and other slow changing signals (e.g., to efficiently reduce neural network inference computational burden, with only minor increase in data transfer power consumption). For example, key layers of a neural network may be identified, and temporal/spatial redundancy across frames may be efficiently leveraged such that only a computation region in a subsequent frame n+1 is re-computed in layers between identified key layers, while remaining feature-map calculations may be disabled in the layers between the identified key layers.

    DEMOIRÉ USING MULTIPLE CAMERAS
    4.
    发明申请

    公开(公告)号:US20250054117A1

    公开(公告)日:2025-02-13

    申请号:US18447743

    申请日:2023-08-10

    Abstract: Systems and methods for image processing (e.g., image correction) are described. Embodiments of the present disclosure include image processing techniques that reduce or remove Moiré patterns by leveraging low resolution images (e.g., images captured using low resolution sensors, such as an ultra-wide camera). For instance, an image including a Moiré pattern may be corrected based on a second image having a low resolution. In one example, a device may capture a high resolution image that includes a Moiré pattern. The device may also capture a low resolution image that is aligned with the high resolution image and used to correct (e.g., remove) the Moiré pattern. In some embodiments, the systems and techniques described herein may be implemented in real-time on a user device (e.g., that includes a high resolution image sensor and a low resolution image sensor) for efficient and effective correction of Moiré patterns in image/video capture applications.

    Real-time facial landmark detection

    公开(公告)号:US11574500B2

    公开(公告)日:2023-02-07

    申请号:US17151339

    申请日:2021-01-18

    Abstract: Embodiments of the present disclosure enable and accurate detection of facial landmarks on mobile devices in real-time. An architecture of a facial landmark detection model is provided including one or more of an attention mechanism (e.g., an attention network), a graph convolution model (e.g., a two-dimensional facial geometry graph convolution model), a multiscale coarse-to-fine mechanism, a patch-facial landmark detachment mechanism, and error estimation techniques. The attention mechanism may increase the accuracy of the facial landmark detection model by attending to meaningful patches. The graph convolution network may improve patch feature aggregation by considering the facial landmarks' geometry. The coarse-to-fine mechanism reduces a network convergence to two cycles (e.g., two facial landmark detection iterations). A patch-facial landmark detachment mechanism reduces the computation burden without significant accuracy degradation. Error estimation techniques provide accurate estimation of the regression error to the computation load and increase the accuracy of the model.

    REAL-TIME FACIAL LANDMARK DETECTION

    公开(公告)号:US20220075994A1

    公开(公告)日:2022-03-10

    申请号:US17151339

    申请日:2021-01-18

    Abstract: Embodiments of the present disclosure enable and accurate detection of facial landmarks on mobile devices in real-time. An architecture of a facial landmark detection model is provided including one or more of an attention mechanism (e.g., an attention network), a graph convolution model (e.g., a two-dimensional facial geometry graph convolution model), a multiscale coarse-to-fine mechanism, a patch-facial landmark detachment mechanism, and error estimation techniques. The attention mechanism may increase the accuracy of the facial landmark detection model by attending to meaningful patches. The graph convolution network may improve patch feature aggregation by considering the facial landmarks' geometry. The coarse-to-fine mechanism reduces a network convergence to two cycles (e.g., two facial landmark detection iterations). A patch-facial landmark detachment mechanism reduces the computation burden without significant accuracy degradation. Error estimation techniques provide accurate estimation of the regression error to the computation load and increase the accuracy of the model.

Patent Agency Ranking