Abstract:
A method of determining a number format for representing a set of two or more network parameters of a Deep Neural Network “DNN” for use in configuring hardware logic to implement the DNN. The method includes: determining a sensitivity of the DNN with respect to each network parameter in the set of network parameters; for each candidate number format of a plurality of candidate number formats: determining a quantisation error associated with quantising each network parameter in the set of network parameters in accordance with the candidate number format; generating an estimate of an error in an output of the DNN caused by quantisation of the set of network parameters based on the sensitivities and the quantisation errors; generating a local error based on the estimated error; and selecting the candidate number format of the plurality of candidate number formats with the minimum local error as the number format for the set of network parameters.
Abstract:
Methods for determining a fixed point format for one or more layers of a DNN based on the portion of the output error of the DNN attributed to the fixed point formats of the different layers. Specifically, in the methods described herein the output error of a DNN attributable to the quantisation of the weights or input data values of each layer is determined using a Taylor approximation and the fixed point number format of one or more layers is adjusted based on the attribution. For example, where the fixed point number formats used by a DNN comprises an exponent and a mantissa bit length, the mantissa bit length of the layer allocated the lowest portion of the output error may be reduced, or the mantissa bit length of the layer allocated the highest portion of the output error may be increased. Such a method may be iteratively repeated to determine an optimum set of fixed point number formats for the layers of a DNN.
Abstract:
Methods for selecting fixed point number formats for representing values input to and/or output from layers of a DNN which take into account the impact of the fixed point number formats for a particular layer in the context of the DNN. The methods comprise selecting the fixed point number format(s) used to represent sets of values input to and/or output from a layer one layer at a time in a predetermined sequence wherein any layer is preceded in the sequence by the layer(s) from which it depends. The fixed point number format(s) for each layer is/are selected based on the error in the output of the DNN associated with the fixed point number formats. Once the fixed point number format(s) for a layer has/have been selected any calculation of the error in the output of the DNN for a subsequent layer in the sequence is based on that layer being configured to use the selected fixed point number formats.
Abstract:
An image processing system and method for determining an intrinsic colour component of one or more objects for use in rendering the object(s) is described. One or more input images are received, each representing a view of the object(s), wherein values of the input image(s) are separable into intrinsic colour estimates and corresponding shading estimates. A set of surface normals for the object(s) of the input image(s) is determined. In accordance with the values of the input image(s) and the determined set of surface normals, a global lighting estimate is determined which provides consistent corresponding intrinsic colour estimates for a plurality of regions of the object(s) from the input image(s). The intrinsic colour component is determined in accordance with the values of the input image(s) and the determined global lighting estimate. The determined intrinsic colour component of the object(s) and the determined set of surface normals for the object(s) are stored for use in rendering the object(s).
Abstract:
Relightable free-viewpoint rendering allows a novel view of a scene to be rendered and relit based on multiple views of the scene from multiple camera viewpoints. Image values from the multiple camera viewpoints can be separated into diffuse image components and specular image components, such that an intrinsic colour component of a relightable texture can be determined for a specular scene, by using the separated diffuse image components. Furthermore, surface normals of geometry in the scene can be refined by constructing a height map based on a conservative component of an initial surface normal field and then determining the refined surface normals based on the constructed height map.
Abstract:
A model of a scene of an image (e.g. a frame of a video sequence) is generated from one or more views of the scene captured from one or more different camera viewpoints. An initial texture for applying to the model is derived from the one or more views of the scene. The initial texture is separated into a lighting estimate and a colour estimate, which may be orthogonal and which may be processed independently. The lighting estimate is filtered with a high-pass filter to thereby determine shadow regions of the scene which are regions of detailed shadow which are likely to be caused by ambient occlusion in the scene and which are therefore retained when the texture is relit for rendering the image. A shadow-detail estimate (or “dark map”) is provided which indicates one or more shadow regions of the texture which are to remain in shadow when the image is rendered.