Abstract:
What is disclosed is a novel video processing system and method wherein a plurality of image frames of a video captured using a video camera with a spatial resolution of (M×N) in the (x, y) direction, respectively, and a temporal resolution (T) in frames per unit of time. A first and second magnification factor f1, f2 are selected for spatial enhancement in the (x, y) direction. A third magnification factor f3 is selected for a desired temporal enhancement in (T). The video data is processed using a dictionary comprising high and low resolution patch cubes which are used to induce spatial and temporal components in the video where no data exists. A high resolution course video X0 is generated which has an enhanced spatial resolution of (f1* M)×(f2*N) and an enhanced temporal resolution of (f3*T) frames. The course high resolution video is then smoothed, when found required, to generate a smoothed high resolution video.