Fig. 3: The structure of RecGen2. An encoder-decoder network transforms a low-resolution input image i_2 into a high-frequency image residual r_2. The high-resolution output image is obtained by adding r_2 to the upscaled input image. The dimensions of each feature map are annotated in the figure, and an example output of each convolutional layer is also shown.
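
The residual formulation described in the caption can be illustrated with a minimal sketch. The scale factor of 2, the nearest-neighbour upscaling, and the function names below are assumptions made for illustration; the caption does not specify them. The encoder-decoder itself is replaced by a hypothetical placeholder that returns a zero residual, so only the final "upscaled input plus residual" step is shown.

```python
import numpy as np


def upscale_nearest(image: np.ndarray, factor: int = 2) -> np.ndarray:
    """Nearest-neighbour upscaling of an (H, W, C) image.

    Assumption: the actual upscaling method used with RecGen2 is not
    stated in the caption; nearest-neighbour is a stand-in.
    """
    rows = np.repeat(image, factor, axis=0)
    return np.repeat(rows, factor, axis=1)


def placeholder_residual_net(image: np.ndarray, factor: int = 2) -> np.ndarray:
    """Hypothetical stand-in for the encoder-decoder network.

    Returns an all-zero residual at the output resolution. A trained
    network would instead predict the high-frequency detail r_2.
    """
    h, w, c = image.shape
    return np.zeros((h * factor, w * factor, c), dtype=image.dtype)


def reconstruct(i_2: np.ndarray,
                residual_net=placeholder_residual_net,
                factor: int = 2) -> np.ndarray:
    """High-resolution output = upscaled input + predicted residual."""
    r_2 = residual_net(i_2, factor)          # high-frequency residual
    upscaled = upscale_nearest(i_2, factor)  # low-frequency content
    return upscaled + r_2


# Usage: a tiny 2x2 RGB image becomes a 4x4 RGB image.
i_2 = np.arange(12, dtype=np.float32).reshape(2, 2, 3)
output = reconstruct(i_2)
```

With the zero-residual placeholder the output equals the upscaled input, which makes the division of labour explicit: the upscaled image carries the low-frequency content, and the network only has to supply the missing high-frequency detail.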