1st. Thermal Image Super-Resolution Challenge
In general, thermal images have a poor resolution, which could be improved by using learning-based traditional super-resolution methods. These methods have been largely used in the visible spectral domain. They work by downsampling and adding noise and blur to the given image. These noisy and blurred poor quality images, together with the given images (which are used as the Ground Truths), are used in the learning process.
The approach mentioned above has been mostly used to tackle the super-resolution problem, however there are few contributions where the learning process is based on the usage of a pair of images (low and a high-resolution images) obtained from different cameras. For the current challenge, a novel thermal image dataset has been created, containing images with three different resolutions (low, mid, high) obtained with three different thermal cameras. The challenge consists in creating a solution capable of generating super-resolution images in x2, x3 and x4 scales from the different resolutions, in the case of x2 an additional evaluation will be performed by using a HR image obtained from another camera. The results from each team will be evaluated in two ways as detailed below.
A dataset has been created using three different thermal cameras, in order to have real scenarios, with different conditions and objects. Each camera has a different resolution (low, mid, high). The technical information is shown below in Table 1 and illustrations from each camera depicted in Fig. 1.
Image Description | Brand Camera | FOV | Native Resolution |
---|---|---|---|
Low (LR) | Axis Domo P1290 | 8mm | 160x120 |
Mid (MR) | Axis Q2901-E | 9mm | 320x240 |
High (HR) | FC-632O FLIR | 13mm | 640x512* |
A total of 1021 thermal images were simultaneously taken with each camera. All images are semi-paired. From all these images, 1001 are going to be published (951 for training and 50 for testing), the remaining 20 will be used in the challenge for validation.
PSNR and structural similarity (SSIM) measures are going to be computed over a small set of images left for evaluating the performance of the proposed solution. Two kinds of evaluations are going to be performed. For the first evaluation, a set of 10 downsampled and noisy images will be shared for traditional evaluation as shown in Fig. 2. For the second evaluation a set of 10 real images, from the MR dataset, will be shared. In this case, the obtained SR images will be compared with the corresponding real High Resolution semi-paired images, as shown in Fig. 3. For the second evaluation, HR images are going to be registered with the computed SR images in order to use PSNR & SSIM metrics.
*Due to the semi-paired nature, for a fair comparison, the evaluation will be performed over a small region (50% of image size) centered in the image.
The super-resolution (SR) results must be submitted in a zip file together with a short description of the proposed approach to <pbvs20.tisr.challenge@gmail.com>. The approach with the higher performance in most evaluation metrics will be the winner of the challenge.
Scale | Evaluation 1 | Evaluation 2 |
---|---|---|
x2 | PSNR/SSIM | PSNR/SSIM |
x3 | PSNR/SSIM | --- |
x4 | PSNR/SSIM | --- |