This refers to "A Semantic Segmentation-guided Approach for Ground-to-Aerial Image Matching". It procudes our results on a subset of the CVUSA dataset.
Nowadays the accurate geo-localization of ground-view images has an important role across different domains, including journalism, forensics analysis, and Earth Observation (EO). This work addresses the problem of matching a query ground-view image with the corresponding satellite image without GPS data. This is done by comparing the features from a ground-view image with a satellite one together with its segmentation mask through a three-stream Siamese-like network. The proposed method focuses on limited Field-of-View (FoV) and ground panorama images (images with a FoV of 360°). The novelty is in the fusion of satellite images in combination with their segmentation masks, aimed at ensuring that the model can extract features and focus on the significant parts of the images. This work shows how the proposed model through semantic segmentation mask images improves the performance on the unlabelled CVUSA dataset on all the tested FoV.
- This code uses
TensorFlow 2.10.0
numpy 1.25.2
cv2 4.8.0
and run on modern GPU of at least 6GB of memory - We run the code with
Python 3.9.2
- Download the used CVUSA subset at the following link:
- Clone this repository
git clone https://github.com/pro1944191/SemanticAlignNet
- For training run the following command from the shell:
python train_no_session.py --train_grd_noise 360 --train_grd_FOV $YOUR_FOV --test_grd_FOV $YOUR_FOV
- This command will run the python file
train_no_session.py
, inside it there are other possible parameters to pass as input or to leave as default value $YOUR_FOV
choose a value bewteen 0° and 360°, that is the FoV value used for the ground view images- The trained model will be saved in the path
./saved_models/model_name/
, will be created a folder for each trained epoch and a.txt
file with a recap of the training
- For testing run the following command from the shell:
python test_no_session.py --train_grd_noise 360 --train_grd_FOV $YOUR_FOV --test_grd_FOV $YOUR_FOV
- This command will run the python file
test_no_session.py
, inside it there are other possible parameters to pass as input or to leave as default value $YOUR_FOV
choose a value bewteen 0° and 360°, that is the FoV value used for the ground view images- The results of the testing phase will be saved in a
.mat
file
- Dataset
- Once you downloaded the dataset you will have the following folders:
Streetview
this folder contains the ground view images (streetview)Bingmap
this folder contains the original satellite view imagesSegmap
here are contained the original segmented mask imagesPolarmap/
/normal
this subfolder contains the polar transformed satellite images/segmap
this subfolder contains the polar transformed segmetation mask images
- Once you downloaded the dataset you will have the following folders: