SphereGlue: A Graph Neural Network based feature matching for high-resolution spherical images

Abstract

Traditionally, spherical keypoint matching has been performed using greedy algorithms such as Nearest Neighbor (NN) search. NN-based algorithms often lead to erroneous or insufficient matches, as they fail to leverage global keypoint neighborhood information. Inspired by a recent learned perspective matching approach, we introduce SphereGlue: a Graph Neural Network based feature matcher for high-resolution spherical images. The proposed model naturally handles the severe distortions resulting from geometric transformations. Rigorous evaluations demonstrate the efficacy of SphereGlue in matching both learned and handcrafted keypoints on synthetic and real high-resolution spherical images. Moreover, SphereGlue generalizes well to previously unseen real-world and synthetic scenes. Results on camera pose estimation show that SphereGlue can directly replace state-of-the-art matching algorithms in downstream tasks.

Network Architecture

Architecture

Full paper PDF

Repo Structure


├── data                     <- Keypoint information for two images.
│   ├── akaze                <- Data from the AKAZE detector.
│   ├── kp2d                 <- Data from the KP2D detector.
│   ├── sift                 <- Data from the SIFT detector.
│   ├── superpoint           <- Data from the SuperPoint detector.
│   └── superpoint_tf        <- Data from the SuperPoint_tf detector.
│
├── images                   <- Equirectangular images for visualizing matches
│
├── matches                  <- Folder to save drawn matches (created automatically)
│
├── models
│   └── spherglue.py         <- SphereGlue model definition
│
├── model_weights
│   ├── akaze                <- Model weights for the AKAZE detector.
│   ├── kp2d                 <- Model weights for the KP2D detector.
│   ├── sift                 <- Model weights for the SIFT detector.
│   ├── superpoint           <- Model weights for the SuperPoint detector.
│   └── superpoint_tf        <- Model weights for the SuperPoint_tf detector.
│
├── output                   <- Folder to save the predictions (created automatically)
│
├── utils
│   ├── demo_mydataset.py    <- Data loader
│   └── Utils.py             <- Utility functions
│
├── demo_SphereGlue.py       <- Demo code to run SphereGlue
│
├── LICENSE
│
├── README.md                <- The top-level README for developers using this project.
│
└── requirements.txt         <- The requirements file for reproducing the analysis environment, e.g. generated with `pip freeze > requirements.txt`

Dependencies

Python >= 3.9
PyTorch >= 1.10
PyTorch Geometric >= 2.0
OpenCV >= 4.5
Matplotlib >= 3.5
NumPy >= 1.21

Or simply run pip install -r requirements.txt

Structure of Dataset

Keypoint Coordinates, Keypoint Descriptors, and Keypoint Scores can be extracted from:

  1. SuperPoint: Code
  2. KP2D: Code
  3. Superpoint_tf: Code
  4. Akaze: Will be added soon
  5. Sift: Will be added soon

Keypoint Coordinates used in SphereGlue are spherical coordinates, whereas the keypoint coordinates obtained from the above detectors are in pixel coordinates. To convert them, use:

import numpy as np

def PixelToSpherical(pixel_coordinates: np.ndarray, imgWidth: int, imgHeight: int) -> np.ndarray:
    # Split the (N, 2) array of pixel coordinates into x (column) and y (row) components
    x, y = np.hsplit(pixel_coordinates, 2)
    # Longitude theta in [0, 2*pi), measured across the image width
    theta = (1. - (x + .5) / imgWidth) * 2 * np.pi
    # Colatitude phi in [0, pi], measured down the image height
    phi = ((y + .5) * np.pi) / imgHeight
    return np.hstack((phi, theta))
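
For example (with illustrative keypoint values), converting two pixel keypoints from a 2048x1024 equirectangular image:

pixel_kpts = np.array([[100.0, 200.0], [1500.0, 700.0]])   # (x, y) pixel coordinates
spherical_kpts = PixelToSpherical(pixel_kpts, imgWidth=2048, imgHeight=1024)
# each row of spherical_kpts is (phi, theta) in radians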

Create a list of image pairs and, for each pair, merge the Keypoint Coordinates, Keypoint Descriptors, and Keypoint Scores of the two images into an npz file. The structure of the npz file (a dictionary) is shown below:

{keypointCoords0: Keypoint Coordinates of image 0,
keypointCoords1: Keypoint Coordinates of image 1, 
keypointDescriptors0: Keypoint Descriptors of image 0,
keypointDescriptors1: Keypoint Descriptors of image 1,
keypointScores0: Keypoint Scores of image 0,
keypointScores1: Keypoint Scores of image 1
}
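
A minimal sketch of how such an npz file could be created for one image pair (the file name, array shapes, and random values below are illustrative; only the dictionary keys come from the structure above):

import numpy as np

# Dummy data for illustration: N0/N1 keypoints with D-dimensional descriptors
N0, N1, D = 500, 480, 256
np.savez('pair_0001.npz',                                   # illustrative file name
         keypointCoords0=np.random.rand(N0, 2),             # spherical (phi, theta) of image 0
         keypointCoords1=np.random.rand(N1, 2),
         keypointDescriptors0=np.random.rand(N0, D),
         keypointDescriptors1=np.random.rand(N1, D),
         keypointScores0=np.random.rand(N0),
         keypointScores1=np.random.rand(N1))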

Demo

To run the demo on the data, use python demo_SphereGlue.py --save_npz True

There are 4 flags:

  1. --save_npz: when True, saves the npz prediction files in the output folder.
  2. --draw_matches: when True, saves the drawn matches in the matches folder.
  3. --display_matches: when True, displays the drawn matches.
  4. --detector: selects the keypoint detector, e.g. python demo_SphereGlue.py --save_npz True --detector 'sift'. A combined example is shown after this list.
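
For example, to run the demo with the SIFT detector, save the predictions, and also draw the matches (an illustrative combination of the flags above), one could run:

python demo_SphereGlue.py --save_npz True --draw_matches True --detector 'sift'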

Citation

If you use any ideas from the paper or code in your research, please cite our paper:

@InProceedings{Gava_2023_CVPR,
    author    = {Gava, Christiano and Mukunda, Vishal and Habtegebrial, Tewodros and Raue, Federico and Palacio, Sebastian and Dengel, Andreas},
    title     = {SphereGlue: Learning Keypoint Matching on High Resolution Spherical Images},
    booktitle = {Proceedings of the IEEE/CVF Conference on Computer Vision and Pattern Recognition (CVPR) Workshops},
    month     = {June},
    year      = {2023},
    pages     = {6133-6143}
}
