LVG-SfM: Learning-based view-graph generation for robust on-the-fly SfM

Wentian Gan1, Yifei Yu1, Giulio Perda2, Luca Morelli2, Rui Xia1 , Zongqian Zhan1, Xin Wang1*, Fabio Remondino2

1 Wuhan University, Wuhan, China
2 3D Optical Metrology (3DOM) unit, Bruno Kessler Foundation (FBK), Trento, Italy


Introduction

Most of the SfM methods operate offline, whereas the demand for real-time applications (such as quick disaster response, online measurements, collaborative 3D mapping, etc.) is increasing. Therefore, many researchers investigated online (or real-time) SfM solutions that aim to solve camera poses and sparse point cloud at speeds comparable to the image capturing rate.
Supported by recent advancements in learning-based feature extraction, matching and outlier detection methods, a more robust view-graph can be constructed, significantly enhancing the performances of online SfM. The paper presents a new real-time SfM solution, named LVG-SfM, which integrates and offers three operative processes:


Method Overview

intro_img

Paper
We leverage on local feature extractors like SuperPoint, DISK and ALIKED, and feature matching methods like SuperGlue and LightGlue to extract sufficient and robust correspondences. Then, for each new image, we apply the original retrieval module of on-the-fly SfM with a pre-trained global feature extractor and HNSW, selecting up to 30 of the most similar image pairs. We also leverage on Doppelgangers to differentiate true overlapping image pairs from ambiguous ones. If the number of remaining pairs after disambiguation is above two, the newly captured image is solved with the pipeline of on-the-fly SfM and added into the "registered image" sets, otherwise, it is inserted into the "not registered image" set. Finaly, the different agents involved in the surveying operation can simultaneously work on separate parts of the scene, leading to nonoverlapping image subsets. For each of these subsets, a distinct submap is created and updated in parallel. When the number of common/overlapping images between different submaps reaches a threshold, submaps are merged using the solution described in on-the-fly SfM.


Experiments

  • Performance on poor texture sequences

  • Here shows the reconstruction results on poor texture datasets with various learning-based methods and quantitative Camera poses and view-graph reconstruction results of datasets with repetitive structures. Blunders are highlighted in circles.

    poortext_result

  • Performance of disambiguation on repetitive structures

  • Camera poses and view-graph reconstruction results of datasets with repetitive structures are showed below. Blunders are highlighted in circles.

    repetitive1 repetitive1

    repetitive1 repetitive1


    Application

    Our method has been proven to have significant practical effects. This work, titled "Exploring the potential of collaborative UAV 3D mapping in Kenyan savanna for wildlife research," utilizes the system we have developed in the research theme of enhancing the capabilities of wildlife conservation and ecological monitoring.


    About us

    If you have any questions or advice, you can contact us through following address: Thanks for your support!