DG-3DGS: Dynamic Growing 3D Gaussian Splatting for Monocular Laparoscopic Sequential Scene Reconstruction

Abstract

Scene reconstruction from monocular endoscope video is essential for advancing surgical endoscope image analysis and its applications. However, because endoscopic movement is confined to a narrow space and the view inside cavities is often obstructed, most conventional methods struggle to produce high-quality reconstructions. To overcome these challenges, a novel dynamic growing 3D Gaussian splatting architecture is proposed to construct a 3D model of the endoscopic scene without pre-computed camera poses or Structure from Motion. First, to establish spatial feature associations between frames, a 2D-3D displacement field is designed based on dense feature matches and depth prediction. On this basis, a novel displacement field variational optimization is developed to obtain relative poses by minimizing the energy functional of the field transformation. Second, to address the constrained endoscopic view, a novel dynamic Gaussian growing strategy, built on Gaussian dynamic transformation and differential gradient field optimization, is proposed to sequentially grow the local Gaussian model. Finally, a novel Forward-Reconstruction&Backward-Optimization architecture is proposed to generate the global Gaussian model. The evaluation is conducted on two public endoscopic datasets: SCARED and C3VD. The experimental results show that the proposed method outperforms state-of-the-art methods in both quantitative (PSNR, SSIM and LPIPS) and qualitative comparisons. The project page is https://iheckzza.github.io/DG-3DGS/.
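To make the idea of a 2D-3D displacement field more concrete, the following is a minimal sketch, not the authors' implementation. It assumes matched pixel coordinates and per-pixel depths are already available for two frames, lifts them to 3D with a pinhole intrinsic matrix, and recovers the relative pose with a closed-form Kabsch alignment. The Kabsch step is a stand-in for the variational optimization described in the abstract, and every name here (`backproject`, `kabsch`, `project`, the intrinsics and the synthetic points) is hypothetical.

```python
import numpy as np

def backproject(pixels, depths, K):
    """Lift (N, 2) pixel coordinates with (N,) depths to (N, 3) camera-frame points."""
    ones = np.ones((pixels.shape[0], 1))
    homog = np.hstack([pixels, ones])       # (N, 3) homogeneous pixels
    rays = homog @ np.linalg.inv(K).T       # K^-1 [u, v, 1]^T for each row
    return rays * depths[:, None]           # scale each ray by its depth

def kabsch(src, dst):
    """Least-squares rigid transform (R, t) such that dst ~= src @ R.T + t."""
    src_c, dst_c = src.mean(axis=0), dst.mean(axis=0)
    H = (src - src_c).T @ (dst - dst_c)     # 3x3 cross-covariance
    U, _, Vt = np.linalg.svd(H)
    d = np.sign(np.linalg.det(Vt.T @ U.T))  # guard against reflections
    R = Vt.T @ np.diag([1.0, 1.0, d]) @ U.T
    t = dst_c - R @ src_c
    return R, t

def project(points, K):
    """Pinhole projection; returns pixel coordinates and depths (z values)."""
    uvw = points @ K.T
    return uvw[:, :2] / uvw[:, 2:3], points[:, 2]

# Synthetic stand-in for dense matches plus depth prediction (assumed values).
rng = np.random.default_rng(0)
K = np.array([[500.0, 0.0, 320.0], [0.0, 500.0, 240.0], [0.0, 0.0, 1.0]])
pts_a = rng.uniform([-1.0, -1.0, 4.0], [1.0, 1.0, 6.0], size=(50, 3))
angle = np.deg2rad(5.0)
R_true = np.array([[np.cos(angle), -np.sin(angle), 0.0],
                   [np.sin(angle),  np.cos(angle), 0.0],
                   [0.0, 0.0, 1.0]])
t_true = np.array([0.05, -0.02, 0.1])
pts_b = pts_a @ R_true.T + t_true

pix_a, depth_a = project(pts_a, K)
pix_b, depth_b = project(pts_b, K)
lifted_a = backproject(pix_a, depth_a, K)
lifted_b = backproject(pix_b, depth_b, K)
displacement = lifted_b - lifted_a          # per-match 3D displacement vectors
R_est, t_est = kabsch(lifted_a, lifted_b)   # relative pose between the two frames
```

With noise-free matches the closed-form alignment recovers the pose exactly. Real matches and predicted depths are noisy, which is one reason an iterative energy minimization such as the one proposed may be preferable.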

The architecture of DG-3DGS, from View_N to View_N+1, View_N+1 to View_N+2, and on to View_N+. The Gaussian evolution path runs from G_N to G_N+1, G_N+1 to G_N+2, and finally to G_N+.

The architecture of the proposed dynamic Gaussian growing module between consecutive frames View_N and View_N+1. G_N and G_N+1 denote the Gaussian model under the LCCS of View_N and View_N+1, respectively.
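As a rough illustration of what moving a Gaussian model from the coordinate system of View_N into that of View_N+1 involves, the sketch below applies a rigid transform to Gaussian means and covariances. It covers only the standard change of coordinates (mean to R·mean + t, covariance to R·Σ·Rᵀ). It does not attempt the paper's growing strategy or gradient field optimization, and the function name and example values are assumptions.

```python
import numpy as np

def transform_gaussians(means, covs, R, t):
    """Re-express Gaussians in a new frame given rotation R (3, 3) and translation t (3,).

    means: (N, 3) Gaussian centers; covs: (N, 3, 3) covariance matrices.
    """
    new_means = means @ R.T + t   # rotate, then translate each center
    new_covs = R @ covs @ R.T     # R Sigma R^T, broadcast over N
    return new_means, new_covs

# Two example Gaussians (hypothetical values) and a 90-degree rotation about z.
means = np.array([[1.0, 0.0, 2.0], [0.0, 1.0, 3.0]])
covs = np.stack([np.diag([1.0, 4.0, 9.0]), np.eye(3)])
R = np.array([[0.0, -1.0, 0.0],
              [1.0,  0.0, 0.0],
              [0.0,  0.0, 1.0]])
t = np.array([0.5, 0.0, 0.0])

new_means, new_covs = transform_gaussians(means, covs, R, t)
```

Because the transform is rigid, each covariance keeps its eigenvalues and only its principal axes rotate, so the shape of every Gaussian is preserved.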

Qualitative comparison panels (Predict vs. GT): D1K1, D2K3, D4K2, D8K1.

Qualitative comparison panels (Predict vs. GT): Cecum, Desc, Sigmoid, Trans.
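The Predict and GT panels correspond to the image pairs that metrics such as PSNR score. As a minimal sketch, assuming images are float arrays scaled to [0, 1], PSNR can be computed as below. The function name and the toy images are illustrative, and the authors' evaluation code may differ in details such as value range or masking.

```python
import numpy as np

def psnr(pred, gt, max_val=1.0):
    """Peak signal-to-noise ratio in dB between a prediction and its ground truth."""
    diff = pred.astype(np.float64) - gt.astype(np.float64)
    mse = np.mean(diff ** 2)
    if mse == 0:
        return float("inf")       # identical images
    return 10.0 * np.log10(max_val ** 2 / mse)

# Toy example: a uniform error of 0.1 on a black image gives an MSE of 0.01.
gt_image = np.zeros((4, 4, 3))
pred_image = np.full((4, 4, 3), 0.1)
score = psnr(pred_image, gt_image)
```

Higher PSNR indicates a rendering closer to the ground truth. SSIM and LPIPS complement it by measuring structural and perceptual similarity, respectively.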
