Guide3D: A Bi-planar X-ray Dataset for 3D Shape Reconstruction

1University of Liverpool, UK
2Imperial College London, UK
3University of Information Technology - VNUHCM, VN
4University of Aberdeen, UK
5AIOZ, Singapore
6University of Arkansas, USA
cars peace
Setup Materials: a) Overall setup & endovascular phantom, b) Radifocus (angled) guidewire. and c) Nitrex (straight) guidewire.

Abstract

Endovascular surgical tool reconstruction represents an important factor in advancing endovascular tool navigation, which is an important step in endovascular surgery. However, the lack of publicly available datasets significantly restricts the development and validation of novel machine learning approaches. Moreover, due to the need for specialized equipment such as biplanar scanners, most of the previous research employs monoplanar fluoroscopic technologies, hence only capturing the data from a single view and significantly limiting the reconstruction accuracy. To bridge this gap, we introduce Guide3D, a bi-planar X-ray dataset for 3D reconstruction. The dataset represents a collection of high resolution bi-planar, manually annotated fluoroscopic videos, captured in real-world settings. Validating our dataset within a simulated environment reflective of clinical settings confirms its applicability for real-world applications. Furthermore, we propose a new benchmark for guidewrite shape prediction, serving as a strong baseline for future work. Guide3D not only addresses an essential need by offering a platform for advancing segmentation and 3D reconstruction techniques but also aids the development of more accurate and efficient endovascular surgery interventions.

Dataset Description

Dataset Overview: Guide3D contains manually annotated frames from two views for 3D reconstruction (left), from which the reconstruction is derived (right).
cars peace
The dataset includes 3,664 instances of angled guidewires with fluid and 484 without, while straight guidewires are represented by 2,472 instances with fluid and 2,126 without. This distribution reflects a variety of procedural contexts. All 8,746 images in the dataset are accompanied by manual segmentation ground truth, facilitating the development of algorithms that require segmentation maps as reference data.
Sample Type Radifocus Guidewire (Angled) Nitrex Guidewire (Straight) Total
w fluid 3,664 484 4,148
w/o fluid 2,472 2,126 4,598
Total 6,136 2,610 8,746

Method

The figure illustrates the essential components of the proposed model. a) Spherical coordinates are used to predict the guidewire shape. b) The model predicts the 3D shape of a guidewire from image sequences. A Vision Transformer (ViT) extracts spatial features, which a Gated Recurrent Unit (GRU) processes to capture temporal dependencies, producing hidden states. The final hidden state drives three prediction heads: the Tip Prediction Head for the 3D tip position, the Spherical Offset Prediction Head for coordinate offsets, and the Stop Prediction Head for terminal point probability.
cars peace

BibTeX

Soon