Real-time device detection with rotated bounding boxes and its clinical application

Ma, YingLiang ORCID: https://orcid.org/0000-0001-5770-5843, Howell, Sandra, Rinaldi, Aldo, Dhanjal, Tarv and Rhode, Kawal S. (2024) Real-time device detection with rotated bounding boxes and its clinical application. In: Clinical Image-Based Procedures. Lecture Notes in Computer Science, LNCS 15196 . Springer, pp. 83-93. ISBN 978-3-031-73082-5

[thumbnail of MICCAI2024_CLIP_final] PDF (MICCAI2024_CLIP_final) - Accepted Version
Restricted to Repository staff only until 1 October 2025.

Request a copy

Abstract

Interventional devices and insertable imaging devices such as transesophageal echo (TOE) probes are routinely used in minimally invasive cardiovascular procedures. Detecting their positions and orientations in X-ray fluoroscopic images is important for many clinical applications. Nearly all interventional devices used in cardiovascular procedures contain a wire or wires and are inserted into major blood vessels. In this paper, novel attention mechanisms were designed to guide a convolution neural network (CNN) model to the areas of wires in X-ray images. The first attention mechanism was achieved by using multi-scale Gaussian derivative filters in the first convolutional layer inside the proposed CNN backbone. By combining these multi-scale Gaussian derivative filters together, they can provide a global attention on the wire-like or tube-like structures. Furthermore, the dot-product based attention layer was used to calculate the similarity between the random filter output and the output from the Gaussian derivative filters, which further enhances the attention on the wire-like or tube-like structures. By using both attention mechanisms, a high-performance CNN backbone was created, and it can be plugged into light-weighted CNN models for multiple object detection. An accuracy of 0.88±0.04 was achieved for detecting an echo probe in X-ray images at 58 FPS, which was measured by inter-section-over-union (IoU). Based on the detected pose of the echo probe, 3D echo can be fused with live X-ray images to provide a hybrid guidance solution. Codes are available at https://github.com/YingLiangMa/AttWire.

Item Type: Book Section
Faculty \ School: Faculty of Science > School of Computing Sciences
UEA Research Groups: Faculty of Science > Research Groups > Norwich Epidemiology Centre
Faculty of Medicine and Health Sciences > Research Groups > Norwich Epidemiology Centre
Related URLs:
Depositing User: LivePure Connector
Date Deposited: 08 Oct 2024 17:30
Last Modified: 13 Oct 2024 06:30
URI: https://ueaeprints.uea.ac.uk/id/eprint/96952
DOI: 10.1007/978-3-031-73083-2_9

Actions (login required)

View Item View Item