Cross-attention learning enables real-time nonuniform rotational distortion correction in OCT

Zhang, Haoran; Yang, Jianlong; Zhang, Jingqian; Zhao, Shiqing; Zhang, Aili

doi:10.1364/BOE.512337

Electrical Engineering and Systems Science > Image and Video Processing

arXiv:2306.04512 (eess)

[Submitted on 7 Jun 2023 (v1), last revised 5 Jan 2024 (this version, v2)]

Title:Cross-attention learning enables real-time nonuniform rotational distortion correction in OCT

Authors:Haoran Zhang, Jianlong Yang, Jingqian Zhang, Shiqing Zhao, Aili Zhang

View PDF HTML (experimental)

Abstract:Nonuniform rotational distortion (NURD) correction is vital for endoscopic optical coherence tomography (OCT) imaging and its functional extensions, such as angiography and elastography. Current NURD correction methods require time-consuming feature tracking or cross-correlation calculations and thus sacrifice temporal resolution. Here we propose a cross-attention learning method for the NURD correction in OCT. Our method is inspired by the recent success of the self-attention mechanism in natural language processing and computer vision. By leveraging its ability to model long-range dependencies, we can directly obtain the correlation between OCT A-lines at any distance, thus accelerating the NURD correction. We develop an end-to-end stacked cross-attention network and design three types of optimization constraints. We compare our method with two traditional feature-based methods and a CNN-based method, on two publicly-available endoscopic OCT datasets and a private dataset collected on our home-built endoscopic OCT system. Our method achieved a $\sim3\times$ speedup to real time ($26\pm 3$ fps), and superior correction performance.

Subjects:	Image and Video Processing (eess.IV); Computer Vision and Pattern Recognition (cs.CV); Medical Physics (physics.med-ph)
Cite as:	arXiv:2306.04512 [eess.IV]
	(or arXiv:2306.04512v2 [eess.IV] for this version)
	https://doi.org/10.48550/arXiv.2306.04512
Journal reference:	Biomedical Optics Express 15.1 (2024): 319-335
Related DOI:	https://doi.org/10.1364/BOE.512337

Submission history

From: Haoran Zhang [view email]
[v1] Wed, 7 Jun 2023 15:25:27 UTC (9,970 KB)
[v2] Fri, 5 Jan 2024 06:51:15 UTC (15,098 KB)

Electrical Engineering and Systems Science > Image and Video Processing

Title:Cross-attention learning enables real-time nonuniform rotational distortion correction in OCT

Submission history

Access Paper:

References & Citations

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators

Electrical Engineering and Systems Science > Image and Video Processing

Title:Cross-attention learning enables real-time nonuniform rotational distortion correction in OCT

Submission history

Access Paper:

References & Citations

BibTeX formatted citation

Bookmark

Bibliographic and Citation Tools

Code, Data and Media Associated with this Article

Demos

Recommenders and Search Tools

arXivLabs: experimental projects with community collaborators