Abstract

Introduction

The accuracy of echocardiography interpretations is crucial. However, echocardiography has varying degrees of image quality, which can impact diagnostic decision-making. Automated chamber quantification of the left ventricle (LV) and left atrium (LA) has been shown to be feasible using machine learning (ML) systems. However, the effect of image quality on the performance of these ML systems has not been investigated.

Purpose

To assess the effect of different echocardiographic image quality on the performance of left-sided heart quantification using an ML system.

Methods

A multi-center study enrolled 240 patients who underwent stress echocardiography across five centers. The acquired rest phase images were anonymized for subsequent analysis. Expert cardiologists reviewed the studies, selecting apical four-chamber (4Ch) and two-chamber (2Ch) view images. The cardiologists identified end-systolic (ES) and end-diastolic (ED) frames. They traced the endocardial borders for the LA in ES and the LV in ED and ES. The same ED and ES frames were then processed by an ML algorithm trained on a separate set of images. Subsequently, an expert cardiologist, blinded to the previous measurements, assessed the image quality of LA and LV. The image quality for each chamber in the ES or ED frame was categorized into one of four grades: poor, suboptimal, fair, and optimal. Agreement analysis between human and AI measurements was performed within each image quality group. Yield was defined as the percentage of studies where the ML system predicted an echocardiographic parameter compared to a cardiologist.

Results

The yield of study parameters varied significantly with image quality. The lowest yield was 45%, observed for LV end-diastolic volume in the 2Ch view when image quality was poor. In contrast, the yield of 100% was seen for LV end-diastolic volumes in both the 4Ch and 2Ch views for optimal quality images. The correlation between automated and expert cardiologist measurements of left heart volumes varied depending on image quality. For poor or suboptimal images, the Pearson correlation coefficient ranged from 0.66 to 0.91 and improved to 0.85-0.92 in fair and optimal-quality images. Additionally, there was an inverse relation between agreement metrics (bias, limits of agreement, and coefficient of variation) and improvements in image quality. However, this tendency was less pronounced or nonexistent when comparing fair and optimal-quality images. (Table 1).

Conclusions

Image quality affects the yield of automated analysis and the agreement between cardiologists and automated measurements in echocardiography, highlighting the need for high-quality image acquisition in use with ML systems.

This content is only available as a PDF.

Author notes

Funding Acknowledgements: None.

This article is published and distributed under the terms of the Oxford University Press, Standard Journals Publication Model (https://academic.oup.com/pages/standard-publication-reuse-rights)