¹ßÇ¥Çü½Ä :
|
Á¢¼ö¹øÈ£ - 980173 RHOP 6-2 |
CLINICIANS 'SEE' THE WAVEFORMS: A COMPUTER VISION APPROACH FOR
INTERPRETABLE SLEEP STAGING |
GRADUATE SCHOOL OF DATA SCIENCE, SEOUL NATIONAL UNIVERSITY©ö, DEPARTMENT OF COMPUTER ENGINEERING, SCHOOL OF SOFTWARE, HALLYM UNIVERSITY©÷, OBSTRUCTIVE UPPER AIRWAY RESEARCH LABORATORY, DEPARTMENT OF PHARMACOLOGY, SEOUL NATIONAL UNIVERSITY COLLEGE OF MEDICINE©ø, DEPARTMENT OF OTORHINOLARYNGOLOGY-HEAD AND NECK SURGERY, SEOUL NATIONAL UNIVERSITY HOSPITAL©ù, OUAR LAB, INC⁵ |
HYOJIN LEE,
HYOJIN LEE©ö, JAEMIN JEONG©÷, HYUN KYUNG LEE©ø, YOU RIM CHOI©ö, WONHYUCK YOON©ù, HYUNG-SIN KIM©ö, HYUN-WOO SHIN©ø©ù⁵
|
¸ñÀû: Understanding the physiology and pathophysiology of sleep cycles
is
clinically significant as humans spend about one-third of their
lives
asleep. Polysomnography(PSG) is the gold standard in sleep
studies.
Especially, sleep stage classification is crucial for the
effective
diagnosis and treatment of sleep-rated disorders. However, manual
scoring of polysomnography(PSG) is labor-intensive and
subjective, with
high inter-rater variability. Although existing machine-learning
approaches have made significant progress in recent years, they
have
not been adopted widely in clinical settings due to black-box
skepticism. In this study, we propose a novel vision-based
automatic
sleep staging model that provides enriched explanations of the
model¡¯s
predictions. The goal of our method is to assist sleep clinicians
with
scoring tasks in clinical settings. ¹æ¹ý:Our approach is based on standardized image-based PSG dataset
rather
than raw signals, to mimic clinician¡¯s scoring as they ¡°see¡±
the
waveforms. This dataset encompasses a total of 10,253 patient
records,
which was collected from four institutes in South Korea between
2013
and 2020. The standard image is a snapshot of what sleep
clinicians see
in scoring tasks, encompassing not only EEG, EOG, and EMG but
also
respiratory and movement signals. We also validated our model
with
publicly available PSG datasets by transforming those signals
into
images. Our proposed framework adopted Vision Transformer
architecture
and can be divided into three modules. First, the Intra-epoch
Transformer learns the features of single-epoch images. By
utilizing
the transformer explainability method, we show a detailed
attribution
map that highlights where the model focuses. After extracting
feature
vectors from the previous module, we concatenated multiple epochs
to
generate the input of the Inter-Epoch Transformer. Afterward, the
Inter-epoch Transformer learns the context of the adjacent
epochs, as
clinicians see the preceding and subsequent epochs to score the
target
one. We also provided Inter-epoch interpretability, which
visualizes
the relevance of adjacent epochs in predicting the target epoch.
Throughout the training phase, we employed image augmentation
and 5-
fold ensemble method. During inference, we adopted a sliding
window
scheme to increase the performance. °á°ú:Our model shows Macro F1 greater than 80%, which is comparable to the
state-of-the-art sleep staging models. We evaluate our model to
publicly available datasets and verify robust performance. Unlike
signal-based models which require tailored pre-processing depending on
diverse datasets, our image transformation can be uniformly applied to
other datasets. This significantly reduces the training costs by
eliminating the necessity for fine-tuning the signal pre-processing.
Furthermore, our interpretable visualization shows a significant
alignment with the AASM scoring guide rules. For Intra-epoch
Interpretability, our results successfully pinpoint salient waveforms
such as alpha waves, sleep spindles, K-complexes, and delta waves. The
Inter-epoch interpretability shows how the model mirrors the
clinicians¡¯ approach of referencing neighboring epoch stages during
scoring. We found that REM stages have benefited the most from the
Inter-epoch transformer, as clinicians do in scoring tasks.
Furthermore, to the best of our knowledge, this is the first attempt to
provide an interpretable visualization of how neighboring epochs attend
each other. °á·Ð:The proposed framework achieves state-of-the-art performance in
terms
of the Macro F1 metric and additionally provides the reasoning
for the
sleep stage prediction. For the interpretability analysis, our
method
provides the most detailed and accurate visualization aligning
with the
sleep scoring rules. This breakthrough allows AI models to be
seamlessly integrated into practical healthcare settings. |
|