Early Detection of Alzheimer’s Disease Using Vision Transformers on CWT-Transformed EEG Signals in Response to Olfactory Stimuli


Albaidhani A. H. J., SERBES G., İLHAN H. O.

IEEE Access, cilt.14, ss.6968-6985, 2026 (SCI-Expanded, Scopus) identifier

  • Yayın Türü: Makale / Tam Makale
  • Cilt numarası: 14
  • Basım Tarihi: 2026
  • Doi Numarası: 10.1109/access.2026.3652249
  • Dergi Adı: IEEE Access
  • Derginin Tarandığı İndeksler: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Compendex, INSPEC, Directory of Open Access Journals
  • Sayfa Sayıları: ss.6968-6985
  • Anahtar Kelimeler: Alzheimer’s, CatBoost, complex wavelet transform, short time Fourier transform, transfer learning, vision transformer
  • Yıldız Teknik Üniversitesi Adresli: Evet

Özet

Alzheimer’s disease (AD) is a progressive neurodegenerative disorder that significantly impairs cognitive function, which can potentially hinder global healthcare systems. Early and accurate diagnosis of AD is imperative for effective intervention and treatment to take place. Electroencephalography (EEG) is a non-invasive method for monitoring brain activity, which can open opportunities for the early detection of AD through associated biomarkers. Recent studies have revealed a significant correlation between an impaired sense of smell and the onset of early AD. Our approach analyzes EEG signals generated in response to olfactory stimuli to leverage this biomarker and enhance classification accuracy. Despite admirable progress in Artificial Intelligence (AI) and deep learning for diagnosing neurodegenerative diseases, obstacles such as scarce datasets, model generalization, and suboptimal feature extraction hinder the practical application of these techniques for AD detection. The objective of this study is to develop a reliable approach for the early detection of AD by leveraging EEG data. This can be achieved by analysing EEG data using signal transformation techniques, such as the Short-Time Fourier Transform (STFT) and Continuous Wavelet Transform (CWT), and implementing advanced deep learning models. Our results show that patient-wise performance was highest using the CWT. In this configuration, the Vision Transformer (ViT) produced the best diagnostic accuracy (91.43% at the patient level and 85.18% at the image level). In the context of image-level evaluation, the custom Convolutional Neural Network (CNN) paired with CatBoost achieved 81.66% accuracy; the CNN reached 81.25%; transfer learning with ResNet50 yielded 80.69%; and VGG16 attained 79.82%.