Emboli detection using a wrapper-based feature selection algorithm with multiple classifiers

Sakar B. E., SERBES G., AYDIN N.

Biomedical Signal Processing and Control, vol.71, 2022 (SCI-Expanded) identifier identifier

  • Publication Type: Article / Article
  • Volume: 71
  • Publication Date: 2022
  • Doi Number: 10.1016/j.bspc.2021.103080
  • Journal Name: Biomedical Signal Processing and Control
  • Journal Indexes: Science Citation Index Expanded (SCI-EXPANDED), Scopus, Compendex, EMBASE, INSPEC
  • Keywords: Directional Dual-Tree Rational-Dilation Com-plex Wavelet Transform, Embolic signals, Ensemble learning, Dimensionality reduction, Grid-search, COMPLEX WAVELET TRANSFORM, SIGNALS, DISCRIMINATION, CLASSIFICATION, MICROEMBOLI, DISEASE
  • Yıldız Technical University Affiliated: Yes


© 2021 Elsevier LtdTraditionally, analyzing spectral recordings and Doppler shift sounds for detecting emboli is done by experts visually and aurally. These techniques for detecting emboli are both subjective and expensive. In the proposed study, an emboli detection system, which makes binary classification to decide whether a signal is emboli or not, is developed using Q-factor tuned Directional Dual-Tree Rational-Dilation Complex Wavelet Transform for feature extraction. A Doppler ultrasound signal dataset including 400 samples – 200 from embolic, 100 from speckles, and 100 from artifacts – is used. Besides feeding dataset to Support Vector Machines, Multilayer Perceptron, Logistic Regression algorithms for classification; the ensemble voting approach was also applied to obtain higher performance. The experiments including the feature selection and classification algorithms are conducted with an unbiased two-step cross-validation procedure. Firstly, grid-search was used for finding optimum hyper-parameters and features on the training and validation sets. Next, the optimal model was applied to the test set. Lastly, a wrapper-based feature selection algorithm called Boruta was applied to the dataset to overcome insufficient number of samples problem. This problem is very common in biomedical studies due to the difficulties occurring in their creation such as the high-dependency to well-trained human power to acquire meaningful data, and expensiveness in terms of money and time. The results showed that ensemble learning has higher performance than single classifiers when a limited number of training samples is available. Besides, the results point out that close prediction performance was obtained with fewer samples when the Boruta algorithm was applied.