High-Performance Computing for Machine Learning and Artificial Intelligence in Brain-Computer Interfaces with Big Data

I.  Stefanyshyn; О.  Pastukh

doi:10.36910/6775-2524-0560-2025-59-34

I. Stefanyshyn https://orcid.org/0009-0008-6930-528X
О. Pastukh https://orcid.org/0000-0002-0080-7053

DOI: https://doi.org/10.36910/6775-2524-0560-2025-59-34

Keywords: machine learning, electroencephalogram, motor imagery, high-performance computing, big data, information technologies, brain-computer interfaces

Abstract

The article explores approaches to optimizing the processing of big data of EEG signals in BCI by combining dimensionality reduction methods and HPC. The relevance of the problem is due to the fact that modern BCIs generate large datasets of signals, the processing of which in real time often creates a critical load on hardware and software resources. The aim of the work is to establish an optimal balance between classification accuracy, model robustness, and data processing time using various dimensionality reduction methods – PCA, ICA, LDA – in combination with the MLP classifier and the Dask library for parallel calculations. A series of experiments was conducted by varying the number of components for each decomposition. It was found that when using PCA with n_components=0.999 or LDA with n_components=13, the accuracy and f1_weighted remain practically the same as in the model without dimensionality reduction, but the processing time is reduced by 1.5-4 times, depending on the settings. The use of fewer components allows for even higher performance, but is accompanied by a noticeable decrease in accuracy, which is critical for neuroengineering and rehabilitation tasks. The use of Dask for organizing parallel calculations made it possible to effectively scale experiments and avoid excessive load on individual system nodes. A comparative analysis of the accuracy, robustness, f1_weighted, roc_auc_ovr_weighted metrics and execution time showed that the optimal settings of matrix layouts allow preserving key information in the signal without significant loss of classification quality. The developed approach has proven its effectiveness for tasks where resource limitations are combined with requirements for stability and accuracy of the system in real-time mode. The practical value of the results lies in the possibility of adapting the proposed pipeline for a wide range of biomedical and engineering applications, where speed, reliability, and robustness of brain signal processing are critical

References

1. Adolf A., Köllőd C. M., Márton G., Fadel W., Ulbert I. The Effect of Processing Techniques on the Classification Accuracy of Brain-Computer Interface Systems. Brain Sciences. 2024. Vol. 14, 1272.
2. Pastukh O., Stefanyshyn V., Baran I., Yakymenko I., Vasylkiv V. Mathematics and software for controlling mobile software devices based on brain activity signals. CEUR Workshop Proceedings. 2023. Vol. 3628. P. 684–689.
3. Xu F., Zheng W., Shan D., Yuan Q., Zhou W. Decoding spectro-temporal representation for motor imagery recognition using ECoG-based brain-computer interfaces. Journal of Integrative Neuroscience. 2020. Vol. 19, No. 2. P. 259–272.
4. Bleuzé A., Mattout J., Congedo M. Tangent space alignment: Transfer learning for Brain-Computer Interface. Frontiers in Human Neuroscience. 2022. Vol. 16.
5. Stefanyshyn I., Pastukh O., Stefanyshyn V., Baran I., Boyko I. Robustness of AI algorithms for neurocomputer interfaces based on software and hardware technologies. CEUR Workshop Proceedings. 2024. Vol. 3742. P. 137–149.