Optimising Machine Learning Integration in Real-Time Text Analytics Platforms: Technical Approaches and Performance Criteria

O. Korostin

doi:10.36910/6775-2524-0560-2025-58-05

ORCID: https://orcid.org/0009-0007-7510-6757

Keywords: machine learning, text streams, real-time platforms, algorithm adaptability, data analysis, distributed computing, performance optimisation, algorithm transparency

Abstract

The article investigates the integration of machine learning into real-time platforms for analysing text streams. The topic is relevant because the volume of unstructured textual data is growing, and this data must be processed promptly and accurately to support decision-making in fields such as media monitoring, cybersecurity, finance, and healthcare. The effectiveness of such platforms is shown to depend on algorithm adaptability, analysis accuracy, scalability, and the transparency of results. Special attention is paid to the technical aspects of implementation, including distributed architecture, streaming data processing, optimisation of computing resources, and integration of explainable models. The purpose of the article is to study how machine learning algorithms can be integrated into real-time platforms for analysing text streams and, in particular, to develop approaches that improve the efficiency of data processing while ensuring transparency and adaptability in a changing information environment. To achieve this goal, the study combines literature analysis, comparative evaluation of existing algorithms, and experimental assessment of technical solutions. The findings indicate that the main integration challenges are the computational complexity of deep models, scalability constraints, and delays in data stream processing. Distributed computing technologies, hardware accelerators (GPU/TPU), and online learning mechanisms are shown to significantly improve the performance of such platforms. Adaptive algorithms capable of updating their parameters in real time increase analysis accuracy under unstable data conditions. The study concludes that integrating machine learning into real-time systems enhances the speed, reliability, and scalability of text analytics. Further research should focus on developing universal multilingual platforms that combine energy efficiency, modularity, and high analytical performance.
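
The online learning mechanism named in the abstract, in which model parameters are updated in real time as each message arrives, can be illustrated with a minimal sketch. This is not the implementation evaluated in the article: it assumes a simple binary logistic regression over bag-of-words counts, and the class name `OnlineTextClassifier`, the helper `prequential_accuracy`, the learning rate, and the toy messages are all hypothetical and chosen only for illustration.

```python
import math
from collections import defaultdict


class OnlineTextClassifier:
    """Hypothetical sketch: binary logistic regression over token counts,
    updated one message at a time by stochastic gradient descent."""

    def __init__(self, learning_rate=0.5):
        self.learning_rate = learning_rate  # assumed value, not from the article
        self.weights = defaultdict(float)   # one weight per token seen so far
        self.bias = 0.0

    @staticmethod
    def _tokens(text):
        # Lower-cased whitespace tokenisation into a token -> count mapping.
        counts = defaultdict(float)
        for token in text.lower().split():
            counts[token] += 1.0
        return counts

    def predict_proba(self, text):
        # Probability that the message belongs to class 1.
        z = self.bias + sum(
            self.weights.get(token, 0.0) * count
            for token, count in self._tokens(text).items()
        )
        z = max(-30.0, min(30.0, z))  # clamp to keep exp() well-behaved
        return 1.0 / (1.0 + math.exp(-z))

    def learn_one(self, text, label):
        # Single gradient step on the log-loss for one labelled message.
        error = self.predict_proba(text) - label
        for token, count in self._tokens(text).items():
            self.weights[token] -= self.learning_rate * error * count
        self.bias -= self.learning_rate * error


def prequential_accuracy(model, stream):
    """Test-then-train evaluation: predict each message first, then learn from it."""
    correct = 0
    total = 0
    for text, label in stream:
        predicted = 1 if model.predict_proba(text) >= 0.5 else 0
        correct += int(predicted == label)
        total += 1
        model.learn_one(text, label)
    return correct / total if total else 0.0


# Toy labelled stream (illustrative data only): 1 = security alert, 0 = finance news.
messages = [
    ("malware detected on server", 1),
    ("quarterly earnings report published", 0),
    ("server breach spreads malware", 1),
    ("earnings report beats forecast", 0),
]

model = OnlineTextClassifier()
accuracy = prequential_accuracy(model, messages * 5)
```

Because the model is updated after every message, no retraining pass over historical data is needed, and tokens that appear for the first time simply acquire a weight when they are first seen. A production platform would more likely rely on an established streaming library and a bounded feature space, which this sketch omits for brevity.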
