Research of methods for improving the quality of classification on highly correlated and unbalanced data

dc.contributor.authorGavrylenko, Svitlana
dc.contributor.authorZozulia, Vladislav
dc.contributor.authorPoltoratskyi, Vadim
dc.date.accessioned2025-11-10T10:49:12Z
dc.date.issued2024
dc.description.abstractThe object of the study is the process of identifying the state of a computer systems and network. The subject of the study are the methods of identifying the state of computer systems and networks. The purpose of this paper is to develop a method for detecting intrusions in computer networks on highly correlated and unbalanced data. The results obtained. The paper analyzes traditional machine learning algorithms, deep learning methods and considers the advantages of using ensemble models. The scientific novelty of the obtained results lies in the comprehensive use of the developed procedure for reducing feature correlation, the use of the SMOTEENN data balancing method, and the tuning of parameters for basic classifiers and meta-algorithm. The developed methods are implemented using Python and the GOOGLE COLAB cloud service with Jupyter Notebook. Conclusions. Experiments confirmed the efficiency of the proposed method. The comprehensive use of the above procedures and methods allowed to improve the quality of models by 30% in solving the task of intrusion detection in the operation of computer systems and networks. This makes it possible to recommend it for practical use, in order to improve the accuracy of identifying the state of a computer system.
dc.identifier.citationGavrylenko S. Research of methods for improving the quality of classification on highly correlated and unbalanced data / Gavrylenko S., Zozulia V., Poltoratskyi V. // Problems of scientific, technical and legal support for cybersecurity in the modern world : monograph / ed.: dr.hab., prof. Semenov S., dr.hab., prof. Muhatsky M. – Krakow : UNEC, 2024. – P. 11-20.
dc.identifier.orcidhttps://orcid.org/0000-0002-5093-0420
dc.identifier.orcidhttps://orcid.org/0009-0003-5312-4939
dc.identifier.urihttps://repository.kpi.kharkov.ua/handle/KhPI-Press/95016
dc.language.isoen
dc.publisherUniversity of the national education commission
dc.subjectcomputer systems
dc.subjectnetwork
dc.subjectmachine learning
dc.subjectdata preprocessing
dc.subjectcorrelated and unbalanced data
dc.subjectSMOTEENN
dc.subjectensemble classifier
dc.subjectbagging
dc.subjectrandom forest
dc.subjectadaboost
dc.subjectgradient boosting
dc.titleResearch of methods for improving the quality of classification on highly correlated and unbalanced data
dc.typeArticle

Файли

Контейнер файлів

Зараз показуємо 1 - 1 з 1
Вантажиться...
Ескіз
Назва:
Gavrylenko_Research_2024.pdf
Розмір:
721.25 KB
Формат:
Adobe Portable Document Format

Ліцензійна угода

Зараз показуємо 1 - 1 з 1
Вантажиться...
Ескіз
Назва:
license.txt
Розмір:
11.25 KB
Формат:
Item-specific license agreed upon to submission
Опис: