TY - GEN
T1 - Evaluation of the performance of supervised and unsupervised Machine learning techniques for intrusion detection
AU - Portela, Fernando Gutierrez
AU - Almenares Mendoza, Florina
AU - Benavides, Liliana Calderon
N1 - Publisher Copyright:
© 2019 IEEE.
PY - 2019/11
Y1 - 2019/11
N2 - machine learning techniques are widely used in the research for intelligent solutions anomalies detection on different computers and communications systems, which have allowed to modernize the intrusion detection systems, to ensure data privacy. For that, this paper evaluates the performance of some supervised (i.e., KNN and SVM) and unsupervised (i.e., Isolation Forest and K-Means) algorithms, for intrusion detection, using data set UNSW-NB12. The results show that the supervised algorithm SVM gaussiana fine, obtained 92% in accuracy, indicating the ability to correctly classify normal and abnormal data. With regard to the unsupervised algorithms, the K-Means algorithm groups the data together correctly and allows the appropriate number of groups to be clearly defined; however, this data set is highly agglomerated. For Isolation Forest, despite being a robust algorithm for the separation of atypical values, it presented difficulty for it. Finally, it should be made clear that not all methods of detecting anomalies by distance work properly for all data sets.
AB - machine learning techniques are widely used in the research for intelligent solutions anomalies detection on different computers and communications systems, which have allowed to modernize the intrusion detection systems, to ensure data privacy. For that, this paper evaluates the performance of some supervised (i.e., KNN and SVM) and unsupervised (i.e., Isolation Forest and K-Means) algorithms, for intrusion detection, using data set UNSW-NB12. The results show that the supervised algorithm SVM gaussiana fine, obtained 92% in accuracy, indicating the ability to correctly classify normal and abnormal data. With regard to the unsupervised algorithms, the K-Means algorithm groups the data together correctly and allows the appropriate number of groups to be clearly defined; however, this data set is highly agglomerated. For Isolation Forest, despite being a robust algorithm for the separation of atypical values, it presented difficulty for it. Finally, it should be made clear that not all methods of detecting anomalies by distance work properly for all data sets.
KW - Clustering algorithms
KW - Intrusion detection
KW - Machine learning algorithms
KW - Outlier detection
KW - Supervised algorithms
KW - Unsupervised algorithms
UR - http://www.scopus.com/inward/record.url?scp=85084667524&partnerID=8YFLogxK
U2 - 10.1109/iCASAT48251.2019.9069538
DO - 10.1109/iCASAT48251.2019.9069538
M3 - Libros de Investigación
AN - SCOPUS:85084667524
T3 - 2019 IEEE International Conference on Applied Science and Advanced Technology, iCASAT 2019
BT - 2019 IEEE International Conference on Applied Science and Advanced Technology, iCASAT 2019
PB - Institute of Electrical and Electronics Engineers Inc.
T2 - 2019 IEEE International Conference on Applied Science and Advanced Technology, iCASAT 2019
Y2 - 27 November 2019 through 28 November 2019
ER -