Anomaly detection of SVM under poisoning attack

Samanwickrama, WD; Amadoru, I.J.; Perera, LDRD

WUSL Repo Home
→
1. WUSL Publications
→
a). Proceedings
→
Wayamba University Research Congress (WURC)
→
View Item

dc.contributor.author	Samanwickrama, WD
dc.contributor.author	Amadoru, I.J.
dc.contributor.author	Perera, LDRD
dc.date.accessioned	2026-05-22T09:08:28Z
dc.date.available	2026-05-22T09:08:28Z
dc.date.issued	2021
dc.identifier.uri	http://repository.wyb.ac.lk/handle/1/3612
dc.description.abstract	Machine Learning (IvIL) applications have been adopted heavily due to the use of artificial intelligence systems, cloud computing, social media and smart computing. Venders integrate ML into products across various industries. ML systems train models periodically to enhance the ad hoc functionality. However, data poisoning has been identified as a challenge in ML, and this can be occuned by an injection attack or a label flrpping. A hacker needs to know the existing system, and they have to craft and transplant compromised data points avoiding outlier regions for an injection attack. In label flipping, the hackers should access training data systems and make alterations or periodically insert data into the systems through a proper channel trending towards wrong decision making. Support Vector Machine (SVM) is an algorithm which is commonly used in ML, but the algorithm has become a target for data poisoning. The objective of this study was to develop earry identification parameters in order to check whether a SVM model is attacked by data poisoning or not. Danmini Doorbell (DDb) data in University of California, Irvine (UCD machine leaming repository was used in this experiment. Each record contained N:\|15 features which were genffated by the publishers of the dataset using the raw attributes of network traffic. The top 20 (n) was picked from the total N features using the Gini index obtained by the Random Forest algorithm in order to test t}tem according to a reduced feature set architecture. Accuracy and kappa were calculated using one Class SVM model to identi& a poisoning attack on the training data set. 139 Proceedings of Wayamba University Research Congress 202 I , Senate Research and Higher Degrees Commitlee Table 1: The change ofaccuracy and kappa values in data poisoning Sample A B c DE F All features 115 kappa 0.8876 0.8894 0.8697 0.2349 0.2112 -0.0423 Accuracy 0.9951 0.9951 4.9941 0.8909 0.8800 0.8657 Top 20 features kappa 0.8851 0.4254 0.2261 0.1135 0.0997 -0.0446 Accuracy 0.9950 0.9502 0.8847 0.7900 0.7783 0.8243 % polsorung 0 0.0099 0.0i99 0.2 0.2999 1 According to the results, inthe all-features set architecture, the accuracy and the kappa values were decreased with the increase in data poisoning (Table 1). Similar results were also obtained in the reduced-features set architecture as well. The results revealed that the SVM anomalous behavior due to data poisoning can be identified by checking accuracy and kappa on training data sets. However, in order to make firm applications it should be further investigated with different data sets and algorithms.	en_US
dc.publisher	Wayamba University Research Congress	en_US
dc.subject	Accuracy	en_US
dc.subject	D ata poisoning	en_US
dc.subject	Kappa	en_US
dc.subject	Machine learning	en_US
dc.title	Anomaly detection of SVM under poisoning attack	en_US