Efficacy of using SVM on Iot Security Threat Identification under Reduced Feature Set Architecture Compared to J48 and Random Forest Algorithms using UCI Benchmark Data Set

Samanwickrama, WD; Munasinghe, Lankeshwara; Perera, LDRD

WUSL Repo Home
→
1. WUSL Publications
→
a). Proceedings
→
Wayamba University Research Congress (WURC)
→
View Item

dc.contributor.author	Samanwickrama, WD
dc.contributor.author	Munasinghe, Lankeshwara
dc.contributor.author	Perera, LDRD
dc.date.accessioned	2026-05-22T09:04:41Z
dc.date.available	2026-05-22T09:04:41Z
dc.date.issued	2020
dc.identifier.uri	http://repository.wyb.ac.lk/handle/1/3611
dc.description.abstract	More than 5 billion IoT devices are on the Internet due to smart technology development in 2020. Most of them are connected to the Internet and they will create an easy gateway to hackers. So the network security has become an immense challenge. Maintaining minimal resource consumption of IoT devices is also difficult. Here we try to build up light weight anomaly detection mechanism for smart devices. Supervised machine learning algorithms were used for building a classifier for threat detection and their performances were compared. Danmini Doorbell (DDb) data in UCI repository is used in this experiment. The DDb dataset contains three types of web traffic data — benign traffic, Mirai traffic and Gafgyt traffic. Each record contains N=115 features which were generated by the publishers of the dataset using raw attributes of network traffic. As both Gafgyt and Mirai traffic produced by the attack activity, the two data- sources were combined to construct the overall set of malicious data for this experiment. In order to test the hypothesis, we used Gafgyt and Mirai traffic data and benign traffic used to train the model for SVM. Feature importance is determined using the Gini index which was obtained using Random Forest algorithm. We picked top n=20 from the total N features to train the one class SVM, J48 and Random forest which is used to detect potential attacks in test data. Comparing the total N feature classification against reduced n feature classification equal accuracy were obtained in each algorithm. According to the benchmark testing the computational cost were reduced in each algorithm. However, when comparing other algorithms with SVM, the SVM had the lowest accuracy and performance (Table 1) Table 1. Algorithm comparison results Algorithm Mean time (s) Random Forest 1.640045 KSVM 184.698795 J48 1.046536 SVM can be trained using benign traffic but others need malicious traffic to train the system. Although SVM shows less accuracy and high resource consumption it can be used to identify unknown threats. So the SVM is the only algorithm among these three algorithms that can be employed to identify zero day attack. These results can be used the reduced feature set in IoT anomaly detection. It should be further investigated with different data sets and different algorithms. This becomes an area for potential research in future.	en_US
dc.publisher	Wayamba University Research Congress	en_US
dc.subject	Anomaly detection	en_US
dc.subject	IoT	en_US
dc.subject	Machine learning	en_US
dc.subject	Network security	en_US
dc.subject	UCI	en_US
dc.title	Efficacy of using SVM on Iot Security Threat Identification under Reduced Feature Set Architecture Compared to J48 and Random Forest Algorithms using UCI Benchmark Data Set	en_US
dc.type	Article	en_US