Dalarna University's logo and link to the university's website

du.sePublications
Change search
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • chicago-author-date
  • chicago-note-bibliography
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf
Machine Learning for Cross-Site Scripting (XSS) Detection: A comparative analysis of machine learning models for enhanced XSS detection
Dalarna University, School of Information and Engineering.
Dalarna University, School of Information and Engineering.
2024 (English)Independent thesis Basic level (degree of Bachelor), 10 credits / 15 HE creditsStudent thesis
Abstract [en]

The objective of this study is to assess the efficacy of several machine learning (ML) algorithms in identifying cross-site scripting (XSS) vulnerabilities, which are a widespread and significant cybersecurity risk. Several studies have emphasized the absence of a rich data set for model training. This research employs a comprehensive dataset from open sources, which includes 219,176 scripts evenly divided into harmful and non-harmful categories. The purpose of this study is to train and evaluate the effectiveness of various machine learning approaches. The evaluation utilizes criteria such as accuracy, F1-scores, and the confusion matrix. The algorithms analyzed are support vector machines (SVM), artificial neural networks (ANN), and recurrent neural networks (RNN). Out of all the models, the Artificial Neural Network (ANN) proved to be the most efficient, with an accuracy rate of 99% and F1-scores surpassing 0.98 in all categories. It greatly outperformed the other models.

The results indicate that combining the advantages of each model with a hybrid approach could improve detection accuracy. Integrating Support Vector Machine (SVM) with Recurrent Neural Network (RNN) and Artificial Neural Network (ANN) models can provide a dependable solution. Initially, SVM can filter data, thereby reducing the analysis time. This, in turn, improves the efficiency of RNN or ANN in detecting cross-site scripting (XSS) attacks. This approach should result in a stronger detection system for XSS vulnerabilities by combining SVM's accuracy in handling non-malicious instances with the sophisticated pattern recognition abilities of RNN and ANN. 

Place, publisher, year, edition, pages
2024.
Keywords [en]
"Cross-site scripting" "Attack, " "Injection, " or "Vulnerability" "Artificial Intelligence, " "Machine Learning, " "Deep Learning."
National Category
Computer Sciences
Identifiers
URN: urn:nbn:se:du-49044OAI: oai:DiVA.org:du-49044DiVA, id: diva2:1883365
Subject / course
Microdata Analysis
Available from: 2024-07-10 Created: 2024-07-10

Open Access in DiVA

fulltext(913 kB)916 downloads
File information
File name FULLTEXT01.pdfFile size 913 kBChecksum SHA-512
f92ee83e08dcbfc59454a4ca2e19a7980a79b2c8372ac0ab5a5e6ef0b42c342a8114b4923e2b16311116a4e7684b4e9fa46eeebf276e44b524c1e6527aa52e66
Type fulltextMimetype application/pdf

By organisation
School of Information and Engineering
Computer Sciences

Search outside of DiVA

GoogleGoogle Scholar
Total: 918 downloads
The number of downloads is the sum of all downloads of full texts. It may include eg previous versions that are now no longer available

urn-nbn

Altmetric score

urn-nbn
Total: 798 hits
CiteExportLink to record
Permanent link

Direct link
Cite
Citation style
  • apa
  • ieee
  • modern-language-association-8th-edition
  • vancouver
  • chicago-author-date
  • chicago-note-bibliography
  • Other style
More styles
Language
  • de-DE
  • en-GB
  • en-US
  • fi-FI
  • nn-NO
  • nn-NB
  • sv-SE
  • Other locale
More languages
Output format
  • html
  • text
  • asciidoc
  • rtf