skip to main content
10.1145/3416921.3416937acmotherconferencesArticle/Chapter ViewAbstractPublication PagesiccbdcConference Proceedingsconference-collections
research-article

Autoencoders: A Low Cost Anomaly Detection Method for Computer Network Data Streams

Published: 24 September 2020 Publication History

Abstract

Computer networks are vulnerable to cyber attacks that can affect the confidentiality, integrity and availability of mission critical data. Intrusion detection methods can be employed to detect these attacks in real-time. Anomaly detection offers the advantage of detecting unknown attacks in a semi-supervised fashion. This paper aims to answer the question if autoencoders, a type of semi-supervised feedforward neural network, can provide a low cost anomaly detector method for computer network data streams. Autoencoder methods were evaluated online with the KDD'99 and UNSW-NB15 data sets, demonstrating that running time and labeling cost are significantly reduced compared to traditional online classification techniques for similar detection performance. Further research would consider the trade-off between single vs stacked networks, multi-label classification, concept drift detection and active learning.

References

[1]
Anna L. Buczak and Erhan Guven. 2016. A survey of data mining and machine learning methods for cyber security intrusion detection. IEEE Communications Surveys & Tutorials, 18, 2, 1153--1176.
[2]
Xiaoming Yuan, RanWang, Yi Zhuang, Kun Zhu, and Jie Hao. 2018. A concept drift based ensemble incremental learning approach for intrusion detection. In 2018 IEEE International Conference on Internet of Things (iThings) and IEEE Green Computing and Communications (GreenCom) and IEEE Cyber, Physical and Social Computing (CPSCom) and IEEE Smart Data (SmartData). IEEE, 350- 357.
[3]
João Gama, Indrė Žliobaitė, Albert Bifet, Mykola Pechenizkiy, and Abdelhamid Bouchachia. 2014. A survey on concept drift adaptation. ACM computing surveys (CSUR), 46, 4, 44.
[4]
R. Can Aygun and A. Gokhan Yavuz. 2017. Network anomaly detection with stochastically improved autoencoder based models. In 2017 IEEE 4th International Conference on Cyber Security and Cloud Computing (CSCloud). IEEE, 193--198.
[5]
Ian Goodfellow, Yoshua Bengio, and Aaron Courville. 2016. Deep Learning. http://www.deeplearningbook.org. MIT Press.
[6]
Thi-Thu-Huong Le, Jihyun Kim, and Howon Kim. 2017. An effective intrusion detection classifier using long short-term memory with gradient descent optimization. In 2017 International Conference on Platform Technology and Service (PlatCon). IEEE, 1--6.
[7]
Yisroel Mirsky, Tomer Doitshman, Yuval Elovici, and Asaf Shabtai. 2018. Kitsune: an ensemble of autoencoders for online network intrusion detection. arXiv preprint arXiv:1802.09089.
[8]
Miguel Nicolau and James McDermott. 2016. A hybrid autoencoder and density estimation model for anomaly detection. In International Conference on Parallel Problem Solving from Nature. Springer, 717--726.
[9]
Ali H. Mirza and Selin Cosan. 2018. Computer network intrusion detection using sequential lstm neural networks autoencoders. In 2018 26th Signal Processing and Communications Applications Conference (SIU). IEEE, 1--4.
[10]
Jinghui Chen, Saket Sathe, Charu Aggarwal, and Deepak Turaga. 2017. Outlier detection with autoencoder ensembles. In Proceedings of the 2017 SIAM International Conference on Data Mining. SIAM, 90--98.
[11]
Tung Kieu, Bin Yang, Chenjuan Guo, and Christian S. Jensen. 2019. Outlier detection for time series with recurrent autoencoder ensembles. In 28th international joint conference on artificial intelligence.
[12]
XuKui Li, Wei Chen, Qianru Zhang, and LifaWu. 2020. Building auto-encoder intrusion detection system based on random forest feature selection. Computers & Security, 101851.
[13]
Christopher Nixon, Mohamed Sedky, and Mohamed Hassan. 2019. Practical application of machine learning based online intrusion detection to internet of things networks. In 2019 IEEE Global Conference on Internet of Things (GCIoT). IEEE, 1--5.
[14]
Nitish Srivastava, Geoffrey Hinton, Alex Krizhevsky, Ilya Sutskever, and Ruslan Salakhutdinov. 2014. Dropout: a simple way to prevent neural networks from overfitting. The journal of machine learning research, 15, 1, 1929--1958.
[15]
Jacob Montiel, Jesse Read, Albert Bifet, and Talel Abdessalem. 2018. Scikitmultiflow: a multi-output streaming framework. The Journal of Machine Learning Research, 19, 1, 2915--2914.
[16]
Albert Bifet, Ricard Gavaldà, Geoff Holmes, and Bernhard Pfahringer. 2018. Machine Learning for Data Streams with Practical Examples in MOA. https://moa.cms.waikato.ac.nz/book/. MIT Press.
[17]
Indrė Žliobaitė, Albert Bifet, Bernhard Pfahringer, and Geoffrey Holmes. 2013. Active learning with drifting streaming data. IEEE transactions on neural networks and learning systems, 25, 1, 27--39.

Cited By

View all

Recommendations

Comments

Information & Contributors

Information

Published In

cover image ACM Other conferences
ICCBDC '20: Proceedings of the 2020 4th International Conference on Cloud and Big Data Computing
August 2020
130 pages
ISBN:9781450375382
DOI:10.1145/3416921
Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than ACM must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected]

In-Cooperation

  • Brookes: Oxford Brookes University
  • Staffordshire University: Staffordshire University
  • University of Liverpool

Publisher

Association for Computing Machinery

New York, NY, United States

Publication History

Published: 24 September 2020

Permissions

Request permissions for this article.

Check for updates

Author Tags

  1. anomaly detection
  2. autoencoders
  3. intrusion detection
  4. online learning

Qualifiers

  • Research-article
  • Research
  • Refereed limited

Conference

ICCBDC '20

Contributors

Other Metrics

Bibliometrics & Citations

Bibliometrics

Article Metrics

  • Downloads (Last 12 months)26
  • Downloads (Last 6 weeks)0
Reflects downloads up to 13 Jan 2025

Other Metrics

Citations

Cited By

View all

View Options

Login options

View options

PDF

View or Download as a PDF file.

PDF

eReader

View online with eReader.

eReader

Media

Figures

Other

Tables

Share

Share

Share this Publication link

Share on social media