Search CORE

1,269 research outputs found

Early-stage malware prediction using recurrent neural networks

Author: Burnap Pete
Jones Kevin
Rhode Matilda
Publication venue: 'Elsevier BV'
Publication date: 22/05/2018
Field of study

Static malware analysis is well-suited to endpoint anti-virus systems as it can be conducted quickly by examining the features of an executable piece of code and matching it to previously observed malicious code. However, static code analysis can be vulnerable to code obfuscation techniques. Behavioural data collected during file execution is more difficult to obfuscate, but takes a relatively long time to capture - typically up to 5 minutes, meaning the malicious payload has likely already been delivered by the time it is detected. In this paper we investigate the possibility of predicting whether or not an executable is malicious based on a short snapshot of behavioural data. We find that an ensemble of recurrent neural networks are able to predict whether an executable is malicious or benign within the first 5 seconds of execution with 94% accuracy. This is the first time general types of malicious file have been predicted to be malicious during execution rather than using a complete activity log file post-execution, and enables cyber security endpoint protection to be advanced to use behavioural data for blocking malicious payloads rather than detecting them post-execution and having to repair the damage

arXiv.org e-Print Archive

Crossref

Online Research @ Cardiff

Tiresias: Predicting Security Events Through Deep Learning

Author: Blond Stevens Le
Brown Peter F.
Chua Zheng Leong
Gu Guofei
Ho Grant
Liu Yang
Melicher William
Provos Niels
Provos Niels
Richard Shin Eui Chul
Sabottke Carl
Soska Kyle
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 01/01/2018
Field of study

With the increased complexity of modern computer attacks, there is a need for defenders not only to detect malicious activity as it happens, but also to predict the specific steps that will be taken by an adversary when performing an attack. However this is still an open research problem, and previous research in predicting malicious events only looked at binary outcomes (e.g., whether an attack would happen or not), but not at the specific steps that an attacker would undertake. To fill this gap we present Tiresias, a system that leverages Recurrent Neural Networks (RNNs) to predict future events on a machine, based on previous observations. We test Tiresias on a dataset of 3.4 billion security events collected from a commercial intrusion prevention system, and show that our approach is effective in predicting the next event that will occur on a machine with a precision of up to 0.93. We also show that the models learned by Tiresias are reasonably stable over time, and provide a mechanism that can identify sudden drops in precision and trigger a retraining of the system. Finally, we show that the long-term memory typical of RNNs is key in performing event prediction, rendering simpler methods not up to the task

arXiv.org e-Print Archive

Crossref

Boston University Institutional Repository (OpenBU)

Comparison of Deep Learning and the Classical Machine Learning Algorithm for the Malware Detection

Author: Rathore Hemant
Sahay Sanjay K.
Sewak Mohit
Publication venue: 'Institute of Electrical and Electronics Engineers (IEEE)'
Publication date: 16/09/2018
Field of study

Recently, Deep Learning has been showing promising results in various Artificial Intelligence applications like image recognition, natural language processing, language modeling, neural machine translation, etc. Although, in general, it is computationally more expensive as compared to classical machine learning techniques, their results are found to be more effective in some cases. Therefore, in this paper, we investigated and compared one of the Deep Learning Architecture called Deep Neural Network (DNN) with the classical Random Forest (RF) machine learning algorithm for the malware classification. We studied the performance of the classical RF and DNN with 2, 4 & 7 layers architectures with the four different feature sets, and found that irrespective of the features inputs, the classical RF accuracy outperforms the DNN.Comment: 11 Pages, 1 figur

arXiv.org e-Print Archive

Crossref

An investigation of a deep learning based malware detection system

Author: David O. E.
Kolosnjaji Bojan
Lin Chih-Ta
Pascanu R.
Sahay Sanjay K.
Siddiqui Muazzam
Srivastava Nitish
Publication venue: 'Association for Computing Machinery (ACM)'
Publication date: 16/09/2018
Field of study

We investigate a Deep Learning based system for malware detection. In the investigation, we experiment with different combination of Deep Learning architectures including Auto-Encoders, and Deep Neural Networks with varying layers over Malicia malware dataset on which earlier studies have obtained an accuracy of (98%) with an acceptable False Positive Rates (1.07%). But these results were done using extensive man-made custom domain features and investing corresponding feature engineering and design efforts. In our proposed approach, besides improving the previous best results (99.21% accuracy and a False Positive Rate of 0.19%) indicates that Deep Learning based systems could deliver an effective defense against malware. Since it is good in automatically extracting higher conceptual features from the data, Deep Learning based systems could provide an effective, general and scalable mechanism for detection of existing and unknown malware.Comment: 13 Pages, 4 figure

arXiv.org e-Print Archive

Crossref