HOW I MET YOUR MALWARE

Machine learning driven malware detection 🤖

The goal of this project is to evaluate how well different machine learning techniques classify PE (Portable Executable) files as malicious or benign.
The data set can be found here.

Each sample has more than 70 features obtained through static analysis of its corresponding PE file (e.g. SizeOfCode, SectionMaxEntropy).
The data set contains 19,611 samples, of which 14,599 are malicious and 5,012 are benign.
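
The two classes are not balanced, which matters when reading accuracy figures. As a quick sanity check, the short sketch below derives the class shares and the accuracy of a trivial classifier that always answers "malicious". It uses only the counts stated above; the variable names are illustrative and not taken from classifier.py.

```python
# Counts taken from the data set description above.
n_malicious = 14_599
n_benign = 5_012
n_total = n_malicious + n_benign

malicious_share = n_malicious / n_total
benign_share = n_benign / n_total

# Accuracy of a trivial classifier that labels every file as malicious.
majority_baseline = max(malicious_share, benign_share)

print(f"total samples: {n_total}")
print(f"malicious: {malicious_share:.1%}, benign: {benign_share:.1%}")
print(f"majority-class baseline accuracy: {majority_baseline:.1%}")
```

Any classifier worth keeping should beat this baseline of roughly 74%, so metrics such as precision and recall per class are probably more informative than accuracy alone.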

We used three supervised classification techniques:

K Nearest Neighbors
Support Vector Machine (3rd order polynomial kernel)
Logistic Regression
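
To make the "3rd order polynomial kernel" concrete, here is a minimal sketch of the common form K(x, z) = (gamma * <x, z> + coef0)^3. Only the degree comes from this README; the gamma and coef0 values, and the function name poly_kernel, are assumptions chosen for illustration.

```python
def poly_kernel(x, z, degree=3, gamma=1.0, coef0=1.0):
    """Polynomial kernel (gamma * <x, z> + coef0) ** degree.

    degree=3 matches the README; gamma and coef0 are assumed values.
    """
    dot = sum(a * b for a, b in zip(x, z))
    return (gamma * dot + coef0) ** degree


# Two toy feature vectors (hypothetical, not real PE features).
x = [1.0, 2.0]
z = [3.0, 4.0]
similarity = poly_kernel(x, z)  # (1*3 + 2*4 + 1) ** 3
print(similarity)
```

The kernel lets the SVM separate classes with a cubic decision surface without computing the expanded feature space explicitly.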

We performed 5-fold cross-validation on each technique to find its optimal hyperparameter: the number K of neighbors for K Nearest Neighbors, the C value that tunes the margin for the Support Vector Machine, and the regularization parameter for Logistic Regression.
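
The selection procedure can be sketched in plain Python for the K Nearest Neighbors case. This is an illustration under assumptions, not the code in classifier.py: it uses unstratified folds, Euclidean distance, a hypothetical candidate list for K, and toy 2-D data in place of the real PE features.

```python
import random
from collections import Counter


def k_fold_indices(n_samples, n_folds=5, seed=0):
    """Shuffle sample indices and split them into n_folds near-equal folds."""
    indices = list(range(n_samples))
    random.Random(seed).shuffle(indices)
    return [indices[i::n_folds] for i in range(n_folds)]


def knn_predict(train_X, train_y, point, k):
    """Majority vote among the k training points closest to `point`."""
    by_distance = sorted(
        range(len(train_X)),
        key=lambda i: sum((a - b) ** 2 for a, b in zip(train_X[i], point)),
    )
    votes = Counter(train_y[i] for i in by_distance[:k])
    return votes.most_common(1)[0][0]


def cross_val_accuracy(X, y, k, n_folds=5, seed=0):
    """Mean validation accuracy of k-NN over n_folds folds."""
    scores = []
    for fold in k_fold_indices(len(X), n_folds, seed):
        held_out = set(fold)
        train_idx = [i for i in range(len(X)) if i not in held_out]
        train_X = [X[i] for i in train_idx]
        train_y = [y[i] for i in train_idx]
        correct = sum(knn_predict(train_X, train_y, X[i], k) == y[i] for i in fold)
        scores.append(correct / len(fold))
    return sum(scores) / n_folds


def best_k(X, y, candidates, n_folds=5):
    """Return the candidate k with the highest cross-validated accuracy."""
    results = {k: cross_val_accuracy(X, y, k, n_folds) for k in candidates}
    return max(results, key=results.get), results


# Toy data (hypothetical): two well-separated clusters, 0 = benign, 1 = malicious.
rng = random.Random(42)
X = [[rng.gauss(0, 1), rng.gauss(0, 1)] for _ in range(50)]
X += [[rng.gauss(6, 1), rng.gauss(6, 1)] for _ in range(50)]
y = [0] * 50 + [1] * 50

chosen_k, scores = best_k(X, y, candidates=[1, 3, 5, 7])
print(chosen_k, scores)
```

The same loop applies to the other two techniques by swapping knn_predict for the relevant model and iterating over candidate C or regularization values instead of K.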

Developers 👥

Riccardo Nannini 🇮🇹

Veronica Grosso 🇮🇹

Andrea Giarduz 🇮🇹
