Skip to content

Latest commit

 

History

History
16 lines (12 loc) · 891 Bytes

File metadata and controls

16 lines (12 loc) · 891 Bytes

Machine-Learning-approach-for-Malware-Detection

A Machine Learning approach for classifying a file as Malicious or Legitimate.

This approach tries out 6 different classification algorithms before deciding which one to use for prediction by comparing their results. Different Machine Learning models tried are, Linear Regression, RandomForest, DecisionTree, Adaboost, Gaussian, Gradient Boosting.

In order to test the model on an unseen file, it's required to extract the characteristics of the given file. Python's pefile.PE library is used to construct and build the feature vector and a ML model is used to predict the class for the given file based on the already trained model.

Dependencies

  • pandas pip install pandas
  • numpy pip install numpy
  • pickle pip install pickle
  • scipy pip install scipy
  • scikit pip install -U scikit-learn