机器学习在恶意软件检测中的应用

#we initiate the classifier

clf1 = RandomForestClassifier()

#training

clf1.fit(X_train,y_train)

#prediction labels for X_test

y_pred=clf1.predict(X_test)

#metrics evaluation

"""

tn = True Negative a correct prediction clean predicted as clean

fp = False Positive a false alarm clean predicted as malicious

tp = True Positive a correct prediction (malicious)

fn = False Negative a malicious label predicted as clean

"""

tn, fp, fn, tp = confusion_matrix(y_test, y_pred).ravel()

print "TN = ",tn

print "TP = ",tp

print "FP = ",fp

print "FN = ",fn

法度榜样输出为：

TN = 697 
TP = 745 
FP = 6 
FN = 4

根据处理结不雅，在没有进行参数微调和修改的情况下，我们只有6个践言性和4个假阴性误判，这个结不雅相当不错。我们可以精确断定697个正常文件以及745个恶意软件，大年夜结不雅上来看，我们的小型反病毒引擎效不雅还可以。

接下来我们试一下另一个分类器，我们建立一个简单的神经收集，看看它对随机瓜分的处理效不雅若何。

根据维诽谤科的词条解释：

独裁感知器(multilayer perceptron，MLP)是一种前馈人工神经收集模型，它将输入数据集映射为一组恰当的输出集。MLP由有向图中的独裁节点构成，每层节点都与下一层节点完全相连。除了输入节点之外，每个节点都是具有非线性激活功能的神经元(或处理单位)。MLP应用了反向传播(back propagation)这种监督进修技巧(supervised learning technique)来练习神经收集。MLP是标准线性感知器的修改版，可以用来区分不克不及线性分别的那些数据。

大年夜上述定义我们可知，MLP是感知器的一种广义情势，也是深度进修办法的根本模型之一，可以用于处理广度和深度收集。

#our usual split 
X_train,X_test,y_train,y_test = train_test_split(X,y,test_size = 0.3,random_state=0) 
#This is a special process called feature engineering where we transform our data into the same scale for better predictions 
scaler = StandardScaler() 
scaler.fit(X_train) 
X_train = scaler.transform(X_train) 
X_test = scaler.transform(X_test) 
#Here we build a Multi Layer Perceptron of 12 Layers for 12 Features  you can use more if you want but it will turn into a complex zoo 
mlp = MLPClassifier(hidden_layer_sizes=(12,12,12,12,12,12)) 
#Training the MLP on our data 	
			 5/6   首页 上一页 3 4 5 6 下一页 尾页	
			

　　推荐阅读
　　Python一行代码完成并行任务
            
            众所周知，Python的并行处理才能很不睬想。我认为如不雅不推敲线程和GIL的标准参数(它们大年夜多是合法的)，其原因不是因为技巧不到位，而是我们的应用办法不恰当。大年夜多半关于Python线>>>详细阅读


本文标题：机器学习在恶意软件检测中的应用
地址：http://www.17bianji.com/lsqh/34783.html
 1/2    1