欧美xxxxxbb,欧美激情在线精品video,国产在线不卡午夜精品2021

集成方法是將機器學(xué)習(xí)模型組合成更強大的機器學(xué)習(xí)模型的方法。隨機森林是決策樹的集合，就是其中之一。它比單一決策樹好，因為在保留預(yù)測能力的同時，通過平均結(jié)果可以減少過度擬合。在這里，我們將在 scikit 學(xué)習(xí)癌癥數(shù)據(jù)集上實施隨機森林模型。

導(dǎo)入必要的軟件包 -

from sklearn.ensemble import RandomForestClassifier
from sklearn.model_selection import train_test_split
from sklearn.datasets import load_breast_cancer
cancer = load_breast_cancer()
import matplotlib.pyplot as plt
import numpy as np

現(xiàn)在，需要按照以下方式提供數(shù)據(jù)集

cancer = load_breast_cancer()
X_train, X_test, y_train,
y_test = train_test_split(cancer.data, cancer.target, random_state = 0)

在提供數(shù)據(jù)集之后，需要擬合可以如下完成的模型 -

forest = RandomForestClassifier(n_estimators = 50, random_state = 0)
forest.fit(X_train,y_train)

現(xiàn)在，獲得訓(xùn)練以及測試子集的準確性:如果增加估計器的數(shù)量，那么測試子集的準確性也會增加。

print('Accuracy on the training subset:(:.3f)',format(forest.score(X_train,y_train)))
print('Accuracy on the training subset:(:.3f)',format(forest.score(X_test,y_test)))

上面代碼，輸出結(jié)果如下所示 -

Accuracy on the training subset:(:.3f) 1.0
Accuracy on the training subset:(:.3f) 0.965034965034965

現(xiàn)在，與決策樹一樣，隨機森林具有 feature_importance 模塊，它將提供比決策樹更好的特征權(quán)重視圖。它可以如下繪制和可視化 -

n_features = cancer.data.shape[1]
plt.barh(range(n_features),forest.feature_importances_, align='center')
plt.yticks(np.arange(n_features),cancer.feature_names)
plt.xlabel('Feature Importance')
plt.ylabel('Feature')
plt.show()

執(zhí)行上面代碼，得到以下輸出結(jié)果 -

AI人工智能 隨機森林分類器

推薦文章

推薦教程

推薦課程

AI人工智能隨機森林分類器