Random Forest
Random Forest
y_pred = clf.predict(X_test)
QUESTION 2:
df = pd.DataFrame(data.data, columns=data.feature_names)
df['species'] = data.target
print(df.head())
data = load_iris()
X = data.data
y = data.target
y_pred = clf.predict(X_test)
OUTPUT:
sepal length (cm) sepal width (cm) petal length (cm) petal width
(cm) \
0 5.1 3.5 1.4
0.2
1 4.9 3.0 1.4
0.2
2 4.7 3.2 1.3
0.2
3 4.6 3.1 1.5
0.2
4 5.0 3.6 1.4
0.2
species
0 0
1 0
2 0
3 0
4 0
Accuracy on Test Set: 1.0
QUESTION 3:
from sklearn.datasets import load_breast_cancer
import pandas as pd
data = load_breast_cancer()
X = data.data
y = data.target
X_train, X_test, y_train, y_test = train_test_split(X, y,
test_size=0.2, random_state=42)
y_pred = clf.predict(X_test)
OUTPUT:
[5 rows x 31 columns]
Accuracy on Test Set: 0.9649122807017544
QUESTION 4:
df = pd.DataFrame(data.data, columns=data.feature_names)
df['target'] = data.target
print(df.head())
y = data.target
X_train, X_test, y_train, y_test = train_test_split(X, y,
test_size=0.2, random_state=42)
y_pred = clf.predict(X_test)