Making - End - To - End - Project - Without - Pipeline - Jupyter Notebook
In [27]: df=pd.read_csv("titanic.csv")
In [28]: df.head()
Out[28]:
   PassengerId  Survived  Pclass  Name                                          Sex     Age   SibSp  Parch  Ticket            Fare     Cabin  Embarked
0  1            0         3       Braund, Mr. Owen Harris                       male    22.0  1      0      A/5 21171         7.2500   NaN    S
2  3            1         3       Heikkinen, Miss. Laina                        female  26.0  0      0      STON/O2. 3101282  7.9250   NaN    S
3  4            1         1       Futrelle, Mrs. Jacques Heath (Lily May Peel)  female  35.0  1      0      113803            53.1000  C123   S
localhost:8888/notebooks/100DaysMLCourse/Making_end_to_end_project_without_pipeline.ipynb 1/5
3/18/24, 12:17 AM Making_end_to_end_project_without_pipeline - Jupyter Notebook
In [30]: df.head()
Out[30]:
Survived Pclass Sex Age SibSp Parch Fare Embarked
In [32]: df.isnull().sum()
Out[32]: Survived 0
Pclass 0
Sex 0
Age 177
SibSp 0
Parch 0
Fare 0
Embarked 2
dtype: int64
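The counts above show missing values in Age (177) and Embarked (2). The cell that fills them is not shown here, so the following is only a minimal sketch of one common approach, assuming scikit-learn's `SimpleImputer` with the column mean for Age and the most frequent value for Embarked. The names `si_age`, `si_embarked`, and the `toy` frame are hypothetical and not taken from the notebook.

```python
import numpy as np
import pandas as pd
from sklearn.impute import SimpleImputer

# Made-up stand-in for the two Titanic columns that contain missing values.
toy = pd.DataFrame({
    "Age": [22.0, np.nan, 26.0, 35.0],
    "Embarked": ["S", "C", np.nan, "S"],
})

# Assumption: mean for the numeric column, most frequent value for the categorical one.
si_age = SimpleImputer(strategy="mean")
si_embarked = SimpleImputer(strategy="most_frequent")

# fit_transform expects 2-D input, hence the double brackets.
age_filled = si_age.fit_transform(toy[["Age"]])
embarked_filled = si_embarked.fit_transform(toy[["Embarked"]])

print(age_filled.ravel())       # missing Age replaced by the mean of 22, 26, 35
print(embarked_filled.ravel())  # missing Embarked replaced by "S"
```

In a train/test setting the imputers would be fitted on the training split and then applied to the test split with `transform`, so that test statistics do not leak into preprocessing.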
In [86]: # Apply one-hot encoding to Sex and Embarked because they are categorical.
# Note: scikit-learn 1.2+ renames `sparse` to `sparse_output` (`sparse` is removed in 1.4).
ohe_sex = OneHotEncoder(sparse=False, handle_unknown='ignore')
ohe_embarked = OneHotEncoder(sparse=False, handle_unknown='ignore')
# Fit the encoders on the training data only.
x_train_sex = ohe_sex.fit_transform(x_train[['Sex']])
x_train_embarked = ohe_embarked.fit_transform(x_train_embarked)
# Reuse the fitted encoders on the test data: transform, not fit_transform.
x_test_sex = ohe_sex.transform(x_test[['Sex']])
x_test_embarked = ohe_embarked.transform(x_test_embarked)
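To illustrate why the encoder should be fitted once on the training data and only applied to the test data, here is a small sketch on made-up values. The `train` and `test` frames are hypothetical. The encoder is left at its default sparse output and converted with `.toarray()`, which sidesteps the `sparse` versus `sparse_output` naming difference between scikit-learn versions.

```python
import pandas as pd
from sklearn.preprocessing import OneHotEncoder

# Made-up data: "Q" appears in the test split but never in the training split.
train = pd.DataFrame({"Embarked": ["S", "C", "S"]})
test = pd.DataFrame({"Embarked": ["C", "Q"]})

ohe = OneHotEncoder(handle_unknown="ignore")

# Learn the categories from the training data only.
train_enc = ohe.fit_transform(train).toarray()

# Reuse the learned categories; the unseen "Q" becomes an all-zero row.
test_enc = ohe.transform(test).toarray()

print(ohe.categories_)  # categories learned from train: C and S
print(test_enc)         # [[1. 0.], [0. 0.]]
```

Calling `fit_transform` on the test split instead would relearn the categories from the test data, so the column order and count could differ from what the model saw during training.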
In [113]: x_train_sex
In [95]: x_train_transformed=np.concatenate((x_train_rem,x_train_age,x_train_sex,x_train_embarked),axis=1)
x_test_transformed=np.concatenate((x_test_rem,x_test_age,x_test_sex,x_test_embarked),axis=1)
In [98]: x_test_transformed.shape
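The concatenation step can be sketched on two rows as follows. This assumes that the "rem" block holds the columns needing no imputation or encoding (Pclass, SibSp, Parch, Fare); the notebook does not show how `x_train_rem` is built, so that part is a guess. The one-hot arrays are hand-written stand-ins, and the names `x_rem`, `x_age`, `x_sex`, `x_embarked` are hypothetical.

```python
import numpy as np
import pandas as pd

# Two rows taken from the df.head() output shown earlier.
x = pd.DataFrame({
    "Pclass": [3, 3],
    "Sex": ["male", "female"],
    "Age": [22.0, 26.0],
    "SibSp": [1, 0],
    "Parch": [0, 0],
    "Fare": [7.25, 7.925],
    "Embarked": ["S", "S"],
})

# Assumption: the remaining columns are passed through unchanged.
x_rem = x.drop(columns=["Sex", "Age", "Embarked"])
x_age = x[["Age"]].to_numpy()

# Hand-written stand-ins for the encoder output.
x_sex = np.array([[0.0, 1.0], [1.0, 0.0]])                 # columns: female, male
x_embarked = np.array([[0.0, 0.0, 1.0], [0.0, 0.0, 1.0]])  # columns: C, Q, S

# Stack the blocks side by side: 4 + 1 + 2 + 3 columns.
x_transformed = np.concatenate((x_rem, x_age, x_sex, x_embarked), axis=1)
print(x_transformed.shape)  # (2, 10)
```

The train and test arrays must be assembled in the same block order, since the resulting NumPy array carries no column names for the model to match on.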
In [102]: clf=DecisionTreeClassifier()
clf.fit(x_train_transformed,y_train)
Out[102]: DecisionTreeClassifier()
Out[109]: 74.86033519553072
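The cell that produced the score above is not shown. A value like this is consistent with an accuracy expressed as a percentage, so the following sketch assumes `accuracy_score` from scikit-learn multiplied by 100. The labels below are made up for illustration.

```python
from sklearn.metrics import accuracy_score

# Made-up labels: three of the four predictions match.
y_true = [0, 1, 1, 0]
y_pred = [0, 1, 0, 0]

# accuracy_score returns the fraction of correct predictions.
acc_percent = accuracy_score(y_true, y_pred) * 100
print(acc_percent)  # 75.0
```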
In [112]: with open('models/ohe_sex.pkl', 'wb') as f:
    pickle.dump(ohe_sex, f)
with open('models/ohe_embarked.pkl', 'wb') as f:
    pickle.dump(ohe_embarked, f)
with open('models/clf.pkl', 'wb') as f:
    pickle.dump(clf, f)
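Saving the encoders alongside the classifier matters because new inputs must go through exactly the same transformations before prediction. The round trip can be sketched as below, using a temporary directory instead of `models/` and a toy model trained on Sex alone. The data and the `ohe_loaded` and `clf_loaded` names are hypothetical.

```python
import pickle
import tempfile
from pathlib import Path

import numpy as np
from sklearn.preprocessing import OneHotEncoder
from sklearn.tree import DecisionTreeClassifier

# Made-up training data: a single categorical feature and a binary label.
sex = np.array([["male"], ["female"], ["female"], ["male"]])
y = np.array([0, 1, 1, 0])

ohe_sex = OneHotEncoder(handle_unknown="ignore")
clf = DecisionTreeClassifier(random_state=0)
clf.fit(ohe_sex.fit_transform(sex).toarray(), y)

with tempfile.TemporaryDirectory() as tmp:
    models = Path(tmp)
    # Save each fitted object to its own file.
    for name, obj in {"ohe_sex": ohe_sex, "clf": clf}.items():
        with open(models / f"{name}.pkl", "wb") as f:
            pickle.dump(obj, f)
    # Load them back, as a separate prediction script would.
    with open(models / "ohe_sex.pkl", "rb") as f:
        ohe_loaded = pickle.load(f)
    with open(models / "clf.pkl", "rb") as f:
        clf_loaded = pickle.load(f)

# Encode a new sample with the loaded encoder, then predict.
new_sample = np.array([["female"]])
pred = clf_loaded.predict(ohe_loaded.transform(new_sample).toarray())
print(pred)  # [1]
```

Pickle files should only be loaded from trusted sources, and ideally with the same scikit-learn version that created them.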