1) Identify The Dependent Variable in The Above Data: Ans
1) Identify The Dependent Variable in The Above Data: Ans
Ans: RESPONSE
3) Seed and split used by our group according to the convention informed earlier
Ans: Seed= (18123+18073+18194+18087)/4 = 18119
Split= 70: 30
4) Running various classifiers (Logistic Regression, Classification Tree with Cross validation,
Random Forests and Neural Nets) and noting the Classification Accuracy on both training and test
datasets and the significant variables in the model
Ans: Classification Accuracy
Training Dataset Validation Dataset
Neural Nets
Significant Variables:
AUC
Random Forests
Neural Nets
6) Which classifier gives the best model? Note down the significant variables from this model. Your
model must fulfil the assumptions required for developing that model.
Ans:
7) If you wish to find all potential defaulters, how much minimum records you need to sift through
based on your model.
Ans:
8) A customer approaches the bank for credit. His details are as follows:
Checking Account > 200 DM;
History: Delay in Paying Off;
Savings Account: Greater than 1000 DM;
Purpose of Credit: New Car;
Amount: 1000;
Employment: 4-7 Years;
Instalment Rate: 3;
Marital Status: Male Married;
Co-Applicant: Applicant has a guarantor;
Present Residence: 2: 2-3 years;
Real Estate: Applicant owns no property;
Age: 35;
Other Instalments: No;
Residence: No; ;
Number of Credits: 2;
Job: Skilled Employee;
Number of Dependents: 2;
Telephone: Owns a phone;
Foreign: No.
Should the bank give him loan or not