Setup: Step-by-Step Instructions for How to Create the Solutions File from the Code Provided

This document provides step-by-step instructions for building the machine learning models used to produce the final predictions. It involves preprocessing the data to generate feature sets, training individual models using libraries such as liblinear and libsvm, combining the models into ensembles, and generating predictions on the test data to create the final submission file. The process uses Java and R code to extract features, split the data, train models, make predictions, and evaluate results across multiple folds of cross-validation.


Contents:

  Setup
  Prerequisites
  Compile Java Code
  Generate Top Features Using the Mutual Information Criterion
  Generate the seven data sets required for liblinear/libsvm models
  Build Individual Models and Predict for Test Data
    Liblinear Models
    Libsvm Models
    Naive Bayes model #10
    Weighted k-NN models (11) and (12)
    Multinomial Naive Bayes model #20
    Weighted k-NN models (21) and (22)
  Train the Two Ensembles
  Predict for Test Data Using the Two Ensembles
  Building the Final Submission File

Setup
EMC_ROOT refers to the directory containing the Java/R code and data.
(i) Extract emc.zip to <EMC_ROOT>.
(ii) Copy all data files (the files provided by Kaggle/EMC) to the <EMC_ROOT>/data directory.
(iii) In com.emc.util.Config.java, set the value of the EMC_ROOT field appropriately.
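As an illustration, on a Unix-like system steps (i) and (ii) might look like the following sketch; the source path for the Kaggle/EMC files is a placeholder, and the extraction target may need adjusting depending on how emc.zip is structured:

$ unzip emc.zip -d <EMC_ROOT>
$ mkdir -p <EMC_ROOT>/data
$ cp /path/to/kaggle_emc_files/* <EMC_ROOT>/data/    # copy the files provided by Kaggle/EMC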

Prerequisites
(1) JDK 1.6 or higher
(2) Java's Ant build tool
(3) R version 2.14 or higher
(4) R's glmnet package
(5) liblinear (installed)
(6) libsvm (installed)
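A rough sketch of setting up prerequisites (4)-(6) on a Linux machine is shown below; the source-tree paths are placeholders, and liblinear/libsvm are built from their source distributions with make:

# R's glmnet package
$ Rscript -e "install.packages('glmnet', repos='https://cran.r-project.org')"
# liblinear: make produces the train and predict binaries used later
$ cd /path/to/liblinear && make
# libsvm: make produces the svm-train and svm-predict binaries used later
$ cd /path/to/libsvm && make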

Compile Java Code


$ cd <EMC_ROOT>/java
$ ant

After running ant, the <EMC_ROOT>/java/classes directory will contain the compiled Java classes.

Generate Top Features Using the Mutual Information Criterion


Run the Java class com.emc.featureSelection.MutualInformationBasedFeatureSelector:

$ cd <EMC_ROOT>/java
$ java -Xmx1024m -cp classes:lib/commons-math-2.2.jar com.emc.featureSelection.MutualInformationBasedFeatureSelector

Generate the seven data sets required for liblinear/libsvm models:


(1) Run the following Java classes to generate the training and cross-validation files:
    i.   com.emc.liblinear.LiblinearFileGen
    ii.  com.emc.liblinear.LibLinear_TF_IDF__minTermCount
    iii. com.emc.liblinear.LiblinearFileGen__minTermCount
    iv.  com.emc.liblinear.LibLinear_TF_IDF__TfEqualOne_minTermCount
    v.   com.emc.liblinear.LibLinear_TF_IDF__featureSelection
    vi.  com.emc.liblinear.LibLinear_raw__featureSelection
    vii. com.emc.liblinear.LibLinear_TF_IDF__TfEqualOne__featureSelection

    Example:
    $ cd <EMC_ROOT>/java
    $ java -Xmx1024m -cp classes:lib/commons-math-2.2.jar com.emc.liblinear.LiblinearFileGen

    It will generate 11 files:
    -> five files for training [one for each fold of the 5-fold cross-validation];
       each file contains 80% of the training data; file name: *_tr_[1-5].csv
    -> five files for cross-validation [one for each fold of the 5-fold cross-validation];
       each file contains 20% of the training data; file name: *_cv_[1-5].csv
    -> one file containing the entire training data; file name: *_tr_-1.csv

(2) Run the following Java classes to generate the test files:
    i.   com.emc.liblinear.LiblinearFileGen_test
    ii.  com.emc.liblinear.LibLinear_TF_IDF__minTermCount_test
    iii. com.emc.liblinear.LiblinearFileGen__minTermCount_test
    iv.  com.emc.liblinear.LibLinear_TF_IDF__TfEqualOne_minTermCount_test
    v.   com.emc.liblinear.LibLinear_TF_IDF__featureSelection__test
    vi.  com.emc.liblinear.LibLinear_raw__featureSelection__test
    vii. com.emc.liblinear.LibLinear_TF_IDF__TfEqualOne__featureSelection__test

    Example:
    $ cd <EMC_ROOT>/java
    $ java -Xmx1024m -cp classes:lib/commons-math-2.2.jar com.emc.liblinear.LiblinearFileGen_test

    It will generate one file:
    -> one file for the test data; file name: *_test.csv
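If you want to run all fourteen generator classes in one pass, a small shell loop such as the sketch below simply repeats the example command once per class name listed above; it adds nothing beyond the commands already shown:

$ cd <EMC_ROOT>/java
$ for cls in \
    LiblinearFileGen \
    LibLinear_TF_IDF__minTermCount \
    LiblinearFileGen__minTermCount \
    LibLinear_TF_IDF__TfEqualOne_minTermCount \
    LibLinear_TF_IDF__featureSelection \
    LibLinear_raw__featureSelection \
    LibLinear_TF_IDF__TfEqualOne__featureSelection \
    LiblinearFileGen_test \
    LibLinear_TF_IDF__minTermCount_test \
    LiblinearFileGen__minTermCount_test \
    LibLinear_TF_IDF__TfEqualOne_minTermCount_test \
    LibLinear_TF_IDF__featureSelection__test \
    LibLinear_raw__featureSelection__test \
    LibLinear_TF_IDF__TfEqualOne__featureSelection__test; do
      # each class writes its training/cross-validation or test CSV files as described above
      java -Xmx1024m -cp classes:lib/commons-math-2.2.jar com.emc.liblinear.$cls
  done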

Build Individual Models and Predict for Test Data


Liblinear Models
Build each liblinear model described in the Model table in the emc_my_solution document by following the steps outlined below.

Concrete steps for building models:

Step (1): Train using liblinear's train command.
Example:
./train -s 1 -c 1 <EMC_ROOT>/outputFiles/liblinear/liblinear_tr_<cvf>.csv liblinear_tr_<cvf>__s-1_c-1.model
-> Use the training files corresponding to the model's data set. Each model's data set can be found in the Model table in the emc_my_solution document. (These files were generated in the prior section "Generate the seven data sets required for liblinear/libsvm models".)
-> Use the model parameters from the Model table in the emc_my_solution document.
-> Perform this step 5 times [once for each fold (<cvf> = 1 to 5)].

Step (2): Predict for the cross-validation data, using liblinear's predict command.
Example:
./predict -b 1 <EMC_ROOT>/outputFiles/liblinear/liblinear_cv_<cvf>.csv liblinear_tr_<cvf>__s-1_c-1.model liblinear_tr_<cvf>__s-1_c-1.out
-> Use the cross-validation files corresponding to the model's data set. Each model's data set can be found in the Model table in the emc_my_solution document. (These files were generated in the prior section "Generate the seven data sets required for liblinear/libsvm models".)
-> Perform this step 5 times [once for each fold (<cvf> = 1 to 5)].

Step (3): Run the com.emc.liblinear.LiblinearCalibration Java class to generate the output CSV file. This class reads the prediction files from step (2) and generates the output CSV. Before running this class, make the following two changes to LiblinearCalibration.java:
(i) Modify line 107 to point to the liblinear prediction files from step (2):
    BufferedReader br = new BufferedReader(new FileReader("..."));
(ii) Modify line 37 so that the file name matches the model's file name in emc_train_e9b.r [one of the data<model_number> variables]:
    fileWriter = new FileWriter("...");
$ cd <EMC_ROOT>/java
$ ant
$ java -Xmx1024m -cp classes:lib/commons-math-2.2.jar com.emc.liblinear.LiblinearCalibration

Concrete steps for generating predictions for test data:

Step (1): Train using liblinear's train command.
Example:
./train -s 1 -c 1 <EMC_ROOT>/outputFiles/liblinear/liblinear_tr_-1.csv liblinear_tr_-1__s-1_c-1.model
-> Use the training file containing the entire training data (*_tr_-1.csv) corresponding to the model's data set. Each model's data set can be found in the Model table in the emc_my_solution document. (These files were generated in the prior section "Generate the seven data sets required for liblinear/libsvm models".)
-> Use the model parameters from the Model table in the emc_my_solution document.

Step (2): Predict for the test data, using liblinear's predict command.
Example:
./predict -b 1 <EMC_ROOT>/outputFiles/liblinear/liblinear_test.csv liblinear_tr_-1__s-1_c-1.model liblinear_tr_-1__s-1_c-1.out
-> Use the test file corresponding to the model's data set. Each model's data set can be found in the Model table in the emc_my_solution document. (These files were generated in the prior section "Generate the seven data sets required for liblinear/libsvm models".)

Step (3): Run the com.emc.liblinear.LiblinearExp_subm Java class to generate the output CSV file. This class reads the prediction files from step (2) and generates the output CSV. Before running this class, make the following two changes:
(i) Modify line 24 to point to the liblinear prediction files from step (2):
    BufferedReader br = new BufferedReader(new FileReader("..."));
(ii) Modify line 34 so that the file name matches the model's file name in either emc_sub_e9b.r or emc_sub_e9a_FS-10k-all-15.r [one of the data<model_number> variables]:
    FileWriter fileWriter = new FileWriter("...");
$ cd <EMC_ROOT>/java
$ ant
$ java -Xmx1024m -cp classes:lib/commons-math-2.2.jar com.emc.liblinear.LiblinearExp_subm
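As a convenience, steps (1) and (2) of the cross-validation procedure for a single liblinear model can be wrapped in a fold loop. The sketch below uses the -s 1 -c 1 parameters and file names from the example above; replace them with the values from the Model table for each model, and note that the liblinear directory is a placeholder:

$ cd /path/to/liblinear    # placeholder: directory containing liblinear's train and predict binaries
$ for cvf in 1 2 3 4 5; do
      ./train -s 1 -c 1 <EMC_ROOT>/outputFiles/liblinear/liblinear_tr_${cvf}.csv liblinear_tr_${cvf}__s-1_c-1.model
      ./predict -b 1 <EMC_ROOT>/outputFiles/liblinear/liblinear_cv_${cvf}.csv liblinear_tr_${cvf}__s-1_c-1.model liblinear_tr_${cvf}__s-1_c-1.out
  done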

Libsvm Models
The steps for libsvm are the same as for liblinear, except for the different commands used for training and prediction. For completeness, the libsvm steps are documented here. Build each libsvm model described in the Model table in the emc_my_solution document by following the steps outlined below.

Concrete steps for building models:

Step (1): Train using libsvm's svm-train command.
Example:
./svm-train -m 2000 -c 8 -g 0.1 -b 1 <EMC_ROOT>/outputFiles/liblinear/raw_minTermCount-3_tr_<cvf>.csv raw_minTermCount-3_tr_<cvf>__c-8_g-0.1.model
-> Use the training files corresponding to the model's data set. Each model's data set can be found in the Model table in the emc_my_solution document. (These files were generated in the prior section "Generate the seven data sets required for liblinear/libsvm models".)
-> Use the model parameters from the Model table in the emc_my_solution document.
-> Perform this step 5 times [once for each fold (<cvf> = 1 to 5)].

Step (2): Predict for the cross-validation data, using libsvm's svm-predict command.
Example:
./svm-predict -b 1 <EMC_ROOT>/outputFiles/liblinear/raw_minTermCount-3_cv_<cvf>.csv raw_minTermCount-3_tr_<cvf>__c-8_g-0.1.model raw_minTermCount-3_tr_<cvf>__c-8_g-0.1.out
-> Use the cross-validation files corresponding to the model's data set. Each model's data set can be found in the Model table in the emc_my_solution document. (These files were generated in the prior section "Generate the seven data sets required for liblinear/libsvm models".)
-> Perform this step 5 times [once for each fold (<cvf> = 1 to 5)].

Step (3): Same as for liblinear.

Concrete steps for generating predictions for test data:

Step (1): Train using libsvm's svm-train command.
Example:
./svm-train -m 2000 -c 8 -g 0.1 -b 1 <EMC_ROOT>/outputFiles/liblinear/raw_minTermCount-3_tr_-1.csv raw_minTermCount-3_tr_-1__c-8_g-0.1.model
-> Use the training file containing the entire training data (*_tr_-1.csv) corresponding to the model's data set. Each model's data set can be found in the Model table in the emc_my_solution document. (These files were generated in the prior section "Generate the seven data sets required for liblinear/libsvm models".)
-> Use the model parameters from the Model table in the emc_my_solution document.

Step (2): Predict for the test data, using libsvm's svm-predict command.
Example:
./svm-predict -b 1 <EMC_ROOT>/outputFiles/liblinear/raw_minTermCount-3_test.csv raw_minTermCount-3_tr_-1__c-8_g-0.1.model raw_minTermCount-3_tr_-1__c-8_g-0.1.out
-> Use the test file corresponding to the model's data set. Each model's data set can be found in the Model table in the emc_my_solution document. (These files were generated in the prior section "Generate the seven data sets required for liblinear/libsvm models".)

Step (3): Same as for liblinear.
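The same kind of fold loop can be used for libsvm's cross-validation steps (1) and (2); again, the -m/-c/-g parameters and file names below are those of the example and should come from the Model table for each model, and the libsvm directory is a placeholder:

$ cd /path/to/libsvm       # placeholder: directory containing libsvm's svm-train and svm-predict binaries
$ for cvf in 1 2 3 4 5; do
      ./svm-train -m 2000 -c 8 -g 0.1 -b 1 <EMC_ROOT>/outputFiles/liblinear/raw_minTermCount-3_tr_${cvf}.csv raw_minTermCount-3_tr_${cvf}__c-8_g-0.1.model
      ./svm-predict -b 1 <EMC_ROOT>/outputFiles/liblinear/raw_minTermCount-3_cv_${cvf}.csv raw_minTermCount-3_tr_${cvf}__c-8_g-0.1.model raw_minTermCount-3_tr_${cvf}__c-8_g-0.1.out
  done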

Naive Bayes model #10


Concrete steps for building the model:
Steps (1), (2), and (3): Run the Java class com.emc.nb.NBExp1
$ cd <EMC_ROOT>/java
$ java -Xmx1024m -cp classes:lib/commons-math-2.2.jar com.emc.nb.NBExp1

Concrete steps for generating predictions for test data:
Steps (1), (2), and (3): Run the Java class com.emc.nb.NBExp1_Subm
$ cd <EMC_ROOT>/java
$ java -Xmx1024m -cp classes:lib/commons-math-2.2.jar com.emc.nb.NBExp1_Subm

Weighted k-NN models (11) and (12)


Concrete steps for building models:
Steps (1), (2), and (3): Run the Java class com.emc.knn.KnnExp1. This class automatically builds both models 11 and 12.
Command-line arguments:
  argument 1: number of threads [set as high as possible, i.e. the number of cores]
  argument 2: sleepIntervalInSeconds [set to 60] [used for printing progress]
$ cd <EMC_ROOT>/java
$ java -Xmx6000m -cp classes:lib/commons-math-2.2.jar com.emc.knn.KnnExp1 4 60

Concrete steps for generating predictions for test data:
Steps (1), (2), and (3): Run the Java class com.emc.knn.KnnExp1_Subm. This class automatically generates test predictions for both models 11 and 12.
Command-line arguments:
  argument 1: number of threads [set as high as possible, i.e. the number of cores]
  argument 2: sleepIntervalInSeconds [set to 60] [used for printing progress]
$ cd <EMC_ROOT>/java
$ java -Xmx6000m -cp classes:lib/commons-math-2.2.jar com.emc.knn.KnnExp1_Subm 4 60

Multinomial Naive Bayes model #20


Concrete steps for building the model:
Steps (1), (2), and (3): Run the Java class com.emc.nb.exp1_withTF.NBExp1__withTF
$ cd <EMC_ROOT>/java
$ java -Xmx1024m -cp classes:lib/commons-math-2.2.jar com.emc.nb.exp1_withTF.NBExp1__withTF

Concrete steps for generating predictions for test data:
Steps (1), (2), and (3): Run the Java class com.emc.nb.exp1_withTF.NBExp1__withTF_Subm
$ cd <EMC_ROOT>/java
$ java -Xmx1024m -cp classes:lib/commons-math-2.2.jar com.emc.nb.exp1_withTF.NBExp1__withTF_Subm

Weighted k-NN models (21) and (22)


Concrete steps for building models:
Steps (1), (2), and (3): Run the Java class com.emc.knn.KnnExp2_win. This class automatically builds both models 21 and 22.
Command-line arguments:
  argument 1: number of threads [set as high as possible, i.e. the number of cores]
  argument 2: sleepIntervalInSeconds [set to 60] [used for printing progress]
$ cd <EMC_ROOT>/java
$ java -Xmx6000m -cp classes:lib/commons-math-2.2.jar com.emc.knn.KnnExp2_win 4 60

Concrete steps for generating predictions for test data:
Steps (1), (2), and (3): Run the Java class com.emc.knn.KnnExp2_win_Subm. This class automatically generates test predictions for both models 21 and 22.
Command-line arguments:
  argument 1: number of threads [set as high as possible, i.e. the number of cores]
  argument 2: sleepIntervalInSeconds [set to 60] [used for printing progress]
$ cd <EMC_ROOT>/java
$ java -Xmx6000m -cp classes:lib/commons-math-2.2.jar com.emc.knn.KnnExp2_win_Subm 4 60

Train the Two Ensembles


Ensemble (1): e9a_FS_10k_all_15
Run the following commands in R:
  setwd('<EMC_ROOT>/R')
  source('emc_train_e9a_FS-10k-all-15.r')
  ens.glmnetTest(cvFold=5, alpha=0.5)

Ensemble (2): e9b
Run the following commands in R:
  setwd('<EMC_ROOT>/R')
  source('emc_train_e9b.r')
  ens.glmnetTest(cvFold=5, alpha=0.5)
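To run the ensemble training non-interactively from the shell instead of inside an R session, the same calls can be wrapped with Rscript -e (a minimal sketch; it only repeats the R commands above):

$ Rscript -e "setwd('<EMC_ROOT>/R'); source('emc_train_e9a_FS-10k-all-15.r'); ens.glmnetTest(cvFold=5, alpha=0.5)"
$ Rscript -e "setwd('<EMC_ROOT>/R'); source('emc_train_e9b.r'); ens.glmnetTest(cvFold=5, alpha=0.5)"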

Predict for Test Data Using the Two Ensembles


Ensemble (1): e9a_FS_10k_all_15
Run the following commands in R:
  setwd('<EMC_ROOT>/R')
  source('emc_sub_e9a_FS-10k-all-15.r')
  build_sub()
Predictions will be saved in the file e9a_FS-10k-all-15.csv.

Ensemble (2): e9b
Run the following commands in R:
  setwd('<EMC_ROOT>/R')
  source('emc_sub_e9b.r')
  build_sub()
Predictions will be saved in the file e9b.csv.

Building the Final Submission File


Take the weighted average of the predictions from the two ensembles using the following R code snippet:

e9a_FS_10k_all_15 = read.csv('e9a_FS-10k-all-15.csv')
e9b = read.csv('e9b.csv')
tmp = (0.3 * e9a_FS_10k_all_15) + (0.7 * e9b)
weighted_average = e9b
weighted_average[,2:98] = tmp[,2:98]
write.table(weighted_average, 'final_submission.csv', row.names=F, col.names=T, sep=',')
