0% found this document useful (0 votes)

24 views6 pages

IEEE

The document discusses predicting and analyzing campus placements using machine learning. It compares the performance of logistic regression, decision tree, K nearest neighbors, and random forest models on campus placement prediction tasks. The highest accuracy was achieved using a random forest model.

Uploaded by

Venkat Karthik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

0% found this document useful (0 votes)

24 views6 pages

IEEE

Uploaded by

Venkat Karthik

We take content rights seriously. If you suspect this is your content, claim it here.

Available Formats

Download as PDF, TXT or read online on Scribd

You are on page 1/ 6

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

net/publication/360130609

Campus Placements Prediction & Analysis using Machine Learning

Conference Paper · March 2022

DOI: 10.1109/ESCI53509.2022.9758214

CITATIONS READS
3 1,770

1 author:

Priyanka Shahane

6 PUBLICATIONS 14 CITATIONS

SEE PROFILE

All content following this page was uploaded by Priyanka Shahane on 25 October 2022.

The user has requested enhancement of the downloaded file.

2022 International Conference on Emerging Smart Computing and Informatics (ESCI)
AISSMS Institute of Information Technology, Pune, India. Mar 9-11, 2022

Campus Placements Prediction & Analysis using

Machine Learning
Priyanka Shahane
Department of Artificial Intelligence & Data Science,
AISSMS Institute of Information Technology,
Pune, Maharashtra, India
[email protected]
2022 International Conference on Emerging Smart Computing and Informatics (ESCI) | 978-1-6654-0073-2/22/$31.00 ©2022 IEEE | DOI: 10.1109/ESCI53509.2022.9758214

Abstract — Campus placement is an activity of participating, considered the features such as matriculation score, senior
identifying and hiring young talent for internships and entry secondary score, scores of the subjects in various semesters &
level positions. demographics. Here, dataset used is of GuruNanak Dev
Engineering College (GNDEC), Ludhiana. This model gave an
Reputation and yearly admissions of the institute invariably accuracy of around 83.33%.
depend upon the placements provided by the institute to the
students. Therefore, most of the institutions, assiduously, try to Elayidom et. al. constructed multi way decision trees using
boost their placement department in order to improve their various parameters such as branch, sector, sex & rank. Here, the
organization on a full scale. Any assistance during this specific dataset used is received from the National Technical Manpower
space can have a good impact on the institute’s capability to Information System (NTMIS) via the Nodal center. This model
position it’s students. gave an accuracy of 80%.
In this study, the target is to analyze student's placement data Nagaria et. al. used the Random Forest model where he has
of last year and use it to determine the probability of campus considered various parameters such as degree type, work
placement of the present students. For this we have experimented experience, e test percentage, specialization, MBA percentage.
with four different machine learning algorithms i.e. Logistic The dataset used is taken from Kaggle. This model gave the
Regression, Decision Tree, K Nearest Neighbours and Random highest accuracy of 85 %.
Forest.
S.Venkatachalam et. al. designed the fuzzy inference system
Index Terms — Machine Learning, Campus placements using Naive Bayes algorithm for campus placement prediction.
prediction, Logistic Regression, Decision Tree, KNN, Random The dataset is prepared with the help of primary & secondary
Forest data collection sources. This model gave the highest accuracy of
86.15%.
I. INTRODUCTION
Manvitha et. al. designed used the Random Forest model
NOWADAYS the number of educational institutes is where she has considered various parameters such as credit ,
growing day by day. The aim of each higher educational backlogs , whether placed or not, b.tech %. The dataset is
institute is to help their students to get a well-paid job through collected from the placement department of Sreenidhi Institute
their placement cell. One of the biggest challenges that higher of Science and Technology. This model gave the highest
learning institutes face these days is to uplift the placement accuracy of 86%.
performance of scholars.
The goal of this system is to predict whether the student III. METHODOLOGY
will get a campus placement or not based on various The steps involved in this system are as follows,
parameters such as gender, SSC percentage, HSC percentage,
HSC stream, degree percentage, degree type, work experience A. Data Acquisition:
& e-test percentage. The campus placement dataset is collected from Kaggle
This research focuses on various algorithms of machine website. Here is the link for the dataset:
learning such as Logistic Regression, Decision Tree, K-Nearest https://www.kaggle.com/benroshan/factors-affecting-campus
Neighbours and Random Forest in order to produce placement?select=Placement_Data_Full_Class.csv
economical and correct results for campus placement The dataset consists of various attributes such as Serial
prediction. This system follows a supervised machine learning Number, Gender, SSC percentage, SSC Board - Central/ Others,
approach as it uses class labelled data for training the HSC percentage, HSC Board, HSC Specialization, Degree
classification algorithm. Percentage, UG Degree Stream, Work Experience, E -test
Percentage, Degree Specialization, Degree Percentage,
II. LITERATURE SURVEY Placement Status & Salary. The size of dataset is 19.71 KB & it
Sharma et. al. developed the placement predictor system has total 215 records.
i.e. PPS by using a model of logistic regression. For this he has
1) Handling missing values:

978-1-6654-0073-2/22/$31.00 ©2022 IEEE 1

Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on May 02,2022 at 10:01:58 UTC from IEEE Xplore. Restrictions apply.
In our dataset missing values are present only in the salary 3) Feature Selection:
column as these values correspond to the students who didn’t Here, various features are visualized to understand their
get placed in any placement drive. So it is assumed that the correlation with the target feature.
missing values in Salary Column are Zero & replaced them by
zero using fillna(0,inplace=True) function in Python.
2) Handling categorical data:
Since we cannot deal with categorical values directly,
mapping is done for attributes having categorical values.
Gender attribute has values M (Male) & Female (M). Here,
M is replaced by 0 & F is replaced by 1. SSC & HSC Board
attributes has values ‘Central’ & ‘Other.’ Here, Central is
replaced by 1 & Other is replaced by 0. Work Experience
attribute has values ‘Yes’ & ‘No’. Here, ‘Yes’ is replaced by 1
and ‘No’ is replaced by 0. Degree specialization attribute has
values ‘Marketing & Finance’ & ‘Marketing & HR’. Here, Fig. 2. M/F ratio
‘Marketing & Finance’ is replaced by 1 and ‘Marketing & HR’
is replaced by 0. Status attribute has values ‘Placed’ and ‘Not Here, male : female ratio for one batch of students is
Placed’. Here, ‘Placed’ is replaced by 1 and ‘Not Placed’ is approximately equal to 2. It means that there are 2 male
replaced by 0. This is achieved through map function in candidates appearing for placement drives for every 1 female
Python. candidate.
For e.g.,
x df['gender']=df['gender'].map({'M':0,'F':1})
x df['ssc_b']=df['ssc_b'].map({'Central':1,'Others':0})
x df['workex']=df['workex'].map({'Yes':1,'No':0})

Fig. 3. Placement count vs. gender

From the above graph it can be concluded that the count of

placed male candidates in a batch is higher as compared to
female candidates & the placement count is dependent on
gender.

Fig. 1. Architecture Diagram Fig. 4. 10th standard percentage distribution

Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on May 02,2022 at 10:01:58 UTC from IEEE Xplore. Restrictions apply.
In the above graph, class 1 represents students having
scores between 80-100%, class 2 represents students having
scores between 60-80% and class 3 represents students having
less than 60 % score in 10th standard.

Fig. 7. Placement count vs. 12th percentage

Fig. 5. Placement count vs. 10th percentage From the above graph, it's observed that all the students
having scores between 80-100% in 12th standard got placed.
From the above graph, it's observed that all the students Very few students having scores between 60-80% in 12th
having scores between 80-100% in 10th standard got placed. standard couldn’t get placed. Whereas, most of the students
Very few students having scores between 60-80% in 10th having below 60% score in 12th standard couldn’t get placed.
standard couldn’t get placed. Whereas, most of the students
having below 60% score in 10th standard couldn’t get placed.

Fig. 6. 12th standard percentage distribution

Fig. 8. UG percentage distribution
In the above graph, class 1 represents students having
scores between 80-100% , class 2 represents students having In the above graph, class 1 represents students having scores
scores between 60-80% and class 3 represents students having between 80-100%, class 2 represents students having scores
less than 60 % score in 12th standard. between 60-80% and class 3 represents students having less than
60 % score in UG degree.

Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on May 02,2022 at 10:01:58 UTC from IEEE Xplore. Restrictions apply.
Fig. 11. Placement count vs. MBA percentage

Fig. 9. Placement count vs. UG percentage In the above graph we can see that more students from class
2 got placed as compared to class 3.
From the above graph, it's observed that most of the
students having scores between 80-100% in UG got placed. Hence, it is clear that placement count of the students is
Very few students having scores between 60-80% in UG dependent on various features such as Gender, SSC percentage,
couldn’t get placed. Whereas, most of the students having SSC Board - Central/ Others, HSC percentage, HSC Board,
below 60% score in UG couldn’t get placed. HSC Specialization, Degree Percentage, UG Degree Stream,
Work Experience, E -test Percentage, Degree Specialization,
Degree Percentage.
4) Split data:
Here, data is divided into two parts i.e. training data &
testing data. Where 80 % data is taken for training our machine
learning algorithm and remaining 20 % data is used for testing
whether our trained machine learning model is working
correctly or not.
5) Machine Learning Algorithm:
a) Logistic Regression:
Logistic regression is a statistical method used to determine
the outcome of a dependent variable (y) based on the values of
independent variable (x).
In our problem dependent variable is placement status and
independent variables are the features selected by us in the
previous step.
This algorithm is mostly used for the problems of binary
classification.
b) Decision Tree:
Fig. 10. MBA percentage distribution A decision tree is a graph like a tree where nodes represent
the position where we select the feature and ask a question,
After studying MBA percentage data it is observed that no edges represent the answers of the question; and the leaves
student has secured more than 80% marks. So the class 1 data represent the final output or label of the class.
isn’t available for percentage of MBA.
c) KNN:
K-NN stores all the training data into different classes based
on the class labels and classifies new data by checking its
similarity with data in the available classes.

Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on May 02,2022 at 10:01:58 UTC from IEEE Xplore. Restrictions apply.
d) Random Forest: III. CONCLUSION
Random Forest classifier consists of a number of decision The problem of campus placement prediction can be solved
trees which apply on different subsets of our dataset and the with the help of different machine learning algorithms such as
average of outputs of all the decision trees is taken to improve Logistic regression, Decision Tree, KNN & Random Forest.
the accuracy of output prediction.
Here, the Logistic Regression algorithm gave the highest
6) Evaluate results: accuracy of 95. 34 % for campus placements prediction.
Accuracy is calculated by following formula,
The selected features i.e. Gender, SSC percentage, SSC
Accuracy = (TP + TN) / (TP + FP + TN + FN) Board - Central/ Others, HSC percentage, HSC Board, HSC
Specialization, Degree Percentage, UG Degree Stream, Work
Where,
Experience, E -test Percentage, Degree Specialization & Degree
TP: True Positive (the number of cases correctly identified Percentage lead to higher classification accuracy.
as placed)
IV. FUTURE SCOPE
TN: True Negative (the number of cases correctly
identified as unplaced). Accuracy may further increase by application of more
advanced techniques such as deep learning & experimenting
FP: False Positive (the number of cases incorrectly with different activation functions of neural networks such as
identified as placed) linear, sigmoid, tan h & ReLU.
FN: False Negative (the number of cases incorrectly We can also experiment with different cross validation
identified as unplaced) techniques such as 3 Fold, 5 Fold, 10 Fold, 15 Fold cross
validation in order to analyze the change in accuracy.
TABLE I. TP, FP, FN & TN VALUES OF DIFFERENT MODELS
Model TP FP FN TN REFERENCES
Logistic Regression 16 1 1 25 [1] A. S. Sharma, S. Prince, S. Kapoor and K. Kumar, "PPS —
Decision Tree 13 3 4 23 Placement prediction system using logistic regression," 2014 IEEE
International Conference on MOOC, Innovation and Technology in
KNN 14 1 3 25 Education (MITE), 2014, pp. 337-341, doi:
Random Forest 13 2 4 24 10.1109/MITE.2014.7020299.
[2] S. Elayidom, S. M. Idikkula, J. Alexander and A. Ojha, "Applying Data
Mining Techniques for Placement Chance Prediction," 2009 International
TABLE II. CAMPUS PLACEMENT PREDICTION ACCURACY OF DIFFERENT Conference on Advances in Computing, Control, and Telecommunication
MODELS. Technologies, 2009, pp. 669-671, doi: 10.1109/ACT.2009.169.
Model Accuracy [3] J. Nagaria and S. V. S, "Utilizing Exploratory Data Analysis for the
Logistic Regression 95.34 % Prediction of Campus Placement for Educational Institutions," 2020 11th
Decision Tree 83.72 % International Conference on Computing, Communication and Networking
Technologies (ICCCNT), 2020, pp. 1-7, doi:
KNN 90.69 %
10.1109/ICCCNT49239.2020.9225441.
Random Forest 88.67 %
[4] S.Venkatachalam,“Data Mining Classification and analytical model of
prediction for Job Placements using Fuzzy Logic,” 2021 IEEE
International Conference on Trends in Electronics and Informatics
(ICOEI), 2021.
[5] Pothuganti Manvitha, Neelam Swaroopa “Campus Placement Prediction
Using Supervised Machine Learning Techniques,” 2019 International
Journal of Applied Engineering Research, pp. 2188-2191.

Fig. 12. Comparison of Campus placement prediction accuracy of different

models.

Authorized licensed use limited to: INDIAN INSTITUTE OF TECHNOLOGY BOMBAY. Downloaded on May 02,2022 at 10:01:58 UTC from IEEE Xplore. Restrictions apply.
View publication stats

Student Placement Prediction Using Machine Learnin
No ratings yet
Student Placement Prediction Using Machine Learnin
7 pages
Campus Placement
No ratings yet
Campus Placement
13 pages
Prediction of Admission in Engineering College
No ratings yet
Prediction of Admission in Engineering College
10 pages
Advance Machine Learning
No ratings yet
Advance Machine Learning
49 pages
Review 3
No ratings yet
Review 3
25 pages
Maximizing Campus Placement Through Machine Learni
No ratings yet
Maximizing Campus Placement Through Machine Learni
7 pages
Placement Analysisfor Studentsusing Machine Learning 2
No ratings yet
Placement Analysisfor Studentsusing Machine Learning 2
16 pages
Student Placement Analyzer
No ratings yet
Student Placement Analyzer
6 pages
Tracking and Predecting Students Performance With Machine Learning
0% (1)
Tracking and Predecting Students Performance With Machine Learning
47 pages
Prognostication of The Placement of Students Applying Machine Learning Algorithms
No ratings yet
Prognostication of The Placement of Students Applying Machine Learning Algorithms
5 pages
Students Placement Prediction Using Machine Learning
No ratings yet
Students Placement Prediction Using Machine Learning
6 pages
Assignment 2
No ratings yet
Assignment 2
11 pages
Apoorva CSITSS
No ratings yet
Apoorva CSITSS
5 pages
C2C - Predictive Analysis of Student Campus Placement PDF
No ratings yet
C2C - Predictive Analysis of Student Campus Placement PDF
16 pages
21ZC63 - Industrial Visit and Technical Seminar
No ratings yet
21ZC63 - Industrial Visit and Technical Seminar
15 pages
Students Placement Prediction Using Machine Learning Algorithms
No ratings yet
Students Placement Prediction Using Machine Learning Algorithms
14 pages
Campus Placements Prediction & Analysis Using Machine Learning
No ratings yet
Campus Placements Prediction & Analysis Using Machine Learning
22 pages
Studentplacement
No ratings yet
Studentplacement
10 pages
MARKETING
No ratings yet
MARKETING
5 pages
Cse 01506423&01506451
No ratings yet
Cse 01506423&01506451
15 pages
Wa0018.
No ratings yet
Wa0018.
13 pages
Iv 1
No ratings yet
Iv 1
5 pages
Fin Irjmets1711516698
No ratings yet
Fin Irjmets1711516698
5 pages
Student Placement
No ratings yet
Student Placement
14 pages
Article 3
No ratings yet
Article 3
4 pages
Data Mining Project Proposal
No ratings yet
Data Mining Project Proposal
3 pages
SEC - Accepted Student List - ZOHO
No ratings yet
SEC - Accepted Student List - ZOHO
7 pages
Comprehensive Decision-Making Guide Predicting Colleges Based On User Profile Using Ensemble ML Model
No ratings yet
Comprehensive Decision-Making Guide Predicting Colleges Based On User Profile Using Ensemble ML Model
8 pages
Student Performance Prediction
No ratings yet
Student Performance Prediction
4 pages
Prediction of Final Result and Placement of Studen
No ratings yet
Prediction of Final Result and Placement of Studen
7 pages
Batch 24 Major Project Review 1
No ratings yet
Batch 24 Major Project Review 1
22 pages
Recruitment System With Placement - R 11
No ratings yet
Recruitment System With Placement - R 11
5 pages
Comprehensive Career Placement Predictor An Analytical Tool For Optimizing Job Placement Outcomes
No ratings yet
Comprehensive Career Placement Predictor An Analytical Tool For Optimizing Job Placement Outcomes
6 pages
Student Campus Placement Prediction Analysis Using ChiSquared Test On Machine Learning Algorithms-IJRASET
No ratings yet
Student Campus Placement Prediction Analysis Using ChiSquared Test On Machine Learning Algorithms-IJRASET
10 pages
Final DM
No ratings yet
Final DM
7 pages
Name: Abhinandita Banerjee REG NO:20BCE2080 Theory Digital Assignment Data Visualization
No ratings yet
Name: Abhinandita Banerjee REG NO:20BCE2080 Theory Digital Assignment Data Visualization
6 pages
College Predictor - Thesis
No ratings yet
College Predictor - Thesis
37 pages
Placment Predection Using Machine Learning
No ratings yet
Placment Predection Using Machine Learning
9 pages
Intern ReportFSDFSDF
No ratings yet
Intern ReportFSDFSDF
18 pages
R4 - Placement Prediction
No ratings yet
R4 - Placement Prediction
9 pages
Student Placement Prediction
No ratings yet
Student Placement Prediction
4 pages
Students Placement Prediction System
No ratings yet
Students Placement Prediction System
5 pages
R - 12 - An Advanced Machine Learning Approach For Student Placement Prediction and Analysis
No ratings yet
R - 12 - An Advanced Machine Learning Approach For Student Placement Prediction and Analysis
11 pages
Abstracts DS
No ratings yet
Abstracts DS
7 pages
Placement
No ratings yet
Placement
5 pages
Irjet V10i395
No ratings yet
Irjet V10i395
4 pages
12 IV April 2024
No ratings yet
12 IV April 2024
8 pages
Anticipating College Admissions: An Algorithmic Approach
No ratings yet
Anticipating College Admissions: An Algorithmic Approach
8 pages
13 - Construct Food Safety Traceability System For People's Health Under The Internet of Things and Big Data
No ratings yet
13 - Construct Food Safety Traceability System For People's Health Under The Internet of Things and Big Data
89 pages
A Minor Project Report On DMT
No ratings yet
A Minor Project Report On DMT
11 pages
Predicting The Admissions of Students in Masters Program Using Machine Learning
No ratings yet
Predicting The Admissions of Students in Masters Program Using Machine Learning
16 pages
E10380585S19
No ratings yet
E10380585S19
6 pages
TNP Portal Using Web Development and Machine Learning
No ratings yet
TNP Portal Using Web Development and Machine Learning
9 pages
Placement Prediction Using Various Machine Learning Models and Their Efficiency Comparison
No ratings yet
Placement Prediction Using Various Machine Learning Models and Their Efficiency Comparison
5 pages
Deep Learning Based Campus Placement Prediction
No ratings yet
Deep Learning Based Campus Placement Prediction
19 pages
Educational Data Mining For Student Placement Prediction Using Machine Learning Algorithms - Sreenivasa Rao - International Journal of Engineering & Technology
No ratings yet
Educational Data Mining For Student Placement Prediction Using Machine Learning Algorithms - Sreenivasa Rao - International Journal of Engineering & Technology
4 pages
University Admission
No ratings yet
University Admission
17 pages
Educational Data Mining For Student Placement Prediction Using Machine Learning Algorithms
No ratings yet
Educational Data Mining For Student Placement Prediction Using Machine Learning Algorithms
4 pages
Expert System For Student Placement Prediction
No ratings yet
Expert System For Student Placement Prediction
5 pages
3 - Image Forgery Detection Based On Fussion of Light Weight Deep Learning Models
No ratings yet
3 - Image Forgery Detection Based On Fussion of Light Weight Deep Learning Models
78 pages
27 - Optimize The Storage Volume Using Data Mining Techniques
No ratings yet
27 - Optimize The Storage Volume Using Data Mining Techniques
71 pages
8 - Asymmetric Hash Code Learning For Remote Sensing Image Retreval
No ratings yet
8 - Asymmetric Hash Code Learning For Remote Sensing Image Retreval
63 pages
Campus Placement Analyzer: Using Supervised Machine Learning Algorithms
No ratings yet
Campus Placement Analyzer: Using Supervised Machine Learning Algorithms
5 pages
6 - Comparative Analysis of Liver Dieases by Using Machine Learning Techniques
No ratings yet
6 - Comparative Analysis of Liver Dieases by Using Machine Learning Techniques
67 pages
Cyber Security For Beginners PDF
No ratings yet
Cyber Security For Beginners PDF
28 pages

IEEE

Uploaded by

IEEE

Uploaded by

See discussions, stats, and author profiles for this publication at: https://www.researchgate.

Campus Placements Prediction & Analysis using Machine Learning

Conference Paper · March 2022

The user has requested enhancement of the downloaded file.

Campus Placements Prediction & Analysis using

978-1-6654-0073-2/22/$31.00 ©2022 IEEE 1

Fig. 3. Placement count vs. gender

From the above graph it can be concluded that the count of

Fig. 1. Architecture Diagram Fig. 4. 10th standard percentage distribution

Fig. 7. Placement count vs. 12th percentage

Fig. 6. 12th standard percentage distribution

Fig. 12. Comparison of Campus placement prediction accuracy of different

You might also like