0% found this document useful (0 votes)
30 views6 pages

Research Paper

This document proposes a new methodology for using machine learning and natural language processing to predict personality traits from job applicants' resumes and CVs. The methodology involves collecting a large dataset of resumes/CVs, pre-processing the data, extracting relevant features, identifying personality traits, training a machine learning model like KNN using the data, and then using the trained model to predict personality traits of new applicants based on their resume/CV text. The goal is to help hiring managers better understand candidates' personalities to make more informed hiring decisions.

Uploaded by

Sagar Nathani
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
0% found this document useful (0 votes)
30 views6 pages

Research Paper

This document proposes a new methodology for using machine learning and natural language processing to predict personality traits from job applicants' resumes and CVs. The methodology involves collecting a large dataset of resumes/CVs, pre-processing the data, extracting relevant features, identifying personality traits, training a machine learning model like KNN using the data, and then using the trained model to predict personality traits of new applicants based on their resume/CV text. The goal is to help hiring managers better understand candidates' personalities to make more informed hiring decisions.

Uploaded by

Sagar Nathani
Copyright
© © All Rights Reserved
We take content rights seriously. If you suspect this is your content, claim it here.
Available Formats
Download as PDF, TXT or read online on Scribd
You are on page 1/ 6

Personality Prediction Using

CV Analysis

Sagar Nathani1 Pratham Sharma2 Shaifali Sharma3


Computer Science Engineering Computer Science Engineering Computer Science Engineering
Chandigarh University Chandigarh University Chandigarh University
Mohali, Punjab, India Mohali, Punjab, India Mohali, Punjab, Isdia
[email protected] [email protected] [email protected]

Abstract—Personality prediction using CV One of the major motivations behind personality


analysis is a fascinating area of research that prediction using CV analysis is the need for
involves the use of machine learning and organizations to hire employees that not only
natural language processing techniques to have the necessary skills and experience but also
extract and analyze data from job applicants' align with the company culture and values.
resumes. This approach allows for the Personality traits such as emotional stability,
development of predictive models that can conscientiousness, openness, and agreeableness
accurately infer personality traits and are considered essential for determining the fit
characteristics from textual information, such of a candidate for a particular role.
as a candidate's work experience, education,
and skills. By leveraging these models, The process of personality prediction using CV
employers and recruiters can gain valuable analysis involves extracting relevant information
insights into job candidates' personalities, from the job applicant's resume or CV and using
enabling them to make more informed hiring that information to train a predictive model.
decisions that align with their company culture
and values. Overall, personality prediction The extracted information includes the
using CV analysis has the potential to applicant's educational background, work
revolutionize the hiring process and experience, certifications, and skills. This
significantly improve the quality of new hires. information is then subjected to natural language
processing techniques, which extract semantic
Keywords: Personality Prediction,Logistic meaning from the text.
Regression, CV.
Machine learning algorithms are then applied to
I. INTRODUCTION the processed information to identify patterns
and trends that are indicative of specific
Personality prediction using CV analysis is a
personality traits. These algorithms learn from
fascinating and rapidly evolving field that
the data and use it to develop predictive models
combines the principles of psychology, machine
that can accurately predict personality traits
learning, and natural language processing to
based on the information contained in the job
analyze the text of job applicants' resumes or
applicant's CV
curriculum vitae (CV) to infer their personality
traits.
II. LITERATURE SURVEY Despite these promising results, there are also
some challenges associated with using CV
In recent years, there has been a growing interest analysis to predict personality traits. One of the
in using computer vision (CV) analysis to main challenges is the potential for bias in the
predict personality traits from CVs. This data. For example, if certain groups of people
approach is based on the idea that an individual's are underrepresented in the data, this could lead
CV can provide insights into their personality, as to inaccurate predictions or reinforce existing
it typically contains information about their biases.
education, work experience, and other personal
characteristics. To address this issue, several studies have
proposed using more diverse datasets and
One of the most popular personality models used developing algorithms that are less susceptible
in CV analysis is the Big Five model, which is to bias. For example, in a study by Li et al.
based on five broad dimensions of personality: (2021), the authors used a dataset of job
extraversion, agreeableness, conscientiousness, candidates from a variety of industries to predict
neuroticism, and openness to experience. personality traits from resumes. They found that
Several studies have used CV analysis to predict their approach was able to predict these traits
these traits, with promising results. with high accuracy, even when the dataset was
more diverse.
For example, in a study by Kucherbaev et al.
(2021), machine learning algorithms were used III. METHODOLOGY
to predict the Big Five traits from LinkedIn
profiles. The authors found that it is possible to Personality prediction using CV analysis is an
predict these traits with high accuracy using CV emerging field that has gained significant
analysis. They also found that certain features of attention in recent years. It involves using
a LinkedIn profile, such as the number of machine learning and natural language
connections and the type of content posted, were processing techniques to analyze the language
particularly predictive of certain personality used in job candidate's CVs to predict their
traits. personality traits. Here is a new methodology on
Personality Prediction Using CV Analysis:
Similarly, in a study by Jain et al. (2021), the
authors used computer vision and natural 1. Data Collection: Collect a large dataset of
language processing techniques to predict the CVs from job candidates, including
Big Five traits from resumes. They found that information such as education, work
their approach was able to predict these traits experience, skills, and personal interests.
with high accuracy, and that certain features of a 2. Pre-processing: Pre-processing is an
resume, such as the length of employment essential step in preparing the data for
history and the use of certain keywords, were analysis. This includes removing irrelevant
particularly predictive of certain personality information, such as contact details and
traits. irrelevant experience, and standardizing the
text format to ensure consistency.
Other studies have used CV analysis to predict 3. Feature Extraction: The next step is to
specific personality traits, such as creativity. For extract features from the pre-processed data.
example, in a study by Wang et al. (2021), the This involves identifying key words and
authors used CV analysis to predict creative phrases that are relevant to personality traits.
potential from portraits. They found that certain For example, words such as "team player"
facial features, such as the distance between the and "collaboration" might be indicative of a
eyes and the width of the nose, were predictive candidate who is extroverted and sociable.
of creative potential. 4. Personality Trait Identification: Once
features have been extracted, the next step is
to identify which personality traits they are like skills and experience, but to understand
associated with. There are several personality it is difficult to guess from CV alone.
established personality models that can be
used for this, such as the Big Five model or In this model we are proposing using a Machine
the Myers-Briggs Type Indicator. learning model to let the hiring process for the
5. Model Training: After identifying relevant personality prediction and resume and cv
features and personality traits, the next step analysis. The model we are using is the KNN
is to train a machine learning model to algorithm.
predict personality traits based on the CV
data. This involves using the pre-processed The KNN (K-Nearest Neighbors) algorithm can
data to train the model and then evaluating be used for personality prediction by following
its accuracy. these steps:
6. Personality Prediction: The final step is to
use the trained model to predict the 1. Collect a dataset of individuals with known
personality traits of new job candidates personality traits. The dataset should include
based on their CVs.The predictions can be a set of features that are believed to be related
used to inform hiring decisions and identify to personality, such as demographics,
candidates who are the best fit for a interests, hobbies, and behaviors.
particular role. 2. Preprocess the dataset by cleaning the data,
handling missing values, and normalizing the
features.
3. Split the dataset into training and testing sets.
IV. EXISTING SYSTEM 4. Choose a value for K, which is the number of
neighbors that will be used to predict the
Traditional systems for personality prediction personality of a new individual.
using CV analysis typically involve a manual 5. To predict the personality of a new
process of analyzing a job candidate's resume or individual, the algorithm calculates the
CV to determine their personality traits. This distance between the new individual and each
process is often carried out by a human resource individual in the training set using a distance
manager or recruiter, and may involve subjective metric such as Euclidean distance. The K
judgments based on the candidate's education, nearest neighbors to the new individual are
work experience, and other factors. selected.
6. The personality trait of the new individual is
predicted based on the most common
The traditional approach to personality
personality trait among its K nearest
prediction using CV analysis typically involves
neighbors. This can be done using a majority
Data collection,Data preprocessing,Personality
vote or a weighted vote, where the closer
assessment(Making subjective assessments of
neighbors have a higher weight.
the candidate's personality traits based on the
7. Evaluate the performance of the algorithm by
information gathered from their CV) and
measuring the accuracy of the predictions on
Validation
the testing set.
8. Repeat the following steps for choosing the K
V. PROPOSED SYSTEM value to evaluate the performance of the
algorithm with different values of K to find
There are many existing systems that attempt to the best value for K which helps in yielding
use machine learning algorithms to predict better results and for high efficiency
behavior based on analysis of personal data. The
current system is manual classification of
resumes. Employers manually review each
resume to find the right candidate for each job.
With traditional methods one can learn things
VI. RESULT
When a candidate wants to apply for a job, he questions.The CV or CV is printed by the model
has to take the exam. A window will open where and the personality of the candidate is predicted
the candidate has to fill in the details and submit based on the ocean model seen as the output.
the CV . Candidates write down details, submit a The results obtained are as follows:
resume or résumé, and answer personal

Fig 3 :Result of details

Here is the detailed result of the candidate in the above window and the predicted
showing the personality predicted by the model personality is ‘Responsible’.
based on the information filled by the candidate

Fig 4: Final result of personality prediction


assumptions about the underlying distribution of
the data and it stores the training data rather than
building a model.
VII. CONCLUSION
In this project, we determine the personality of The KNN algorithm works by finding the k
different people based on a test called the BIG closest data points in the training set to a given
FIVE TESTfor personality analysis and KNN test data point and using the most common class
algorithm for training the model. or the average value of those k neighbors to
make a prediction for that test data point. The
The Big Five test, also known as the Five Factor value of k is a hyperparameter that needs to be
Model, is a widely used model in psychology for tuned based on the data and the problem at hand.
assessing personality traits. It is based on the
idea that human personality can be described The KNN algorithm is simple to implement and
using five broad dimensions: works well on small datasets. It also doesn't
require any assumptions about the underlying
1. Openness to experience: This trait measures data distribution, which makes it a useful tool
a person's willingness to try new things and for exploratory data analysis. However, it can be
be open to different ideas and experiences. computationally expensive and may not work
2. Conscientiousness: This trait reflects a well on high-dimensional data or when the
person's level of organization, dependability, training set is large.
and responsibility.
3. Extraversion: This trait measures a person's Overall, the KNN algorithm is a powerful tool in
tendency to be outgoing, assertive, and the machine learning toolkit that can be used for
sociable. a wide range of applications, from image
4. Agreeableness: This trait reflects a person's classification to predicting customer behavior.
tendency to be cooperative, empathetic, and Its simplicity and flexibility make it a valuable
compassionate towards others. addition to any data scientist's toolbox
5. Neuroticism: This trait measures a person's
emotional stability, resilience, and ability to VIII. REFERENCES
handle stress and anxiety.
[1] PARMAR, J., PEREIRA, A., PEREIRA, S.,
The Big Five test is often used in both academic & Guide, S. K. (2018). Personality Prediction
research and in applied settings, such as career through CV Analysis (HR Helper). ST.
counseling, hiring, and team building. It is FRANCIS INSTITUTE OF TECHNOLOGY.
considered a reliable and valid measure of
personality and has been shown to predict a [2] Allan Robey, Kaushik Shukla, Kashish
wide range of behaviors, including job Agarwal, Keval Joshi, Personality Prediction
performance, job satisfaction, and leadership System through CV Analysis
effectiveness. The Big Five test can be
administered through self-report questionnaires [3] Jayashree, R., Sudhir, B., Pooja, Y., &
or through computer-based assessments, and Nirmiti, P. (2019). Personality evaluation and
scores are interpreted based on a standardized CV analysis using machine learning algorithm.
scoring system Int. J. Comput. Sci. Eng, 7, 1852-1857.

And we are using KNN for training the [4] Kulkarni, A., Shankarwar, T., & Thorat, S.
model,The k-nearest neighbors (KNN) algorithm (2021). Personality Prediction Via CV Analysis
is a machine learning algorithm used for both
using Machine Learning. (pp. 544-547).
classification and regression tasks. It is a
non-parametric and instance-based learning Mumbai: IJERT.
method, meaning that it doesn't make any
[5] VVCET – CSE (2019). Personality
Prediction System Through cv Analysis

[6] Monalisa, A. K., & Mahin, M. K. (2021).


PERSONALlTY PREDICTION SYSTEM
THROUGH CV ANALYSIS. Dhaka: daffodil
international university.

[7] Kaur, G., & Maheshwari, S. (2019).


Personality Prediction through Curriculum Vitae
Analysis involving Password Encryption and
Prediction Analysis.

[8] Pragya Sanjay Chauhan, Aishwarya Popat


Bondre, Prathamesh Goraksha Waphare, Sachin
Vaidya, Personality Evaluation and CV Analysis
Using Machine Learning Algorithms.

[9] kallar, H., Vaidyanathan, P., thirumalainambi,


P., Sudaroli, S., & Lohiya, S. (2019). Personality
prediction using CV analysis. (pp. 417-418).
mumbai: IJ Publication.

You might also like