Approach Based On Personality Traits
Approach Based On Personality Traits
Article
A Data-Driven Approach to Team Formation in Software
Engineering Based on Personality Traits
Jan Vasiljević and Dejan Lavbič *
Faculty of Computer and Information Science, University of Ljubljana, 1000 Ljubljana, Slovenia;
[email protected]
* Correspondence: [email protected]
Abstract: Collaboration among individuals with diverse skills and personalities is crucial to pro-
ducing high-quality software. The success of any software project depends on the team’s cohesive
functionality and mutual complementation. This study introduces a data-centric methodology for
forming Software Engineering (SE) teams centred around personality traits. Our study analysed data
from an SE course where 157 students in 31 teams worked through four project phases and were evalu-
ated based on deliverables and instructor feedback. Using the Five-Factor Model (FFM) and a variety
of statistical tests, we determined that teams with higher levels of extraversion and conscientiousness,
and lower neuroticism, consistently performed better. We examined team members’ interactions and
developed a predictive model using extreme gradient boosting. The model achieved a 74% accuracy
rate in predicting inter-member satisfaction rankings. Through graphical explainability, the model
underscored incompatibilities among members, notably those with differing levels of extraversion.
Based on our findings, we introduce a team formation algorithm using Simulated Annealing (SA)
built upon the insights derived from our predictive model and additional heuristics.
1. Introduction
Citation: Vasiljević, J.; Lavbič, D. A
Data-Driven Approach to Team Team formation (TF) is critical in many domains, including business, sports, and
Formation in Software Engineering academia [1,2]. In Software Engineering (SE), however, TF assumes a unique and pivotal
Based on Personality Traits. Electronics role. SE, an engineering discipline encompassing all aspects of software production [3],
2024, 13, 178. https://doi.org/ is deeply rooted in collaboration. The complexity of software projects often necessitates
10.3390/electronics13010178 a team of engineers with diverse specializations, making these teams’ efficiency, commu-
nication, and synergy crucial for project success. This study aims to bridge the gap in TF
Academic Editors: Mohamed Wiem
Mkaouer and Chunping Li
by integrating psychological metrics with machine learning techniques to optimize team
composition in SE.
Received: 30 November 2023 Historically, TF has relied heavily on empirical metrics, but recent trends have shifted to-
Revised: 22 December 2023 wards incorporating psychological aspects to enhance team dynamics and performance [4].
Accepted: 28 December 2023 The dynamics of how team members interact and engage with each other are crucial in SE.
Published: 30 December 2023
Research has consistently shown a strong correlation between positive team dynamics and
the success of high-performing teams [5–8]. However, integrating psychological traits into
automated TF systems, especially in SE, requires further exploration.
Copyright: © 2023 by the authors.
This gap is particularly evident when considering the subtler yet influential fac-
Licensee MDPI, Basel, Switzerland. tor of team members’ psychological and personality traits. These traits can profoundly
This article is an open access article impact how individuals approach problems, interact with colleagues, and respond to
distributed under the terms and stress or success. There has been a growing interest in using personality frameworks,
conditions of the Creative Commons such as the Big Five Personality Traits, to build technically proficient and psychologically
Attribution (CC BY) license (https:// compatible teams.
creativecommons.org/licenses/by/ The following section reviews the existing literature to identify gaps and limitations
4.0/). and inform our proposed solution. Subsequent sections outline our methodologies, describe
the data collection process, detail the models used, and provide an initial overview of the
proposed algorithm. This is followed by a presentation of our results, encompassing an
overview of the collected data, the trained model, and the newly developed TF algorithm.
The paper concludes with a discussion of our findings, summarizing our contributions,
acknowledging limitations, and suggesting guidelines for future research.
2. Related Work
The interplay between individual personality traits and team performance, particularly
in collaborative and skill-intensive environments, has been widely studied. A hierarchical
model developed by [9] categorizes key personality facets vital for team performance,
offering a detailed understanding of how broad personality traits correlate with specific
team requirements. This model utilizes specific facets of higher-level dimensions, such
as adjustment, flexibility, and dependability, to predict team adaptability, interpersonal
cohesion, and decision making.
A study by [10] investigated the impact of personality traits on the performance
of product design teams comprising undergraduate engineering students. The findings
indicated a significant positive correlation among conscientiousness, openness, and team
performance. In the study, groups of three students were tasked with building a bridge
using limited resources, emphasizing the profound influence of personality traits over
other factors, including cognitive ability and demographic diversity.
Research by [11] delved into the predictive capacity of conscientiousness facets within
engineering student project teams over a 6.5-month-long task. The primary conclusion was
that conscientiousness effectively forecasted team performance. However, it was noted that
other traits, such as agreeableness, extraversion, and neuroticism, did not have a significant
predictive impact on team performance.
In the realm of SE, ref. [12] assessed the impact of personality traits on SE team effec-
tiveness using the Myers–Briggs Type Indicator (MBTI). They highlighted the significant
role of personality clashes in software project failures and pointed out a gender-based
variance in MBTI traits among programmers. Suggestions for optimal trait balances for
male and female team members were presented.
The research study [13] undertook a systematic literature review from 1970 to 2010
centring on individual personalities in SE. The review offers an extensive overview of the
field’s current understanding, notably highlighting the diversity of results.
The study [14] provided direct evidence of the impact of specific personality traits
in an SE context, emphasizing the significance of extraversion in promoting effective
team dynamics. Additionally, the study revealed that openness to experience positively
correlates with team performance, not team climate. However, the limited sample size of
respondents might limit the generalizability of these findings.
In their research, ref. [15] mapped the job requirements of various SE roles to the Big
Five Personality Traits, suggesting specific personality traits beneficial for different SE roles.
This mapping includes the need for extraversion and agreeableness in system analysts;
openness and conscientiousness in software testers; and a combination of extraversion,
openness, and agreeableness in programmers.
The authors in [16] extended the application of personality traits in team dynamics
to an educational perspective, focusing on team dynamics within the context of student
programming projects. This study addresses the gap between academic projects and real-
world software development, offering insights into managing and facilitating student
projects that closely mimic professional environments.
In reviewing the foundational work of [9], we see a detailed model of personality
facets relevant to team performance. Although theoretically robust, the model does not
offer a direct pathway for practical application in SE team assembly.
Similarly, the study by [12] proposed guidelines for balancing teams based on members’
traits and gender but did not provide a practical framework for applying these findings.
Electronics 2024, 13, 178 3 of 17
3. Proposed Solution
In our work, we address the gap in TF by introducing a data-driven approach tai-
lored for SE. Recognizing the limitations in practical applications highlighted by previous
studies, our approach builds upon their theoretical foundations. We focus on develop-
ing practical solutions that are informed by methodological advancements. Our key
contributions include the following:
• Data collection in a controlled SE environment: Our research involved collecting
data on key aspects, such as personality traits, team performance, and inter-member
satisfaction. The collection was carried out in a controlled SE environment, which
helped ensure that our data were accurate and relevant to typical SE settings.
• Predictive model for inter-member satisfaction: We developed a predictive model
with our collected data. This model is aimed at understanding and forecasting inter-
member satisfaction in SE teams. It considers various factors, such as personal-
ity traits and work contributions, providing valuable insights into team dynamics
and cohesion.
• Team formation algorithm: Our algorithm integrates the predictive model directly,
utilizing its forecasts and various heuristics to aid in forming effective SE teams.
Designed with practical implementation in mind, it provides a systematic method for
real-world team assembly. The algorithm functions by accepting participants with
associated personality traits as input and outputting strategically formed teams.
To conduct our research, we relied on established methods. Figure 1 visually summa-
rizes our approach, illustrating these methods’ integration in addition to data collection
and a high-level overview of the proposed TF algorithm. Section 4.1 further describes the
process of data acquisition. We used the Five-Factor Model (FFM) (see details in Section 4.2)
to assess personality traits, the Extreme Gradient Boosting algorithm (XGBoost) (see details
in Section 4.3) for developing our predictive model, and Simulated Annealing (SA) (see
details in Section 4.4) for our TF algorithm (see details in Section 4.5). Python was our
primary tool for data analysis.
Electronics 2024, 13, 178 4 of 17
O random split T
N 1
each into 31 teams of
completed 5-6 individuals
T2
Start project
A C stage evaluation
…
157 for a given stage
participants T Team and Teaching assistant
E 31
member self deliverables
FFM psychometric self-
assessment questionnaire
evaluation evaluation
Project stage
evaluation
Data Aquasition
completed
Start Team
Exploratory Data Formation
Analysis
Start data Metropolis
analysis criterion met?
Statistical
tests Random assignment of Simulated annealing
Team Formation
Data Analysis
T1 … T1
Visualizations …
…
…
Classifier
constructed … Tn
… Tn
Training a Team
binary classifier Formation
complete
maximum possible score for that set, max_score( Q). Mathematically, this is represented as
∑q∈Q R(q)× I (q)
S= max_score( Q)
.
Despite some criticism, such as not fully accounting for all variances in human person-
ality [18] or the lack of complete independence among variables [19], the FFM is a reliable
and valid model for measuring personality traits in SE domains [20].
ARG 1 ARG 2
k balanced
O C E A N
PROC 1
k team sizes simulated teams optimized scoring
annealing function between i-th
p1
and j-th team
4 × 10 +
n participants
=n
PROC 2
p2 5 × 12 satisfaction i × 10
maximization
…
ARG 3
j × 12
objective
PROC 3
pn objective function
function maximization high satisfaction
weights
between members
5. Results
5.1. Data Overview
5.1.1. Phase Results
After each project phase, students were evaluated and graded by TAs. The grading
criteria were based on quantifiable aspects, such as the volume of work completed, project
deliverables, and the overall quality of the work. Rather than team grades, individual
grades were assigned on a scale from 0 to 100. These grades were communicated to each
student during a meeting with the TA and their respective team. The distribution of these
grades across different phases is illustrated in the histograms shown in Figure 3.
40 40 40 40
30 30 30 30
Count
Count
Count
Count
20 20 20 20
10 10 10 10
0 0 0 0
0 25 50 75 100 0 25 50 75 100 0 25 50 75 100 0 25 50 75 100
Points Points Points Points
The average grades for the first two phases were relatively consistent, with µ1 = 77.7 for
the first phase and µ2 = 77.8 for the second. However, the third phase proved more
challenging, reflected by a drop in the average grade to µ3 = 73.0. The average grade
increased again in the fourth phase to µ4 = 84.0. This pattern aligns with student feedback,
which indicated that the third and fourth phases were the most difficult. The improved
performance in the fourth phase can be attributed to its focus on project implementation, a
task with which the students were the most familiar.
Table 2. Correlation matrix of FFM dimensions among students. Symbols denote significance levels:
* p < 0.05, ** p < 0.01, *** p < 0.001.
O C E A N
O (openness) -
C (conscientiousness) 0.18 * -
E (extraversion) 0.20 * 0.23 ** -
A (agreeableness) 0.18 * 0.00 −0.01 -
N (neuroticism) −0.07 −0.33 *** −0.17 * −0.08 -
10
11
12
13
14
15 End of phase 1
End of phase 2
End of phase 3
End of phase 4
Group
16
17
18
19
20
21
22
23
24
25
26
27
28
30
31
32
Figure 4. Stripplot of filtered-out commits of various groups. The vertical dashed orange lines
represent the deadlines for individual phases.
Team performance
0.8
Low
High
0.7
Mean group FFM score
0.6
0.5
0.4
0.3
0.2
Openness Conscientiousness Extraversion Agreeableness Neuroticism
FFM dimension
Before assessing the significance of our findings, we checked if the conditions for a
parametric t-test were met. Levene’s test confirmed that the variances between the two
groups were equal. However, the Shapiro–Wilk test revealed that only the extraversion and
conscientiousness dimensions were normally distributed. Considering these points, we
opted for the non-parametric Mann–Whitney U test to examine the mean differences across
the dimensions. The test highlighted significant differences between high-performing
and low-performing teams in the following dimensions: conscientiousness (p = 0.005),
extraversion (p = 0.0001), and neuroticism (p = 0.004).
90 90 90 90
Average mark
Average mark
Average mark
Average mark
80 80 80 80
70 70 70 70
60 60 60 60
50 50 50 50
r = 0.03 r = 0.39 r = −0.09 r = 0.32
40 40 40 40
0 1 2 0 1 2 0 1 2 0 1 2
Satisfaction Satisfaction Satisfaction Satisfaction
Figure 6. Regression plot of team satisfaction versus team performance for each project phase.
While R1 correlates with performance, its suitability for our TF method is questionable
due to potential data skewness, defined as
3
n xi − x̄
Skewness (g1) =
(n − 1)(n − 2) ∑ s
(2)
Our dataset’s skewness was −1.22, indicating strong negative skewness. The Intraclass
Correlation Coefficient (ICC) [27] further revealed inconsistencies in the data, possibly due
to biased evaluations by students concerned about peer grades.
Consequently, we use the ranked work contribution score (R2 ) for further analysis.
ICC(3,k) tests on R2 showed adequate consistency, with 61 out of 108 groups (56.48%)
achieving an ICC over 0.5, and 72 groups (66.67%), one exceeding 0.4. These thresholds
were based on varied interpretations of the ICC value [28,29]. The chosen threshold is
deemed good by both standards. This indicates a general agreement on the top contributors,
though not unanimous.
A hypothesis for subsequent investigation is that a significant portion of these rankings
could be explained by tangible metrics, like commit counts and lines of code, with the
remaining variance potentially being linked to distinct personality traits within the teams.
Work Contribution
To effectively measure individual contributions during different project phases, we
introduce a metric denoted as work_ration , with n representing the project phase. This
metric is derived from Git data, following a specific preprocessing approach.
The calculation of work_ration involves several steps:
1. The number of lines each member adds (m) is normalized by taking the square root.
This adjustment favours smaller, frequent commits instead of larger, sporadic ones.
Lines that have been removed are not considered, as they do not reliably indicate
work contribution.
2. The normalized line additions for each member (m) are summed up.
3. The total normalized line additions for the team (t) are calculated.
1
4. The expected work ratio (r) per student is determined as r = len(team) . For example, in
a five-member team, the expected ratio is 0.2.
5. The individual contribution proportion (p) is computed as p = mt .
p
6. Finally, the work ratio for phase n is calculated as work_ration = r .
The work_ration metric provides insights into a student’s relative contribution:
• A value above 1 indicates contributions exceeding expectations.
• A value below 1 suggests contributions falling short of expectations.
• A value around 1 implies contributions meeting expectations.
The relationship between work contribution and ranking is examined using regression
plots (Figure 7). It is important to interpret these findings correctly:
• A lower work contribution suggests that the student contributed more than expected.
• A higher work contribution indicates that the student contributed less than expected.
Electronics 2024, 13, 178 11 of 17
6 6 6 6
r = −0.44 r = −0.39 r = −0.38 r = −0.60
5 5 5 5
Work Contribution
Work Contribution
Work Contribution
Work Contribution
4 4 4 4
3 3 3 3
2 2 2 2
1 1 1 1
0 0 0 0
0 1 2 3 4 0 1 2 3 4 0 1 2 3 4 0 1 2 3 4
work ratio work ratio work ratio work ratio
Figure 7. Regression plots of member work contribution ranking versus calculated member work
(work_ratio) for each project phase.
The correlation coefficients for each project phase are as follows: r1 = −0.44,
r2 = −0.39, r3 = −0.38, and r4 = −0.60. These values indicate a moderate negative
correlation between work contribution and ranking. However, the correlation is not strong
enough to fully explain the variance in rankings. The following section will explain the
remaining variance using gradient boosting.
High
Amount of work T
Conscientiousness T
Extraversion T
Openness T
Feature value
Neuroticism T
Agreeableness T
Openness R
Conscientiousness R
Agreeableness R
Sum of 2 other features
Low
−4 −2 0 2 4
SHAP value (impact on model output)
Figure 8. SHAP beeswarm plot of the XGBoost model displaying feature importance for Rater (R)
and Target (T) variables.
Figure 9. Waterfall plots of SHAP values for the XGBoost model, illustrating the influence of
individual features on the prediction. (a) Represents a positive classification. (b) Represents a
negative classification.
Extraversion R and Extraversion T
0.75
1.0
SHAP interaction value for
0.70
0.5 0.65
Extraversion T
0.60
0.0 0.55
0.50
−0.5
0.45
−1.0 0.40
0.35
0.2 0.4 0.6 0.8
Extraversion R
Figure 10. SHAP dependence plot of Raters’ (R) and Targets’ (T) extraversion. The dashed lines
represent a linear fit applied to the most prominent data points from opposite extremes.
• Ensure teams exhibit high levels of conscientiousness (Qc ) and extraversion (Qe ).
• Aim for low levels of neuroticism (Qn ) within teams.
• Maintain minimal standard deviation across teams for each dimension: Qc , Qe ,
and Qn .
• Incorporate at least one student with high conscientiousness (studentmax (c) ) in each
team, as suggested by [24]. This approach triggers a beneficial “conscientiousness
shock,” improving team dynamics.
The Simulated Annealing algorithm was implemented with the following parameters:
1. Initial solution (Sinit ): Teams are formed randomly considering team size distributions,
with the constraint that each team includes a student with high conscientiousness.
This is achieved by selecting the top N students, where N equals the number of teams.
These students are given locked status to prevent team swaps.
2. Generation of new solution (S′ ): A candidate solution is formed by exchanging two
students between different teams while ensuring that students with locked status
remain fixed.
3. Objective function (E(S)): The function E(S) = winter × QTinter + wc × Qc + we × Qe +
wn × NQn integrates various dimensions, such as interpersonal satisfaction (QTinter ),
conscientiousness (Qc ), extraversion (Qe ), and neuroticism (NQn ). Weights winter , wc ,
we , and wn allow for the adjustment of each dimension’s influence.
Inter-member satisfaction (QTinter ): For a team Tk , the average satisfaction (µ Tk ) is
calculated using the binary classifier’s predictions (p( f i,j )) for each student pair.
The global average and standard deviation across all teams are computed to derive
QEinter = µ − σ.
Scores for Q and NQ: For each dimension, the team average (µ Tk ) and global averages
(µ and σ) are computed. The resulting scores are defined as Q = µ − σ for Qc and Qe ,
and NQ = (1 − µ) − σ for Qn .
4. Stopping condition: The algorithm stops after Istop = 710N − 1740 iterations, where
N is the number of teams. This formula is based on a linear regression model, with a
minimum threshold of 6000 iterations for N < 10.
We evaluated the proposed algorithm by comparing it with other methodologies and
dimensions, as detailed below:
1. Shuffled: Teams were formed randomly from a predefined set of students.
2. Course: This method represents the teams in their original formation during the
course. Although this data point might appear similar to the Shuffled method, it was
included separately, as the students’ choices could have influenced team compositions.
3. Sorted by conscientiousness: Students were ranked according to their conscientious-
ness scores and then assigned to teams in a round-robin fashion. This method was
employed to compare the SA algorithm’s performance with a basic heuristic.
Figure 11 illustrates the operation of the SA algorithm and its comparison with the
aforementioned methodologies. The first scenario (left) displays the application of the
algorithms on the original dataset, whereas the second scenario (right) applies them to
a synthetic dataset. This synthetic dataset was generated by calculating the mean and
standard deviation for each dimension of the original dataset and then creating a new
dataset with an equivalent number of students and teams. Random values from a normal
distribution, based on the mean and standard deviation of the original data, were used
to populate this new dataset. This approach aimed to test the algorithm’s robustness and
confirm that the peculiarities of the original dataset did not bias the results. In the SA
algorithm, the weights applied were wc = 2, we = 1, wc = 1, and winter = 0.75. These were
chosen to underscore the importance of the conscientiousness dimension while ensuring
balanced representation across other dimensions.
Electronics 2024, 13, 178 14 of 17
QC QC
QE QE
Dimension
Dimension
NQ N NQ N
Qintra Qintra
0.0 0.2 0.4 0.6 0.8 1.0 0.0 0.2 0.4 0.6 0.8 1.0
Objective value Objective value
Figure 11. Distribution of scores before and after SA algorithm using 10,000 iterations, with weights
of the objective function wc = 2, we = 1, wc = 1, and winter = 0.75
In the analysis of the original data, the Sorted by conscientiousness method yielded the
lowest score, with E(s) = 1.66, followed by the Shuffled and Course methods, each scoring
E(s) = 1.70. In contrast, the SA algorithm consistently outperformed these approaches,
achieving an average score of E(s) = 2.14. Notably, the conscientiousness dimension,
evaluated separately, matched the performance of the third method, with E(s)c = 0.67.
This indicates that the SA algorithm effectively sorts students based on conscientiousness
and maintains a balanced distribution across the other dimensions.
To contextualize the computational efficiency of our algorithm, one can consider
the total number of unique team configurations that would be examined in an exhaus-
tive combinatorial search. Utilizing the formula (number of students
number of teams ), an exhaustive evalu-
ation for selecting just one team of 5 from a pool of 150 participants would necessitate
investigating 591,600,030 distinct combinations. In stark contrast, the SA algorithm sub-
stantially mitigates this computational demand. Specifically, it required searching only
710 × 30 − 1740 = 19,560 configurations to achieve the results presented in Figure 11.
6. Discussion
Our research focused on the Five-Factor Model of personality traits, hypothesizing
that these data could positively affect team formation. Numerous studies support this
idea [4,9–12,14,15], showing the impact of personality traits on team dynamics and perfor-
mance. Our findings indicate that traits like conscientiousness and extraversion tend to
improve team performance, while neuroticism tends to hinder it. Different studies high-
light various traits. For example, [14] also found that extraversion and openness positively
impact team performance but other traits had a lesser effect.
In contrast, ref. [11] pointed out the importance of conscientiousness but found other
traits less significant. These differences might be due to the different study environments,
participant groups, and research methods. As [12] notes, gender is important in team
dynamics. However, many studies, ours included, mostly have participants of one gender,
which limits the ability to consider gender differences fully.
Our study distinguished itself from others through our approach to data utilization.
Beyond collecting psychometric data, we integrated information from Git repositories and
team satisfaction surveys over an extended period. This integration provided a comprehen-
sive view of team dynamics and performance. Utilizing this dataset, we developed a model
capable of predicting team satisfaction with accuracy of 74% and precision of 69%. Despite
the challenges posed by the limited size of our dataset and the complexities inherent in
modelling human interactions, these performance metrics are noteworthy.
There were confounding factors not fully accounted for in our study, such as the
technical skills of the students and their pre-existing relationships. Given that TF was
largely random, it is plausible that some students had prior connections through earlier
Electronics 2024, 13, 178 15 of 17
courses or projects, potentially influencing team dynamics. Although our model has some
limitations, its practical application demonstrated potential for success. When applied
repeatedly within the TF algorithm, the predictive model effectively contributed to the
formation of better-performing teams.
7. Conclusions
This study explored the impact of the Five-Factor Model of personality traits on team
dynamics, analysing data from 157 third-year undergraduates formed into 31 teams. The
results showed that teams typically perform better with more extroverted and conscientious
but less neurotic members. Using an XGBoost model, we successfully predicted team satis-
faction with 74% accuracy and 69% precision. We also introduced an innovative method
for automatically forming teams based on the FFM. By applying a Simulated Annealing
technique, we developed an efficient algorithm that effectively groups participants accord-
ing to specific criteria. This method ensures a well-balanced distribution of personality
traits among teams and enhances overall member satisfaction.
Our study’s approach involved collecting a diverse range of variables about the
participants, but the limitations primarily stem from the scope of the data collected. To
enhance the model’s precision, a broader dataset is essential. A single course was used,
which limited the generalization of our findings. Expanding the range of variables to
include factors like technical skills and gathering data from various courses with a more
diverse participant group could mitigate gender biases and other disparities. Additionally,
the study did not sufficiently focus on the impact of participants’ technical abilities and the
influence of varied roles within SE projects on team dynamics, since the project required all
students to perform similar tasks.
Our recommendation for future research is to broaden the scope of the study by
including a more comprehensive range of courses and a larger, more diverse group of
participants. Additionally, integrating data on technical proficiencies and specific roles
within teams could offer deeper insights into the nuances of team dynamics.
Author Contributions: Conceptualization, D.L. and J.V.; methodology, D.L.; software, J.V.; validation,
J.V. and D.L.; formal analysis, J.V. and D.L.; investigation, J.V. and D.L.; data curation, J.V. and
D.L.; writing—original draft preparation, J.V. and D.L.; writing—review and editing, D.L. and J.V.;
visualization, J.V.; supervision, D.L.; project administration, D.L. All authors have read and agreed to
the published version of the manuscript.
Funding: This research received no external funding.
Informed Consent Statement: Informed consent was obtained from all subjects involved in the study.
Data Availability Statement: The data that support the findings of this study are not publicly
available due to privacy restrictions. The participants in this research self-reported their answers
regarding personality traits, and ensuring the confidentiality and privacy of their responses is of
utmost importance. To protect the identity and sensitive information of the participants, we are
unable to share the raw data.
Acknowledgments: Sincere gratitude goes to everyone who has contributed to completing this
research, including the participating students and especially Marko Poženel and Aljaž Zrnec, for
their dedicated assistance and support during the course Software Engineering.
Conflicts of Interest: The authors declare no conflicts of interest.
Abbreviations
The following abbreviations are used in this manuscript:
TF Team formation
SE Software Engineering
MBTI Myers–Briggs Type Indicator
FFM Five-Factor Model
Electronics 2024, 13, 178 16 of 17
O Openness
C Conscientiousness
E Extraversion
A Agreeableness
N Neuroticism
ICC Intraclass Correlation Coefficient
References
1. Zainal, D.; Razali, R.; Mansor, Z. Team Formation for Agile Software Development: A Review. Int. J. Adv. Sci. Eng. Inf. Technol.
2020, 10, 555. [CrossRef]
2. Budak, G.; Kara, I.; Ic, Y.; Kasimbeyli, R. New mathematical models for team formation of sports clubs before the match. Cent.
Eur. J. Oper. Res. 2019, 27, 93–109. [CrossRef]
3. Sommerville, I. Software Engineering, 9th ed.; Addison-Wesley: Bruke, WA, USA, 2010.
4. Costa, A.; Ramos, F.; Perkusich, M.; Dantas, E.; Dilorenzo, E.; Chagas, F.; Meireles, A.; Albuquerque, D.; Silva, L.; Almeida, H.;
et al. Team Formation in Software Engineering: A Systematic Mapping Study. IEEE Access 2020, 8, 145687–145712. [CrossRef]
5. Fiore, S.; Salas, E.; Cuevas, H.; Bowers, C. Distributed coordination space: Toward a theory of distributed team process and
performance. Theor. Issues Ergon. Sci. 2003, 4, 340–364. [CrossRef]
6. Mathieu, J.E.; Rapp, T.L. Laying the foundation for successful team performance trajectories: The roles of team charters and
performance strategies. J. Appl. Psychol. 2009, 94, 90–103. [CrossRef] [PubMed]
7. Burke, S. Is there a “Big Five” in Teamwork? Small Group Res. 2005, 36, 555–599.
8. Fiore, S.M.; Schooler, J.W. Process mapping and shared cognition: Teamwork and the development of shared problem models. In
Team Cognition: Understanding the Factors that Drive Process and Performance; American Psychological Association: Washington, DC,
USA, 2004.
9. Driskell, J.E.; Goodwin, G.F.; Salas, E.; O’Shea, P.G. What makes a good team player? Personality and team effectiveness. Group
Dyn. Theory Res. Pract. 2006, 10, 249–271. [CrossRef]
10. Kichuk, S.L.; Wiesner, W.H. The big five personality factors and team performance: Implications for selecting successful product
design teams. J. Eng. Technol. Manag. 1997, 14, 195–221. [CrossRef]
11. O’Neill, T.A.; Allen, N.J. Personality and the Prediction of Team Performance. Eur. J. Personal. 2011, 25, 31–42. [CrossRef]
12. Gilal, A.; Jaafar, J.; Omar, M.; Basri, S.; Izzatdin, A. Balancing the Personality of Programmer: Software Development Team
Composition. Malays. J. Comput. Sci. 2016, 29, 145–155. [CrossRef]
13. Cruz, S.; Da Silva, F.; Monteiro, C.; Santos, C.; Dos Santos, M. Personality in software engineering: Preliminary findings from a
systematic literature review. In Proceedings of the 15th Annual Conference on Evaluation & Assessment in Software Engineering
(EASE 2011), Durham, UK, 11–12 April 2011; IET: Stevenage, UK, 2011; pp. 1–10. [CrossRef]
14. Soomro, A.B.; Salleh, N.; Nordin, A. How personality traits are interrelated with team climate and team performance in software
engineering? A preliminary study. In Proceedings of the 2015 9th Malaysian Software Engineering Conference (MySEC),
Kuala Lumpur, Malaysia, 16–17 December 2015; pp. 259–265. [CrossRef]
15. Rehman, M.; Mahmood, A.K.; Salleh, R.; Amin, A. Mapping job requirements of software engineers to Big Five Personality Traits.
In Proceedings of the 2012 International Conference on Computer & Information Science (ICCIS), Kuala Lumpur, Malaysia, 12–14
August 2012; pp. 1115–1122. [CrossRef]
16. Scott, T.J.; Tichenor, L.H.; Bisland, R.B.; Cross, J.H. Team dynamics in student programming projects. ACM Sigcse Bull. 1994,
26, 111–115. [CrossRef]
17. Costa, P.; McCrae, R. The Five-Factor Model, Five-Factor Theory, and Interpersonal Psychology. In Handbook of Interpersonal
Psychology: Theory, Research, Assessment, and Therapeutic Interventions; John Wiley & Sons, Inc.: Hoboken, NJ, USA, 2012; pp. 91–104.
ISBN 9780470471609. [CrossRef]
18. John, O.P.; Srivastava, S. The Big Five Trait taxonomy: History, measurement, and theoretical perspectives. In Handbook of
Personality: Theory and Research; Guilford Press: New York, NY, USA, 1999.
19. Ashton, M.; Lee, K.; Goldberg, L.; de Vries, R. Higher Order Factors of Personality: Do They Exist? Personal. Soc. Psychol. Rev.
2009, 13, 79–91. [CrossRef] [PubMed]
20. Jia, J.; Zhang, P.; Zhang, R. A comparative study of three personality assessment models in software engineering field. In
Proceedings of the 2015 6th IEEE International Conference on Software Engineering and Service Science (ICSESS), Beijing, China,
23–25 September 2015; pp. 7–10. [CrossRef]
21. Chen, T.; Guestrin, C. XGBoost: A Scalable Tree Boosting System. In Proceedings of the 22nd ACM SIGKDD International
Conference on Knowledge Discovery and Data Mining, San Francisco, CA, USA, 13–17 August 2016; ACM: New York, NY, USA,
2016; pp. 785–794. [CrossRef]
22. Lundberg, S.M.; Lee, S.I. A Unified Approach to Interpreting Model Predictions. In Proceedings of the Advances in Neural Information
Processing Systems; Guyon, I., Luxburg, U.V., Bengio, S., Wallach, H., Fergus, R., Vishwanathan, S., Garnett, R., Eds.; Curran
Associates, Inc.: New York, NY, USA, 2017; Volume 30.
23. Luke, S. Essentials of Metaheuristics, 2nd ed.; Lulu: Morrisville, NC, USA, 2013.
24. Hancock, S.A.; Hill, A.J. The effect of teammate personality on team production. Labour Econ. 2022, 78, 102248. [CrossRef]
Electronics 2024, 13, 178 17 of 17
25. Linden, D.v.d.; Nijenhuis, J.t.; Bakker, A.B. The General Factor of Personality: A meta-analysis of Big Five intercorrelations and a
criterion-related validity study. J. Res. Personal. 2010, 44, 315–327. [CrossRef]
26. ICERI 2015: 8th International Conference of Education Research and Innovation, Seville (Spain), 16–18 November 2015: Proceedings; Iated
Academy: Casablanca, Morocco, 2015.
27. Liljequist, D.; Elfving, B.; Roaldsen, K.S. Intraclass correlation: A discussion and demonstration of basic features. PLoS ONE 2019,
14, e0219854. [CrossRef] [PubMed]
28. Cicchetti, D. Guidelines, Criteria, and Rules of Thumb for Evaluating Normed and Standardized Assessment Instrument in
Psychology. Psychol. Assess. 1994, 6, 284–290. [CrossRef]
29. Koo, T.K.; Li, M.Y. A Guideline of Selecting and Reporting Intraclass Correlation Coefficients for Reliability Research. J. Chiropr.
Med. 2016, 15, 155–163. [CrossRef] [PubMed]
Disclaimer/Publisher’s Note: The statements, opinions and data contained in all publications are solely those of the individual
author(s) and contributor(s) and not of MDPI and/or the editor(s). MDPI and/or the editor(s) disclaim responsibility for any injury to
people or property resulting from any ideas, methods, instructions or products referred to in the content.