Using Big Data and Machine Learning in Personality Measurement
(2020)
Published online in Wiley Online Library (wileyonlinelibrary.com) DOI: 10.1002/per.2305
Abstract: This conceptual paper examines the promises and critical challenges posed by contemporary personality measurement using big data. More specifically, the paper provides (i) an introduction to the type of technologies that give rise to big data, (ii) an overview of how big data is used in personality research and how it might be used in the future, (iii) a framework for approaching big data in personality science, (iv) an exploration of ideas that connect psychometric reliability and validity, as well as principles of fairness and privacy, to measures of personality that use big data, (v) a discussion emphasizing the importance of collaboration with other disciplines for personality psychologists seeking to adopt big data methods, and finally, (vi) a list of practical considerations for researchers seeking to move forward with big data personality measurement and research. It is expected that this paper will provide insights, guidance, and inspiration that helps personality researchers navigate the challenges and opportunities posed by using big data methods in personality measurement. © 2020 European Association of Personality Psychology
Today's technologies, and the big data that they give rise to, have dramatically infused our everyday lives and continue to transform society in many domains, such as through social media, marketing, online education, and voting, and through continued developments in areas such as autonomous vehicles, personalized medicine, and automation in the workforce. This ongoing transformation not only reflects the inexorable impact of technology on society but also promises to benefit science by revealing unique undiscovered aspects of individuals' lives and personality. As researchers continue to investigate personality within the arena of big data and artificial intelligence (AI), knowledge of an interrelated and evolving system of benefits and challenges along ethical, legal, and scientific fronts is beginning to accrue. Furthermore, as we continue to think about and investigate personality in the domains of big data and machine learning, many ways of viewing big data personality measurement are likely to be useful, for example, (i) as a cultural factor that allows socio-technological contexts, relationships, and communications to develop in ways not possible just decades ago, creating new social behaviours and norms relevant to personality; (ii) as an advisor and trainer that reciprocates between behaviour and adaptive self-management interventions in a real-time system reflective of personality; (iii) as an analyser and informant that gathers, analyses, summarizes, and reports personality-relevant information to decision makers (e.g. teachers, supervisors, groups, and teams); and (iv) as a scientific resource that ideally provides personality-relevant data to support and improve future research and application efforts.

All these potential applications of big data for measuring and analysing personality and its outcomes pose new and interrelated technological, legal, and ethical questions and challenges for personality researchers and practitioners. Fortunately, the science of personality is ready to take on this future, given its long and rich history of developing, implementing, and evaluating measures of personality. In this context, the current paper explores the research opportunities and challenges associated with big data personality measurement, connecting future promise to acquired research wisdom by building on traditional psychometric concepts of reliability and validity, as well as principles of fairness and privacy.

Out of necessity, our overview will be general, given that personality research is just beginning to employ and investigate questions involving big data, AI, and machine learning, a fast-moving arena that opens many exciting doors for personality research. First, we will introduce technologies that have allowed for the collection of big data relevant to personality. Second, we will provide a brief overview of the small-but-growing body of existing big data personality research. Third, we will offer an organizing framework for thinking about big data in personality science. Fourth, we will re-examine traditional psychometric evaluations of reliability and validity, as well as fairness and privacy, within the modern context of big data algorithms as applied to personality-relevant big data. Fifth, we reflect on the critical importance of multidisciplinary collaboration for personality psychologists seeking to adopt big data methods. Finally, we will list key considerations that can inform future big data personality measurement and research.

*Correspondence to: Leo Alexander III, Department of Psychological Sciences, Rice University, 6100 Main St. MS25, Houston, TX 77005, USA. E-mail: [email protected]
For example, the myPersonality project (Stillwell & Kosinski, 2020) developed an app where, until it closed in 2012, an astonishing six million Facebook users had taken at least one of a suite of online psychological tests available, including self-report and peer-report Big Five personality measures. Of those respondents, 40% agreed to share their anonymized data with the myPersonality project. Until the authors stopped sharing myPersonality data in 2018, these data were made available to other researchers, leading to dozens of peer-reviewed journal publications and greatly expanding the body of big data personality research (see mypersonality.org for a reflection on the project and a selected bibliography).

In addition to such large-scale studies, two recent meta-analyses have been conducted to estimate correlations between social media digital trace data and traditional Big Five questionnaires (Azucar, Marengo, & Settanni, 2018; Settanni, Azucar, & Marengo, 2018). Meta-analytic estimates suggest that correlations between digital trace data and measures of personality range from .29 to .40 across the Big Five personality traits, perhaps a reasonable level of convergence given the noisiness of trace data. Comparing these meta-analytic findings with a meta-analysis of the accuracy of human ratings of personality based on social media data (Tskhay & Rule, 2014), computer algorithms were at least more consistent than human raters (Youyou et al., 2015), if not more accurate given the absence of a gold standard for measures of personality.

Although using machine learning and big data to predict scores on the Big Five is useful, there is a much more extensive and continuing need to understand convergent and discriminant patterns involving machine learning models and personality-relevant data within a much larger nomological net of psychological constructs (Campbell & Fiske, 1959). Related to this need, additional research could also pursue three key questions about big data: (i) whether they yield unique personality-relevant variance; (ii) whether they provide incremental prediction of school, organizational, and life outcomes above traditional Big Five measures (Roberts, Kuncel, Shiner, Caspi, & Goldberg, 2007); and (iii) whether this incremental prediction can be attributed to personality and/or to other constructs.

As we have briefly described, research has already begun to make concerted efforts in predicting personality from big data; but whether additional personality insights actually result from big data seems to remain fertile investigative soil, for example, how have the clustering and/or predictive algorithms involving big data improved our understanding of personality? And how does an improved understanding of personality translate into people's functioning and well-being in their daily lives at work, school, and home? Before we partially address these questions in the current paper, let us first place some useful contours around the meaning of big data in personality research.

A FRAMEWORK FOR BIG DATA IN THE PERSONALITY CONTEXT

So many definitions of big data exist that they could themselves populate a big data set. Typically, these definitions do not focus on personality or any psychological characteristics but instead are broadly sensitive to unique characteristics of the data arising from the aforementioned digital technologies, quite often employing several Vs in doing so, for example, the historically well-known volume, velocity, and variety of big data proposed nearly 20 years ago (Laney, 2001); but also visualization, variability, vulnerability, visibility, vagueness—and still other Vs (Storey & Song, 2017). Such definitional terms are important for their useful abstractions, yet as any personality researcher dealing with big data would tell you, the devil of understanding big data lies in the important details associated with any particular application and setting. For example, technology interacts with humans, in addition to recording their behaviour (e.g. multimedia platforms use algorithms to suggest media as well as record a user's preferences in media), and thus, the influences of AI technology and human personality will be reciprocal. We say this knowing that technologies will always keep improving in ways that will better evoke, detect, and reflect personality (e.g. more effective AI, new data analytic techniques, faster processing speed, and greater storage capacity; Gandomi & Haider, 2015).

In the context of these changes, our perspective on big data relevant to personality research leans towards the practical: definitions and frameworks for big data are useful not only for their content but also as a tool for cultivating a growing community of psychological and interdisciplinary researchers who engage in a wide range of projects and applications involving personality-oriented big data. Keeping this perspective firmly in mind, we selected the framework proposed by De Mauro, Greco, and Grimaldi (2015) as a general and practical guiding framework for approaching big data (versus traditional data sets). These authors surveyed a wide range of definitions of big data and identified four common and distinct themes.

Information

Most frequently, definitions of big data include information, or the size and structure of the data itself. This is related to the aforementioned 'Vs' and it is worth pointing out that in the history of psychological science, Cattell's data box (Cattell, 1946) parallels this information component of big data (Adjerid & Kelley, 2018). The data box is conceptual and reflects three dimensions of information—people × variables, people × occasions, and variables × occasions—where diverse types of rich within-person and between-person big data can populate this matrix (e.g. experience sampling methodologies can vary in their sampling rates in terms of people, variables, and occasions; Haqiqatkhah & Tuerlinckx, 2019). Also historically, Cronbach and colleagues' generalizability theory is an important-yet-underappreciated extension of the data box (Cronbach, Gleser, Nanda, & Rajaratnam, 1972; Cronbach, Rajaratnam, & Gleser, 1963) to estimate the variance along an infinite variety of dimensions (known as facets in generalizability theory), such as units, treatments, operations, and settings, called UTOS (Cronbach, 1982). The collection, analysis, and interpretation of personality-oriented big data can usefully benefit from this historical foundation of thinking
that comes with Cattell's data box and Cronbach et al.'s generalizability theory, because it lays out an important framework for research projects, programmes, and related efforts pertaining to replication and meta-analysis (Nosek & Errington, 2020; Simons, Shoda, & Lindsay, 2017). In this context, an important dimension of personality measurement from big data is methodology, because the same set of big data might be analysed with any of a wide range of machine learning algorithms (e.g. random forests, support vector machines, and elastic net regression). Extending Cronbach's term to include methodology, UTOS then becomes MUTOS (Becker & Aloe, 2008).

Another important parallel with the methodological traditions in psychological research comes with the big data Vs of veracity and value, which respectively tie into the well-established concepts of reliability and validity found in psychometrics, as well as the concept of utility found in industrial-organizational psychology. Veracity refers to the quality and accuracy of big data, which is often lower for individual data points that are seemingly related to personality (e.g. a single Facebook 'like'), but can dramatically increase when appropriately aggregated across the multiple personality-related situations and time points that big data offer. Note that this point parallels the historical discovery of psychometrically reliable traits via the aggregation of personality-relevant behaviours (Epstein, 1983). In the big data realm, one may aggregate big data by variables (e.g. grouping the columns of a data set via principal components analysis or factor analysis), by subjects (e.g. grouping the rows of a data set via k-means clustering or hierarchical agglomerative clustering), or by variables and subjects jointly, such that personality-based signals can be built up and contrasted against other systematic effects and errors found in a vast sea (or messy matrix) of big data.

Value is a big data term that often refers to validity and/or utility, for example, personality-relevant big data can hold value to the extent that it predicts meaningful outcomes in academic, organizational, and other developmental, health, and life domains (i.e. validity) and/or for the fact that those predictions usefully inform personal or institutional goals (i.e. utility). Thus, the value in big data personality research is often dependent on one's perspectives and goals. Organizations might define and quantify value and utility as a financial return on investment (Cascio, Boudreau, & Fink, 2019). But for researchers, big data findings may hold value to the extent that they can contribute to traditional ways of accumulating knowledge about effect size, predictions, and their generalization, as learned from patterns of relationship over time (Cronbach & Meehl, 1955), and via natural and formal interventions (Chester & Lasko, in press).

Technology

Many big data definitions also emphasize the specific and often-sophisticated hardware and software technologies that allow for the intensive real-time and large-scale collection and management of personality-relevant data. In addition to commercial efforts, open-source frameworks such as Hadoop and Spark have been developed for this purpose (for a survey of these and other popular big data frameworks, see Inoubli, Aridhi, Mezni, Maddouri, & Nguifo, 2018; also see mobileQ, https://mobileq.org/, Meers, Dejonckheere, Kalokerinos, Rummens, & Kuppens, 2019). Interactions in games, immersive virtual reality environments, and conversations with AI avatars are just a few examples of modern technologies that may provide big data on personality.

Certainly, modern digital technologies will affect the nature of personality-relevant big data and their subsequent analysis in many ways, both intended and unintended. For example, video surveillance of employees may provide reliable data and analysis pertaining to customer service, in terms of sociability, conscientiousness, conflict avoidance, and so on. However, much like the classic Hawthorne studies (Roethlisberger & Dickson, 1939), when employees know they are being continuously surveilled, the implementation of data-gathering and automated technologies in the workplace may yield unintended negative effects on employees' responses/behaviours (e.g. approaching customers more than is necessary), and reactions will likely vary depending on personal characteristics (Yost, Behrend, Howardson, Darrow, & Jensen, 2019) and demographic characteristics (e.g. age, gender, race/ethnicity, and socio-economic status). If one seeks to understand the culture of modern work, education, and home settings involving big data, then some form of personality measurement must be an essential part of it.

This raises an important concern surrounding the objectivity of big data measures of personality, which has obviously been a longstanding concern with traditional personality measures. Psychometrically, traditional personality tests attempt to minimize variance in construct-irrelevant factors, meaning that test scores are as unaffected as possible by both systematic and random errors related to test administration, content, and test responding. By contrast, just as researchers hope to find personality-relevant variance in big data, there is also a massive suite of systematic and random irrelevancies that an algorithm can be trained on, leading to confounds we tend to call algorithmic bias (Barocas & Selbst, 2016). At best, these irrelevancies will add random errors to the measurement of personality using big data (the rough equivalent of lowering Cronbach's alpha). At worst, incorrect conclusions and decisions may result if such irrelevancies vary systematically with participant characteristics (e.g. demographics), individual differences (e.g. personality itself, cognitive ability, and video game experience), and situational influences (e.g. Internet bandwidth, computer equipment). To provide an example that raises these issues, previous research has found that slightly more women, African Americans, Hispanics, and younger applicants used mobile devices to complete high-stakes employment assessments than other demographic groups (Arthur, Doverspike, Muñoz, Taylor, & Carr, 2014), and it is expected that those devices generally will not impair the application process relative to other application methods. Although mobile technologies tend to show small or no subgroup differences for traditional personality measures (Arthur, Keiser, & Doverspike, 2018), one should be sensitive to potential subgroup differences in big data, personality-oriented and otherwise, as there are then potential fairness and legal
implications in terms of ensuring the data are based on the equal access and use of technology.

Overall, it cannot be assumed wholesale that big data are personality relevant or can reliably predict more established measures of personality constructs. Historically, traditional computerized and pencil-and-paper versions of personality inventories have been closely examined psychometrically in many ways to determine the strength of construct measurement and the nature and extent of systematic construct irrelevancies (e.g. because of measurement method, item-specific content; Arthur, Glaze, Villado, & Taylor, 2010; Meade, Michels, & Lautenschlager, 2007; Salgado & Moscoso, 2003). By contrast, most sources of big data reflect a nature or purpose other than personality measurement and are often used for prediction but with very little consideration for construct measurement. Big data would therefore benefit from psychometric modelling and methods (even if these models/methods depart from traditional methods) to help ensure that personality construct variance is being measured, while minimizing or avoiding deleterious contextual effects (e.g. distortions and biases). And likewise, more traditional and psychometrically well-developed personality measures can contribute to and be compared with more organic forms of big data.

Technology can influence more nuanced and less obvious characteristics of personality measurement with big data as well. For example, perhaps theory and thinking suggest that a particular measurement should be captured at certain intervals (e.g. every 10 seconds), but the data were collected by a device that recorded measurements at longer intervals (e.g. 30 seconds) or perhaps were event-driven instead (e.g. the data were captured whenever certain patterns of movement, physiology, physiognomy, or location are exhibited). Although these measurement considerations generally and typically exist in all research design, the point here is to appreciate how the choice and design of technology interacts with conceptual considerations, practical constraints, and possible analyses.

Algorithms

Regarding this last point, the digital technologies that yield big data may also influence what analytical techniques are appropriate (and inappropriate) through the types of data that they produce. For example, when personality-relevant big data can be collected, integrated, and analysed simultaneously on a real-time or streaming basis from multiple technological sources (e.g. video, cell phone, and social media posts), the modelling of the data (and also the prior cleaning and preparation of the data) may necessarily become complex along multiple levels. For instance, imagine personality big data, collected over time, dynamically predicting outcomes for students (i.e. engagement, teamwork, and grades), teachers (i.e. giving timely and supportive feedback and providing accurate information), and institutions (i.e. overall student retention and graduation rates). This type of complex prediction problem could be modelled with any of hundreds of types of machine learning algorithms, or it could be modelled explicitly in a multilevel, structural, and
achieving robust prediction by doing two things not typically found in traditional statistical analyses of personality data:

1 Disturb the model. Certain machine learning algorithms tend to combine predictive results across hundreds of weak models to result in a stronger prediction (e.g. averaging across hundreds of trees in a random forest). Or in a similar vein, other algorithms might examine a grid of possible model parameters (hyperparameters) to 'tune' the model in the search for complex yet robust relationships (e.g. the learning rate parameter in gradient-boosted machines; the cost parameter in support vector machines).

2 Disturb the data. Virtually all machine learning models are concerned with some form of cross-validation that keeps the data for model development and the data for prediction separate. In 10-fold cross-validation, for instance, the data set is randomly divided into 10 sections or folds; 9/10ths of the data is considered the training set, used to develop the model; and then predicted values are generated for the test set, or the 1/10th of the data that did not participate in model development. This process is repeated nine more times so that every case gets to be in the test set with a predicted value. Bootstrapping methods contain another form of cross-validation, where a model is first estimated from the bootstrap sample (subsampling with replacement); then predictions are made for those cases from those data that did not participate in a given model; and this process is repeated a large number of times.

By disturbing both the model and the data, the wide range of machine learning techniques attempts to seek out the best model among those searched (#1 above) in terms of successful predictions (e.g. lowest mean-squared error) under cross-validation (#2 above).

In addition to clustering versus predictive modelling, machine learning algorithms can also be considered in terms of their interpretability (Ribeiro, Singh, & Guestrin, 2016a), which has obvious important ties to scientific understanding, policymaking, fairness, ethics, and efforts to improve both data and algorithms. Some algorithms are more interpretable than others. For example, it is relatively straightforward to explain the personality profiles of cluster means (centroids) in k-means clustering or k-nearest-neighbour prediction, or to explain the regression coefficients of personality predictors found in lasso or elastic net regression. In these cases, the models and processes by which these algorithms evaluate the associations between inputs and outputs are relatively transparent. By contrast, there are many black-box algorithms that, as the name suggests, are relatively opaque. Often the complexities explored by these algorithmic techniques do not have a closed-form function and are therefore much more difficult, if not impossible, to interpret (e.g. artificial neural networks tune network weights within arbitrary layers of hidden nodes; random forests average across hundreds of trees of variables; support vector machines use nonlinear profile matching along classification and prediction boundaries; see James, Witten, Hastie, & Tibshirani, 2013). Explanation versus prediction often (but not always) reflects a trade-off found in machine learning, where one must decide whether to gain some predictive power at the expense of a more straightforward and interpretable model (or vice versa).

Given this challenge of interpretability, why should one even consider using machine learning algorithms in personality measurement with big data? There are two very practical reasons worth emphasizing: (i) the number of variables exceeds the number of cases in a data set (e.g. because text, audio, and social media data sets are vast), meaning that traditional analyses such as multiple linear regression are impossible (i.e. the matrix will not be invertible; Fan & Li, 2006), and (ii) the researcher seeks to go beyond traditional analyses, to see whether robust complex relationships (ones that may not be specified a priori) can be located, and prediction can be improved without overfitting the data (i.e. mistaking sampling error variance for actual complexity; Hawkins, 2004).

To move beyond these general considerations and learn more about what machine learning is and how to apply it, most personality psychologists would benefit from the practical guide to big data in psychology by Chen and Wojcik (2016), as well as the psychology-oriented big data tutorials pertaining to text mining or web scraping (e.g. Landers, Brusso, Cavanaugh, & Collmus, 2016), data processing and predicting outcomes (e.g. Kosinski, Wang, Lakkaraju, & Leskovec, 2016), and meta-analysis (e.g. Cheung & Jak, 2016).

Impact

The last major theme contained in many definitions of big data reflects the impact of big data, or how the output from big data algorithms is used as feedback for our behaviours: informing, interacting with, and potentially magnifying, mitigating, or otherwise influencing them. These reciprocal effects between algorithms and behaviours might only be understood within short time spans, although it is possible that habits, personality traits, and behaviours could be affected in the longer term as well. This type of iteration and impact should be considered in the context of the range of settings in which big data are used, from the settings that might be relevant at the individual level, to the broad and aggregated settings of big data analysis that might be relevant at the community and policy levels. For example, if we consider how a user's personality affects how and when they follow up on recommendations from a smartphone health app, then the individual-level impact on health, privacy, and safety over time may be of paramount importance. However, for scholars and policy makers who are focused on how personality data from social media and résumé websites are used to recruit and hire employees, then the societal-level focus may be on trends in national productivity, worker rights, or legal liabilities. Applications in all these settings, and at all these levels, have ethical challenges for applying knowledge concerning personality, where enumerating, discussing, and weighing the pros and cons often do not lead to obvious or single solutions and therefore must be explored deeply and on a continuous basis.
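To make the two algorithm families discussed earlier more concrete, the central loop of k-means clustering (group cases so that between-cluster separation dominates within-cluster spread, each case 'belonging' to its nearest centroid) can be sketched in a few lines of Python. This is a deliberately simplified toy for illustration, not code from any package cited in this paper; in particular, the deterministic centroid seeding is our own simplification, whereas production k-means implementations use random initialization with multiple restarts.

```python
import math

def kmeans(points, k, iters=100):
    """Toy k-means: seed centroids with k spread-out cases, then alternate
    (1) assigning each case to its nearest centroid and (2) moving each
    centroid to the mean of its members, until assignments stop changing."""
    # deterministic seeding for illustration; real k-means restarts randomly
    centroids = [points[i * (len(points) - 1) // max(k - 1, 1)] for i in range(k)]
    assign = None
    for _ in range(iters):
        new_assign = [min(range(k), key=lambda c: math.dist(p, centroids[c]))
                      for p in points]
        if new_assign == assign:          # assignments stable: converged
            break
        assign = new_assign
        for c in range(k):
            members = [p for p, a in zip(points, assign) if a == c]
            if members:                   # keep old centroid if a cluster empties
                centroids[c] = tuple(sum(d) / len(members) for d in zip(*members))
    return assign, centroids
```

Run on a handful of hypothetical respondents scored on two traits with k = 2, the function labels each case with its cluster, and the returned centroids are the interpretable 'personality profiles of cluster means' mentioned above. Note there is no external criterion anywhere in the loop, which is what makes this unsupervised learning.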
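Likewise, the 'disturb the data' logic of 10-fold cross-validation described above (fit on 9/10ths, predict the held-out 1/10th, repeat so that every case receives exactly one out-of-sample prediction) can be sketched as follows. As an assumption for illustration only, the 'learner' here is a simple one-predictor least-squares model; any of the supervised algorithms named in the text could be slotted into its place, and the function name is ours, not from the caret package.

```python
import random

def ten_fold_cv(xs, ys, k=10, seed=0):
    """k-fold cross-validation: every case receives exactly one prediction
    from a model fitted without that case's fold."""
    n = len(xs)
    idx = list(range(n))
    random.Random(seed).shuffle(idx)           # random fold assignment
    folds = [idx[i::k] for i in range(k)]      # k roughly equal folds
    preds = [None] * n
    for held_out in folds:
        train = [i for i in idx if i not in held_out]
        # fit a one-predictor least-squares model on the training folds only
        mx = sum(xs[i] for i in train) / len(train)
        my = sum(ys[i] for i in train) / len(train)
        sxx = sum((xs[i] - mx) ** 2 for i in train)
        sxy = sum((xs[i] - mx) * (ys[i] - my) for i in train)
        slope = sxy / sxx
        intercept = my - slope * mx
        for i in held_out:                     # predict only the unseen fold
            preds[i] = intercept + slope * xs[i]
    mse = sum((p - y) ** 2 for p, y in zip(preds, ys)) / n
    return preds, mse
```

The returned cross-validated mean-squared error is the kind of criterion the text describes: it rewards models for predicting cases they never saw, which is precisely what guards against mistaking sampling error variance for actual complexity.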
The potential impact of big data that is perhaps most relevant to personality researchers is how these data might increase the scientific understanding of personality. For example, large behavioural data streams may identify new indicators and aspects of personality that have not previously been revealed via traditional personality measures, such as Big Five measures based in the lexical tradition (Bleidorn, Hopwood, & Wright, 2017). Moreover, novel types of behavioural variables found in digital trace data may advance personality research through revealing new insights about trait content and trait-related processes. Big data insights such as these can lead to developing measures of new or expanded constructs in the future, ones that may amplify the original big data signal over the noise even further. For example, the lexical hypothesis, upon which many Big Five measures of personality have been developed, assumes that all personality can be captured in our language (e.g. introspective or retrospective self-report questionnaires). However, if some aspects of personality are better captured through behaviour, then perhaps existing personality constructs will be refined further, or new personality constructs will emerge from vast streams of personality-relevant behavioural data. Also, given the intensive sampling that big data offer, new patterns of personality change may be discovered that are reliable and predictable, if not dispositional.

Other aspects of big data may shed light on personality development and the expression of personality in various contexts. The ubiquity of big data, the large diverse samples they offer, and their temporal granularity promise to yield greater insight into the reliability of personality indicators over one's lifespan and across cultures (see Bleidorn et al., 2017). Additionally, big data and machine learning might complement current advances in personality research, for example, through measuring and analysing massive amounts of data collected across the lifespan of huge samples that traditional technologies and statistical methods simply cannot. Through the richness of big data, we might gain more ecologically valid insights into personality development (and stability) across the lifespan, and how personality facilitates, limits, or otherwise interacts with situational factors. For big data that are multicultural and international in nature, this big data approach promises to help researchers gain a deeper understanding of the culture-free (etic) and culture-specific (emic) components of personality, and not merely enhance predictive validity (although these are often interrelated goals). As a prime example, the large, culturally diverse data sets offered by the many digital communication platforms used throughout the world provide a wealth of personality data where researchers can compare the quality, reliability, and validity of big data indicators of personality in many different populations, in many different settings, and over long periods of time.

Applying this framework to big data personality measurement

As our discussion of this framework suggests, the measurement of personality using big data is complex and involves multiple interrelated categories: information, technology, algorithms, and impact. Thus, the reader should appreciate that the value of personality measurement using big data depends on the combination of these categories—none of these should be considered in isolation. For example, the technology used to collect data (e.g. a social media platform) will determine the types of personality-relevant big data available (e.g. large and sparse data sets of social media 'likes' where there are more variables than cases). Consequently, these types of data may be more readily analysed by a certain algorithm or set of algorithms (e.g. using clustering algorithms to reduce the dimensionality of the data, and then predictive algorithms to classify the media platform's users based on some dimension of personality). In turn, the algorithmic approach chosen may be more or less interpretable than other approaches, having implications for the impact of using machine learning algorithms in situations where a clear understanding of what has driven a prediction is useful or necessary, such as when the use of big data may or may not be legal (e.g. employee recruitment) or safe (e.g. medical treatment recommendations). A good practice when planning big data personality analyses is to generate options within these four categories, appreciating how the choices made in one category often influence the options available in another (e.g. the choice of technology used to collect big data influences the information available). Mapping out multiple scenarios in this way, and comparing them with one another, can also be helpful in this planning process.

BIG DATA CHALLENGES TO RELIABILITY, VALIDITY, FAIRNESS, AND PRIVACY

Big data involving personality-relevant information, accompanied by robust predictions, have the potential to improve our understanding of personality. In fact, the reason researchers and practitioners are investing in big data and machine learning is to obtain new and better personality insights, gained through more sophisticated-yet-robust relationships, and a better overall understanding of personality constructs. However, as exciting as this might sound, benefits do not accrue automatically by bringing big data and machine learning together. Instead, in line with the hypothetico-deductive cycle, personality researchers must continue to iterate between top-down intentional forms of big data (those resulting from measures based in personality theory) that inform and are informed by bottom-up incidental big data (those that happen to be available). Such an iterative process of empirical analysis motivating the refinement and revision of psychological theory (and vice versa) has a long scientific history (Fiedler, 2018). This process will continue with big data analyses as it has with traditional data analyses, where, both within and across research endeavours, we continuously accrue information about big data reliability and validity that informs personality theory.

Reliability

The nature of big data poses many challenges to quantifying sources of construct-relevant variance and error variance,
and how to create an appropriate reliability model in doing so. Using the framework, which we previously introduced, it is likely that appropriate reliability estimates will be heavily influenced by the unique combination of information, technology, and algorithms within each study or application. The complexity and messiness of big data—particularly when the number of variables exceeds the sample size—will inevitably demand novel approaches to estimating reliability. Perhaps in some cases, traditional concepts of reliability can still be of use (e.g. internal consistency, test–retest, parallel forms, and interrater agreement), but we suggest that new conceptions of reliability, or hybrid estimates that combine traditional measures of personality with big data, might find new ground and new importance in the context of big data.

For example, a big data set may contain movie ratings, but the matrix is sparse because individual subjects view only a very small subset of those movies. Therefore, reliability and agreement are challenging to estimate (e.g. rank-order stability, absolute accuracy, and factor structure). To add even more complexity to this modelling problem, ratings might be time dependent (e.g. holiday movies are rated higher over the holiday of interest); person dependent (e.g. only certain people celebrate certain holidays); and people drop in and out of the data set arbitrarily. These and other factors may have independent and joint effects on big data as a reflection of personality. Here, various forms of network analyses (Epskamp, Rhemtulla, & Borsboom, 2017) may take the place of traditional reliability estimates (see Christensen, Golino, & Silvia, 2020, and Costantini et al., 2019, for inspiration specific to personality research).

Thus far, we have only discussed the reliability of personality big data in relation to the characteristics of the data themselves, but the technology used to collect the data also carries important implications for reliability. Returning to our previous example, not all movie raters in our hypothetical big data set are watching movies on the same device or in the same environment. Some subjects in the data set may have rated a movie after watching it on a 60-inch smart TV with a fast fibre optic Internet connection in their spacious home theatre, whereas others watched the same movie on the 5-inch screen of their iPhone with a sporadic Internet connection on the subway train during their commute home from work. This raises the empirical question: how reliable are such ratings across different devices and different settings? At this point, we cannot answer this question in too much detail. However, it seems clear that understanding systematic variation in the reliability of big data indicators of personality across such technological contexts would serve a very practical purpose that informs appropriate data collection and interpretation. For example, time-intensive data-collection technologies, along with the types of data that are collected, might affect the intervals at which we should assess (and re-assess) the reliability of big data personality measures. Importantly, the interest in personality-related big data opens the door to understanding real-time changes in data quality, reliability, validity, and subgroup differences—and how they are related.

We have discussed just a few of the implications the information and technology aspects of big data may have for reliability, but the complexity of the analyses allowed for by algorithms themselves also poses challenges to understanding the reliability of personality measurement with big data. In a recent example, machine learning algorithms have proven useful in attempts to analyse idiographic or within-person models of personality, meaning that each person contains variation and covariation in their own trait expressions that can be modelled over time. Beck and Jackson (2019b) used experience sampling method (ESM; Csikszentmihalyi & Larson, 2014) data to model the time-lagged and contemporaneous relationships between within-person manifestations of personality using the interpretable machine learning method of the graphical lasso (Friedman, Hastie, & Tibshirani, 2008; an extension of Tibshirani, 1996). The authors found contemporaneous idiographic models to be relatively consistent over time, whereas lagged models were not; however, both models were found to demonstrate individual differences in consistency over time. Specifically, the results indicated that within-person networks of personality variables were relatively consistent with those measured in a second wave ESM study conducted a year later when the personality variables were measured at the same time points (i.e. contemporaneous). However, when ESM measurements were lagged by about 4 hours (to model the daily dynamics of personality variables), the models were not consistent with those measured in the second wave of the study. Although this represents a very novel research application of big data, it is interesting to consider how machine learning models might begin to shed light on scientific questions that involve the prediction of dynamic within-person personality processes unfolding over time and individual differences in their reliability.

To move research of this nature forward even further, others have argued for understanding the psychometric properties of experience sampling personality state measures (Beckmann & Wood, 2017), an important illustration of the need for personality researchers to improve their understanding of the reliability of big data. Although much research has been conducted on big data indicators of personality (see the meta-analysis by Azucar et al., 2018, for a list of studies predicting Big Five personality traits from digital footprints on social media), most of this research has been focused on the prediction of traditional measures of personality (Wright, 2014), such that outside of this sphere, much about the reliability and other psychometric properties of big data indicators of personality has yet to be evaluated. Reliability can take many forms that are analogous to traditional ones, even if it is modelled differently, for example, internal consistency, longitudinal, and alternate forms. Psychometric modelling of big data will be an important direction, because reliability is an important feature of all data, yet as we have already emphasized, personality-relevant big data are often messier (less structured, more variables) than the personality data and items coming from traditional psychological tests. In short, reliability is just as important for understanding personality in the big data era as it has always been, because there are scientific, practical, and ethical imperatives to
separating variance relevant to constructs (e.g. personality, motivation, knowledge, and otherwise) from variance irrelevant to constructs (e.g. demographics, rater biases, item-specific variance, and otherwise). We thus look forward to seeing personality psychologists informing data scientists (e.g. applied statisticians and computer scientists), and vice versa, when developing new approaches to evaluating the reliability of personality-relevant variance within big data sets.

Novel measurement approaches

In traditional measurement, the purpose of reliability as a part of construct validation has been to provide evidence that the observed variance in scores on a measure of personality is related to the underlying latent personality construct being measured (Borsboom, Mellenbergh, & Van Heerden, 2004). This task can be extremely difficult for big data, because researchers and respondents have less control over how the data are produced. For example, the very same technologies and predictive algorithms that collect big data in the first place (e.g. Facebook likes, Twitter posts, Netflix movie ratings, and Amazon product ratings) are being used to select and tailor interventions that inform, amplify, reduce, or otherwise condition behaviour (e.g. future consumer purchases). Turning back to our movie example, a person's movie choices and ratings lead to movie recommendations—and an algorithm's recommendations are based on the continuous analysis of viewer choices and ratings. The general point here is whether algorithmic recommendations based on personality-related behaviour are premature, just right, too late, or something else. In this way, personality is in a network of dynamic relationships with behaviour and the interventions received. This design is by intent; however, it makes the behavioural big data harder to interpret as purely trait based (Kosinski, Matz, Gosling, Popov, & Stillwell, 2015).

Therefore, anchoring our predictive modelling decisions in theory and remaining mindful of possible confounds, while examining the content validity of big data indicators of personality, is our first line of defence against training our models on such construct irrelevancies. Certainly, well-trained expert raters can attempt theoretical justification for the inclusion or exclusion of potential big data personality indicators, but they are inevitably limited to the potential indicators present in the data set, which might contain both construct irrelevancies and construct deficiencies. Coupled with the rational/conceptual approach of expert rating is the empirical approach of applying the nomological network (Cronbach & Meehl, 1955), meaning that perhaps we can 'back into' personality constructs by demonstrating sensible big data patterns of convergent and discriminant validity. Of course, such work is easier said than done, as all this presumes that machine learning algorithms and data at least lean towards being interpretable. Perhaps that leaning will become even stronger as data scientists understand that construct validity remains very important (Bleidorn & Hopwood, 2019; Tay, Woo, Hickman, & Saef, in press)—perhaps even more important than ever before given the nature of the data.

Of course, up until this point, our discussion of construct validation has focused on identifying big data indicators that reflect the Big Five and other established personality constructs. Beyond this application, however, there lies the idea, the potential, for advancing personality theory by introducing novel unique personality predictors that may reflect, extend, or even replace existing theories. For instance, conscientiousness is viewed as a personality trait, yet perhaps new technologies for measuring conscientiousness help to redefine the construct by measuring how it changes in its nature over time and how it predicts behaviour over time (e.g. as a process in school, Corker, Oswald, & Donnellan, 2012). On the other hand, there is also the strong caution and potential danger for complex predictive algorithms applied to messy big data sets to contribute to the jingle-jangle of personality constructs (Kelley, 1927; Thorndike, 1904), essentially by empirically repackaging and combining data (new wine) that may reflect established personality constructs (old bottles). In considering these promises and perils for novel construct definitions and measurement, note that the massiveness of big data combined with the opacity of machine learning could lead to innovative and personality-relevant variance going unnoticed if we are not careful. To avoid such confusion, it will be illuminating for research to continue to compare and contrast personality constructs (established versus novel), personality measures (traditional Likert scales versus adaptive item-response-theory-based tests versus big-data-based), and analysis tools [exploratory factor analysis versus confirmatory factor analysis (CFA) for clustering; linear regression versus structural equation modelling (SEM) versus machine learning algorithms for prediction].

The practical and scientific value of big data in personality measurement

For many, the most exciting aspect of big data in personality measurement lies in its many real-world applications. Interactive technologies allow for targeted intervention on human emotion, thought, and behaviour, and these dynamic data, in turn, can be related to personality, where a large body of research has demonstrated relationships between personality and health and well-being (Strickhouser, Zell, & Krizan, 2017). Research has also found relationships between personality and compliance with medical treatment (Umaki, Umaki, & Cobb, 2012) as well as prescription drug regimen adherence (Axelsson, Brink, Lundgren, & Lötvall, 2011; Christensen & Smith, 1995). Thus, machine learning algorithms may offer exciting opportunities to improve human health and well-being through personalized interventions, using big data to detect and influence personality, with the goal of forecasting and improving real-world health outcomes (e.g. within health risk evaluation, clinical treatment, and public health programmes; Chapman, Hampson, & Clarkin, 2014). In fact, a recent article demonstrated that adding personality measures to a machine learning medical risk model predicting log hazard rates of mild cognitive impairment significantly improved the model's overall prediction as measured by Nagelkerke's
pseudo R² and discrimination as measured by area under the receiver operating characteristic curve (Chapman, Lin, Roy, Benedict, & Lyness, 2019).

Robust out-of-sample prediction of potentially complex relationships is obviously a major strength of machine learning and big data, as we have noted; yet we should not always rely on those 'black box' predictions alone, because we usually want to know why prediction happens and not just that it happens. Thus, the trade-offs between interpretability and prediction must be understood and managed when deciding among machine learning and traditional analysis options. Fortunately, however, some recent attention in machine learning has turned to interpreting these black box models (Ribeiro et al., 2016a). Although there is debate about the appropriateness of the interpretation of such models, especially for high-stakes applications (Rudin, 2019), if such model interpretation can be achieved, it may be useful when attempting to apply psychometric evaluations of validity to these analytical techniques, as knowing what variables are driving predictions is essential to substantive understanding and discovery with big data personality measures. Although still in their infancy, machine learning interpretation techniques may hold promise for psychology researchers who wish to draw sound scientific inferences about personality from big data. One such tool, local interpretable model-agnostic explanations (Ribeiro, Singh, & Guestrin, 2016b), explains any black box model by approximating the predictions of the original model with a locally interpretable one (a local surrogate model) trained on perturbations of the original data (permuting or removing predictors in the model). Interpretability then can better emerge from the local region or regions of interest, a useful approach when personality researchers use more complex forms of machine learning.

Although we have mentioned some of the obvious ethical and privacy concerns surrounding big data personality measurement, what if people at risk of obesity, as identified by their personality (Sutin, Ferrucci, Zonderman, & Terracciano, 2011), could be detected and assisted by big data technologies (e.g. healthy food choice or exercise recommendations)? To the extent that personality is malleable, could interactive technologies influence the development or expression of personality traits or facets that contribute to successful weight loss in those seeking to lose weight (Soini, Mustajoki, Eriksson, & Lahti, 2018)? This example and others like it are not just academically interesting; they are important because healthcare outside the hospital involves self-management, a critical component of many technologies that inherently will be influenced by personality. By understanding personality, these health-oriented technologies can track human wellness and disease even more effectively (Erdmier, Hatcher, & Lee, 2016; Jeong, Bychkov, & Searson, 2019), and they can be tailored to be more responsive and useful. Of course, there are also many applications outside of the healthcare industry where personality-based interventions might improve people's quality of life at both the individual and societal levels. Indeed, one can imagine that the detection and/or manipulation of personality could be usefully exploited to create personalized interventions that improve many important outcomes, while realizing that more empirical work investigating this hope is desperately needed.

Fairness

Fairness issues are critical to consider when capturing personality data, applying machine learning, and then reporting and acting on analytic results. Although Internet access penetrates the majority of the US population and has been described as a 'basic utility for social inclusion' in technology-centric societies (Van Deursen & Van Dijk, 2019, p. 355), the type, speed, and consistency of Internet access in the world, or even in specific neighbourhoods, can certainly be uneven (Marler, 2018). In fact, decades of attention have been paid to the 'digital divide', or the general disparities in access to, usage of, and proficiency in Internet and communication technologies across many key demographics such as age (Van Volkom, Stapley, & Amaturo, 2014), gender (Ching, Basham, & Jang, 2005), race (Jackson et al., 2008), and socio-economic status (Huffman, 2018), as well as within many critical domains of human activity and endeavour, such as education (Rowsell, Morrell, & Alvermann, 2017), employment (Lindsay, 2005), and healthcare (Mackert, Mabry-Flynn, Champlin, Donovan, & Pounders, 2016). These differences in access to Internet and communication technologies serve as an important reminder for personality researchers to be sensitive to the representativeness of big data. Representativeness here means that using technology to understand personality in diverse populations requires sampling in a similarly diverse and representative manner, moving beyond populations that are westernized, educated, industrialized, rich, and democratic (WEIRD; Henrich, Heine, & Norenzayan, 2010).

Consequently, being fair in the era of big data begins with a sensitivity to critical differences in Internet accessibility and technical proficiency, as well as other regional, linguistic, and race/ethnicity demographics of concern to the researcher. These factors, and more, can influence how (or even whether) personality inferences can be made appropriately from big data. The key fairness question to be investigated here is as follows: can personality be inferred in a similar way across subgroups of interest? If personality constructs differ in essential ways across subgroups, then many fairness problems arise; for example, it would be challenging or even unfair to compare subgroups directly with one another, and criterion-related validity would mean different things for different subgroups.

Traditionally, this issue has been of concern, where psychometric judgements of construct similarity involve examining factor structure and measurement invariance across subgroups of interest (Chiorri, Marsh, Ubbiali, & Donati, 2016; Marsh et al., 2010). This traditional approach could be usefully extended to personality-relevant big data. In traditional measurement, psychometricians will often first undertake conceptual steps (e.g. reviewing item content for construct relevance and cultural sensitivity) and empirical steps (e.g. investigating measurement invariance and
predictive invariance), to determine whether personality measures tend to capture traits reliably and exhibit empirical relationships in the same way across subgroups (Millsap, 2011). Even though the goals of measurement and predictive invariance cannot be simultaneously fulfilled in the strictest mathematical sense (Millsap, 2007), it usually remains practically informative to determine how closely these goals are met.

To the extent there is strong evidence against measurement invariance, for instance, mean differences between subgroups become more driven by the quirks of the items than by the construct. And with evidence against predictive invariance, an overall regression line will systematically overpredict and underpredict for the target subgroups of interest, which should also be considered. Granted, the approach and application of invariance analyses will look different when based on big data and related algorithms than when based on traditional personality measures (e.g. applying some form of within-and-between group network analysis to personality items instead of a CFA). But the point is that without measurement invariance analyses of some sort, construct-level comparisons between subgroups on personality-relevant big data are compromised, as is a deeper understanding of the substantive nature of algorithmic bias (construct irrelevance) in big data. But if we do not even know what or how much of the observed variance is construct relevant (per the discussion earlier on reliability and validity), then we cannot begin to investigate measurement and predictive invariance.

Privacy

Last but hardly least, big data pose ethical challenges involving those who use the data to infer personality, as well as the people from whom the data are collected. Privacy concerns have been longstanding in the health sciences (e.g. Murdoch & Detsky, 2013) and are emerging in the individual privacy domains (e.g. General Data Protection Regulation in Europe; Family Educational Rights and Privacy Act, Fair Credit Reporting Act, and state laws in the US), but these concerns are relatively new to personality research (although for guidance, see Kosinski et al., 2015). Ethical sensitivity should certainly accompany all psychological big data research (Mansour, 2016), especially considering that only a small subset of characteristics may be necessary to re-identify individuals within an anonymized data set (Rocher, Hendrickx, & de Montjoye, 2019).

Specific to personality research, note that the presence and type of big data technologies that are used will likely change people's behaviour meaningfully, as the perceptions and reality of compromised privacy and anonymity with regard to big data continue to grow, and as more diverse data and complex analytic tools come together more often, more quickly, and on a broader scale. As people become aware that their data at home, at work, at school, and online are being combined and mined, they may wish to share their data when the perceived outcome is positive (e.g. better house purchase and better medical advice), and wish to keep their data confidential when the outcome is likely to be neutral or negative (e.g. when data and algorithmic predictions are being sold to the highest bidder or when negative information might undermine potential job prospects). Or, being inured to big data privacy issues, they might not change their behaviour at all under the assumption that the privacy of their data always runs the risk of compromise.

MEETING THE CHALLENGES OF BIG DATA AND MACHINE LEARNING IN PERSONALITY RESEARCH

Researchers outside of the field of psychology have experience using a wide variety of data analytic techniques for modelling complex systems that might prove valuable in measuring personality with big data (see Gerlach et al., 2018, for an example of a collaboration between engineers, psychologists, and physicists to identify personality types using big data). Overall, it seems clear that personality psychologists are absolutely essential to the work of data scientists, if they are not data scientists themselves, because big data and machine learning algorithms critically depend on what traditional research depends on: framing the personality modelling problem appropriately (e.g. clustering versus prediction) and selecting construct-relevant and reliable measures and outcomes (e.g. personality and other psychological characteristics; Flake & Fried, 2019). Only then can the research and results be not only analytically sound but also context-appropriate and informative towards meeting personality research and practice goals (Lodge, Alhadad, Lewis, & Gašević, 2017).

Fortunately, personality researchers bring valuable expertise and training that reflect a unique set of skills that we believe will prove of increasing importance towards interpretable, practical, ethical, and defensible big data analyses: e.g. an understanding of personality constructs, the psychometrics training to develop and evaluate measures of those constructs, and study design and ethics training relevant to the context of social sciences research involving human subjects. With its long history of concern for ethical principles such as privacy, confidentiality, and informed consent, psychologists in general are particularly well-suited to develop big data personality assessments and to assist in the development of guidelines and laws that help to determine the ethical applications of big data. Not only have psychometric and ethical concerns been a driving force for many psychological researchers who study topics such as ethics in psychological testing, diversity and inclusion, and adverse impact; psychologists have also helped to codify the findings of these bodies of research into practical guidelines, as they have with the development of the Standards for Educational and Psychological Testing [American Educational Research Association (AERA), American Psychological Association (APA), & National Council on Measurement in Education (NCME), 2014], the Uniform Guidelines on Employee Selection Procedures (Equal Employment Opportunity Commission, 1978), and the Principles for the Validation and Use of Personnel Selection Procedures (Society for Industrial
and Organizational Psychology, 2019). Taken together, per- Technology: Understanding the digital technologies that
sonality researchers are in a unique position to understand, give rise to big data
learn from—and teach others—the important and hard les-
5. Collaborating with multidisciplinary colleagues who have
sons involving the balance or trade‐off between the psycho-
expertise and experience in the implications of using
logical informativeness of big data personality approaches
different digital devices, infrastructures, and software to
and the ethical challenges when using such personality‐
collect, manage, store, or otherwise curate big data
relevant technologies.
collections.
However, it is important for a personality researcher to
6. Researching the reliability of big data collections of fea-
understand that when sharing these traditions while engaging
tures across different technology platforms from which
with big data and machine learning, their discipline often will
the same data might be collected (e.g. laptop versus cell
join many others at the table that have their own expertise,
phone; different operating systems).
goals, and unique priorities (e.g. data science, applied
statistics, healthcare, economics, political science, and
human resources management). Thus, a multidisciplinary Algorithm: Adopting and evaluating algorithmic
approach is both challenging and heartening, because more approaches to personality measurement
substantive conversations and context around big data
and machine learning algorithms can be had, giving 7. Collaborating with multidisciplinary colleagues who can
greater meaning and closer criticism around the prediction‐ inform personality research, who have expertise and
versus‐explanation tradeoffs in machine learning (Yarkoni experience in selecting and justifying the analytical
& Westfall, 2017). For personality researchers, prediction approach chosen to analyse data, for example, deciding
and explanation are part of an iterative process designed to between different prediction models, training and tuning
deepen our understanding of the psychological nature of those models, and accurately interpreting their results.
personality and the corresponding habits of human thought, 8. Emphasizing analytic procedures that avoid overfitting
attitudes, and behaviour. models to personality data (e.g. k‐fold cross‐validation
and bootstrapping), which is an emphasis in machine
learning, but can be applied equally usefully in tradi-
FUTURE DIRECTIONS FOR BIG DATA tional modelling.
PERSONALITY RESEARCH 9. Investigating predictive patterns involving personality
measures: statically, in terms of convergent, discrimi-
As we enter this exciting new age of big data, we encourage nant, and criterion‐related validity; and dynamically,
personality researchers to reflect on 14 important aspects of where mediational and multilevel (cross‐level) relation-
big data personality analyses, to create more synergies (and ships can be tested with longitudinal big data.
fewer competitions) between prediction and explanation in 10. Evaluating the nature of big data collected over time—as
the personality domain. well as results from the clustering and prediction algo-
rithms applied dynamically to those data over time—
such that inferences about both populations and person-
Information: Understanding how to approach big data ality can be drawn from big data, and the scope of the
1. Collaborating with multidisciplinary colleagues in quanti- generalizability and malleability of personality can be
tative disciplines (e.g. computer scientists, data scientists, further broadened, understood, and advanced.
applied statisticians, and other social scientists), who have
expertise and experience in contending with big data
Impact: Fairness and ethics
having multiple formats, along with messiness and
missingness, and applying algorithms for clustering and 11. Striving to ensure that personality measurement is fair
dimensionality reduction. across demographic subgroups (e.g. race/ethnicity, gen-
2. Researching the reliability of big data collections of fea- der, and culture), while realizing that fairness is a con-
tures (e.g. extracted social media themes) across time, cept that encompasses broad issues such as cultural
samples, and settings, including collaborating to develop sensitivity, conflicting definitions such as equity versus
appropriate theory and statistical models that estimate merit, and equal opportunities to provide data.
those reliabilities in big data contexts. 12. Detecting and reducing personality‐related biases and
3. Establishing the construct relevance of big data personal- other irrelevancies detected by algorithms. Bias is a sta-
ity indicators being used via machine learning analyses tistical concept, referring to empirically reliable subgroup
(e.g. interpretation of clusters in k‐means clustering or differences in personality that are due to construct irrele-
neural networks) as it can be compared with traditional re- vancies in the data, the models, or their combination.
liability analysis (e.g. interpretation of factors and factor 13. Ensuring that personality data collection (e.g. data
loadings in a CFA). privacy, anonymity, and security) and data use (e.g.
4. Striving for substantive interpretability of personality analysis and interpretation) are sensitive to and consis-
data; for example, develop theory and conduct empirical tent with updated professional, legal, and ethical
studies aimed at exploring and understanding the content standards (e.g. AERA, APA, NCME, 2014; Equal Em-
validity of big data personality predictors. ployment Opportunity Commission, 1978).
14. Transparently reporting the process of personality data collection and analysis, disclosing and reflecting on any key limitations alongside any key benefits. Open science practices can critically assist in improving transparency (e.g. preregistration, sharing relevant measures and protocols, and sharing the variable codebook and code for analyses, if not the data set itself).

CONCLUSION

This paper has outlined some of the promises and challenges for personality researchers interested in developing and collecting big data from new digital technologies, coupled with the machine learning algorithms applied to big data (and most any psychological data). For over a century, psychological researchers have accumulated wisdom from hard-won lessons concerning the measure development, analysis, and evaluation process: generating item content, refining measures psychometrically, and examining correlational patterns with other measures, manipulations, and real-world outcomes. This wisdom is hardly outdated in the big data era (if anything, it is even more important), and much of it is codified in professional standards of measurement such as the Standards for Educational and Psychological Testing (AERA, APA, NCME, 2014), now in its third edition. Personality researchers can judge for themselves, as they engage in collecting personality-related big data with new (intensive, unobtrusive) data-collection technologies and sophisticated clustering and predictive algorithms, whether they will likely benefit from revisiting and making use of our psychometric concepts and tools.

Concluding with an even bigger picture, we would argue that big data not only has much to offer personality psychology, but also vice versa: it is critical for the expertise, accomplishments, and history of personality psychology to shape how the big data and analytics community can more effectively and efficiently forge useful pathways towards achieving new and rapid insights about personality (habits of human attitudes, thoughts, and behaviour). To that end, personality psychologists must continue to build, and play a part in, multidisciplinary communities of interest focused on future developments, applications, and evaluations of big data and machine learning. Trends seem to be moving in this direction; perhaps we could apply machine learning algorithms to big data on personality researchers themselves to accelerate that trend.

REFERENCES

Adjerid, I., & Kelley, K. (2018). Big data in psychology: A framework for research advancement. American Psychologist, 73, 899–917. https://doi.org/10.1037/amp0000190
American Educational Research Association, American Psychological Association, & National Council on Measurement in Education (2014). Standards for educational and psychological testing. Washington, DC: American Educational Research Association.
Aroganam, G., Manivannan, N., & Harrison, D. (2019). Review on wearable technology sensors used in consumer sport applications. Sensors, 19, 1–26. https://doi.org/10.3390/s19091983
Arthur, W., Doverspike, D., Muñoz, G. J., Taylor, J. E., & Carr, A. E. (2014). The use of mobile devices in high-stakes remotely delivered assessments and testing. International Journal of Selection and Assessment, 22, 113–123. https://doi.org/10.1111/ijsa.12062
Arthur, W., Glaze, R. M., Villado, A. J., & Taylor, J. E. (2010). The magnitude and extent of cheating and response distortion effects on unproctored internet-based tests of cognitive ability and personality. International Journal of Selection and Assessment, 18, 1–16. https://doi.org/10.1111/j.1468-2389.2010.00476.x
Arthur, W., Keiser, N. L., & Doverspike, D. (2018). An information-processing-based conceptual framework of the effects of unproctored internet-based testing devices on scores on employment-related assessments and tests. Human Performance, 31, 1–32. https://doi.org/10.1080/08959285.2017.1403441
Axelsson, M., Brink, E., Lundgren, J., & Lötvall, J. (2011). The influence of personality traits on reported adherence to medication in individuals with chronic disease: An epidemiological study in West Sweden. PLoS ONE, 6, 1–7. https://doi.org/10.1371/journal.pone.0018241
Azucar, D., Marengo, D., & Settanni, M. (2018). Predicting the Big 5 personality traits from digital footprints on social media: A meta-analysis. Personality and Individual Differences, 124, 150–159. https://doi.org/10.1016/j.paid.2017.12.018
Bachrach, Y., Kosinski, M., Graepel, T., Kohli, P., & Stillwell, D. (2012). Personality and patterns of Facebook usage. Proceedings of the 4th Annual ACM Web Science Conference, 24–32. https://doi.org/10.1145/2380718.2380722
Barocas, S., & Selbst, A. D. (2016). Big data's disparate impact. SSRN Electronic Journal, 104, 671–732. https://doi.org/10.2139/ssrn.2477899
Beck, E. D., & Jackson, J. J. (2019a). Within-person variability. PsyArXiv. Advance online publication. https://doi.org/10.31234/osf.io/kavbp
Beck, E. D., & Jackson, J. J. (2019b). Consistency and change in idiographic personality: A longitudinal ESM network study. Journal of Personality and Social Psychology, 118, 1080–1100. https://doi.org/10.1037/pspp0000249
Becker, B. J., & Aloe, A. M. (2008). A framework for generalization in meta-analysis: Medical and social science examples [invited presentation]. The 16th Merck-Temple conference on biostatistics, Philadelphia, PA.
Beckmann, N., & Wood, R. E. (2017). Editorial: Dynamic personality science. Integrating between-person stability and within-person change. Frontiers in Psychology, 8, 1–7. https://doi.org/10.3389/fpsyg.2017.01486
Bleidorn, W., & Hopwood, C. J. (2019). Using machine learning to advance personality assessment and theory. Personality and Social Psychology Review, 23, 190–203. https://doi.org/10.1177/1088868318772990
Bleidorn, W., Hopwood, C. J., & Lucas, R. E. (2018). Life events and personality trait change. Journal of Personality, 86, 83–96. https://doi.org/10.1111/jopy.12286
Bleidorn, W., Hopwood, C. J., & Wright, A. G. (2017). Using big data to advance personality theory. Current Opinion in Behavioral Sciences, 18, 79–82. https://doi.org/10.1016/j.cobeha.2017.08.004
Borsboom, D., Mellenbergh, G. J., & Van Heerden, J. (2004). The concept of validity. Psychological Review, 111, 1061–1071. https://doi.org/10.1037/0033-295X.111.4.1061
Campbell, D. T., & Fiske, D. W. (1959). Convergent and discriminant validation by the multitrait-multimethod matrix. Psychological Bulletin, 56, 81–105. https://doi.org/10.1037/h0046016
Cascio, W., Boudreau, J., & Fink, A. (2019). Investing in people: Financial impact of human resource initiatives (3rd ed.). Alexandria, VA: Society for Human Resource Management.
Cattell, R. B. (1946). Personality structure and measurement. I. The operational determination of trait unities. British Journal of Psychology, 36, 88–103. https://doi.org/10.1111/j.2044-8295.1946.tb01110.x
Chapman, B. P., Hampson, S., & Clarkin, J. (2014). Personality-informed interventions for healthy aging: Conclusions from a National Institute on Aging work group. Developmental Psychology, 50, 1426–1441. https://doi.org/10.1037/a0034135
Chapman, B. P., Lin, F., Roy, S., Benedict, R. H. B., & Lyness, J. M. (2019). Health risk prediction models incorporating personality data: Motivation, challenges, and illustration. Personality Disorders: Theory, Research, and Treatment, 10, 46–58. https://doi.org/10.1037/per0000300
Chen, E. E., & Wojcik, S. P. (2016). A practical guide to big data research in psychology. Psychological Methods, 21, 458–474. https://doi.org/10.1037/met0000111
Chester, D. S., & Lasko, E. N. (in press). Construct validation of experimental manipulations in social psychology: Current practices and recommendations for the future. Perspectives on Psychological Science.
Cheung, M. W.-L., & Jak, S. (2016). Analyzing big data in psychology: A split/analyze/meta-analyze approach. Frontiers in Psychology, 7, 1–13. https://doi.org/10.3389/fpsyg.2016.00738
Ching, C. C., Basham, J. D., & Jang, E. (2005). The legacy of the digital divide: Gender, socioeconomic status, and early exposure as predictors of full-spectrum technology use among young adults. Urban Education, 40, 394–411. https://doi.org/10.1177/0042085905276389
Chiorri, C., Marsh, H. W., Ubbiali, A., & Donati, D. (2016). Testing the factor structure and measurement invariance across gender of the Big Five Inventory through exploratory structural equation modeling. Journal of Personality Assessment, 98, 88–99. https://doi.org/10.1080/00223891.2015.1035381
Chittaranjan, G., Blom, J., & Gatica-Perez, D. (2013). Mining large-scale smartphone data for personality studies. Personal and Ubiquitous Computing, 17, 433–450. https://doi.org/10.1007/s00779-011-0490-1
Christensen, A. J., & Smith, T. W. (1995). Personality and patient adherence: Correlates of the five-factor model in renal dialysis. Journal of Behavioral Medicine, 18, 305–313. https://doi.org/10.1007/BF01857875
Christensen, A. P., Golino, H., & Silvia, P. J. (2020). A psychometric network perspective on the validity and validation of personality trait questionnaires. European Journal of Personality. Advance online publication. https://doi.org/10.1002/per.2265
Corker, K. S., Oswald, F. L., & Donnellan, M. B. (2012). Conscientiousness in the classroom: A process explanation. Journal of Personality, 80, 995–1028. https://doi.org/10.1111/j.1467-6494.2011.00750.x
Costantini, G., Richetin, J., Preti, E., Casini, E., Epskamp, S., & Perugini, M. (2019). Stability and variability of personality networks. A tutorial on recent developments in network psychometrics. Personality and Individual Differences, 136, 68–78. https://doi.org/10.1016/j.paid.2017.06.011
Cronbach, L. J. (1982). Designing evaluations of educational and social programs. San Francisco, CA: Jossey-Bass. https://doi.org/10.1177/109821408300400210
Cronbach, L. J., Gleser, G. C., Nanda, H., & Rajaratnam, N. (1972). The dependability of behavioral measurements: Theory of generalizability for scores and profiles. New York, NY: John Wiley & Sons. https://doi.org/10.1126/science.178.4067.1275
Cronbach, L. J., & Meehl, P. E. (1955). Construct validity in psychological tests. Psychological Bulletin, 52, 281–302. https://doi.org/10.1037/h0040957
Cronbach, L. J., Rajaratnam, N., & Gleser, G. C. (1963). Theory of generalizability: A liberalization of reliability theory. British Journal of Statistical Psychology, 16, 137–163. https://doi.org/10.1111/j.2044-8317.1963.tb00206.x
Csikszentmihalyi, M., & Larson, R. (2014). Validity and reliability of the experience-sampling method. In Flow and the foundations of positive psychology (pp. 35–54). Dordrecht, Netherlands: Springer.
De Mauro, A., Greco, M., & Grimaldi, M. (2015). What is big data? A consensual definition and a review of key research topics. AIP Conference Proceedings, 1644, 97–104. https://doi.org/10.1063/1.4907823
Domingos, P. (2012). A few useful things to know about machine learning. Communications of the ACM, 55, 78–87. https://doi.org/10.1145/2347736.2347755
Donnellan, M. B., & Robins, R. W. (2010). Resilient, overcontrolled, and undercontrolled personality types: Issues and controversies. Social and Personality Psychology Compass, 4, 1070–1083. https://doi.org/10.1111/j.1751-9004.2010.00313.x
Epskamp, S., Rhemtulla, M., & Borsboom, D. (2017). Generalized network psychometrics: Combining network and latent variable models. Psychometrika, 82, 904–927. https://doi.org/10.1007/s11336-017-9557-x
Epstein, S. (1983). Aggregation and beyond: Some basic issues on the prediction of behavior. Journal of Personality, 51, 360–392. https://doi.org/10.1111/j.1467-6494.1983.tb00338.x
Equal Employment Opportunity Commission (1978). Uniform guidelines on employee selection procedures. Federal Register, 43, 38295–38309.
Erdmier, C., Hatcher, J., & Lee, M. (2016). Wearable device implications in the healthcare industry. Journal of Medical Engineering & Technology, 40, 141–148. https://doi.org/10.3109/03091902.2016.1153738
Fan, J., & Li, R. (2006). Statistical challenges with high dimensionality: Feature selection in knowledge discovery. ArXiv:Math. Retrieved from https://arxiv.org/abs/math/0602133
Fiedler, K. (2018). The creative cycle and the growth of psychological science. Perspectives on Psychological Science, 13, 433–438. https://doi.org/10.1177/1745691617745651
Flake, J. K., & Fried, E. I. (2019). Measurement schmeasurement: Questionable measurement practices and how to avoid them. PsyArXiv. Advance online publication. https://doi.org/10.31234/osf.io/hs7wm
Fleeson, W. (2004). Moving personality beyond the person-situation debate: The challenge and the opportunity of within-person variability. Current Directions in Psychological Science, 13, 83–87. https://doi.org/10.1111/j.0963-7214.2004.00280.x
Foster, K., Schuh, S., & Zhang, H. (2013). The 2010 survey of consumer payment choice. Research Reviews, 20, 113–118. https://doi.org/10.2139/ssrn.2564172
Friedman, J., Hastie, T., & Tibshirani, R. (2008). Sparse inverse covariance estimation with the graphical lasso. Biostatistics, 9, 432–441. https://doi.org/10.1093/biostatistics/kxm045
Gandomi, A., & Haider, M. (2015). Beyond the hype: Big data concepts, methods, and analytics. International Journal of Information Management, 35, 137–144. https://doi.org/10.1016/j.ijinfomgt.2014.10.007
Gerlach, M., Farb, B., Revelle, W., & Amaral, L. A. N. (2018). A robust data-driven approach identifies four personality types across four large data sets. Nature Human Behaviour, 2, 735–742. https://doi.org/10.1038/s41562-018-0419-z
Gladstone, J. J., Matz, S. C., & Lemaire, A. (2019). Can psychological traits be inferred from spending? Evidence from transaction data. Psychological Science, 30, 1087–1096. https://doi.org/10.1177/0956797619849435
Golbeck, J., Robles, C., Edmondson, M., & Turner, K. (2011). Predicting personality from Twitter. 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third International Conference on Social Computing, 149–156. https://doi.org/10.1109/PASSAT/SocialCom.2011.33
Golbeck, J., Robles, C., & Turner, K. (2011). Predicting personality with social media. CHI EA'11: CHI'11 Extended Abstracts on Human Factors in Computing Systems, 253–262. https://doi.org/10.1145/1979742.1979614
Hamaker, E. L., Nesselroade, J. R., & Molenaar, P. C. (2007). The integrated trait-state model. Journal of Research in Personality, 41, 295–315. https://doi.org/10.1016/j.jrp.2006.04.003
Haqiqatkhah, M. M., & Tuerlinckx, F. (2019). Are we on the same page? Latent variable modeling suggests different nomothetic and idiographic factor structures for momentary affect. PsyArXiv. Advance online publication. https://doi.org/10.31234/osf.io/6wsgd
Harari, G. M., Lane, N. D., Wang, R., Crosier, B. S., Campbell, A. T., & Gosling, S. D. (2016). Using smartphones to collect behavioral data in psychological science: Opportunities, practical considerations, and challenges. Perspectives on Psychological Science, 11, 838–854. https://doi.org/10.1177/1745691616650285
Hawkins, D. M. (2004). The problem of overfitting. Journal of Chemical Information and Computer Sciences, 44, 1–12. https://doi.org/10.1021/ci0342472
Henrich, J., Heine, S. J., & Norenzayan, A. (2010). The weirdest people in the world? Behavioral and Brain Sciences, 33, 61–83. https://doi.org/10.1017/S0140525X0999152X
Huffman, S. (2018). The digital divide revisited: What is next? Education, 138, 239–246. Retrieved from https://search.ebscohost.com
Iacobelli, F., Gill, A. J., Nowson, S., & Oberlander, J. (2011). Large scale personality classification of bloggers. In S. D'Mello, A. Graesser, B. Schuller, & J.-C. Martin (Eds.), Affective computing and intelligent interaction (pp. 568–577). https://doi.org/10.1007/978-3-642-24571-8_71
Ihsan, Z., & Furnham, A. (2018). The new technologies in personality assessment: A review. Consulting Psychology Journal: Practice and Research, 70, 147–166. https://doi.org/10.1037/cpb0000106
Ilies, R., & Judge, T. A. (2002). Understanding the dynamic relationships among personality, mood, and job satisfaction: A field experience sampling study. Organizational Behavior and Human Decision Processes, 89, 1119–1139. https://doi.org/10.1016/S0749-5978(02)00018-3
Inoubli, W., Aridhi, S., Mezni, H., Maddouri, M., & Nguifo, E. M. (2018). An experimental survey on big data frameworks. Future Generation Computer Systems, 86, 546–564. https://doi.org/10.1016/j.future.2018.04.032
Ippel, L., Kaptein, M. C., & Vermunt, J. K. (2019). Estimating multilevel models on data streams. Psychometrika, 84, 41–64. https://doi.org/10.1007/s11336-018-09656-z
Jackson, L. A., Zhao, Y., Kolenic, A. III, Fitzgerald, H. E., Harold, R., & Von Eye, A. (2008). Race, gender, and information technology use: The new digital divide. Cyberpsychology & Behavior, 11, 437–442. https://doi.org/10.1089/cpb.2007.0157
James, G., Witten, D., Hastie, T., & Tibshirani, R. (2013). An introduction to statistical learning. New York, NY: Springer. https://doi.org/10.1007/978-1-4614-7138-7
Jeong, I. C., Bychkov, D., & Searson, P. C. (2019). Wearable devices for precision medicine and health state monitoring. IEEE Transactions on Biomedical Engineering, 66, 1242–1258. https://doi.org/10.1109/TBME.2018.2871638
Judge, T. A., Simon, L. S., Hurst, C., & Kelley, K. (2014). What I experienced yesterday is who I am today: Relationship of work motivations and behaviors to within-individual variation in the five-factor model of personality. Journal of Applied Psychology, 99, 199–221. https://doi.org/10.1037/a0034485
Kelley, T. L. (1927). Interpretation of educational measurements. New York, NY: World Book Company.
Kosinski, M., Matz, S. C., Gosling, S. D., Popov, V., & Stillwell, D. (2015). Facebook as a research tool for the social sciences: Opportunities, challenges, ethical considerations, and practical guidelines. American Psychologist, 70, 543–556. https://doi.org/10.1037/a0039210
Kosinski, M., Wang, Y., Lakkaraju, H., & Leskovec, J. (2016). Mining big data to extract patterns and predict real-life outcomes. Psychological Methods, 21, 493–506. https://doi.org/10.1037/met0000105
Kuhn, M. (2019). caret: Classification and regression training. R package version 6.0–84. https://CRAN.R-project.org/package=caret
Landers, R. N., Brusso, R. C., Cavanaugh, K. J., & Collmus, A. B. (2016). A primer on theory-driven web scraping: Automatic extraction of big data from the internet for use in psychological research. Psychological Methods, 21, 475–492. https://doi.org/10.1037/met0000081
Laney, D. (2001, February 6). 3-D Data management: Controlling data volume, velocity, and variety. Application Delivery Strategies by META Group Inc. Retrieved from https://blogs.gartner.com/doug-laney/files/2012/01/ad949-3D-Data-Management-Controlling-Data-Volume-Velocity-and-Variety.pdf
Lenhart, A., Duggan, M., Perrin, A., Steppler, R., Rainie, L., & Parker, K. (2015). Teens, social media, & technology overview 2015: Smartphones facilitate shifts in communication landscape for teens (p. 48). Retrieved from https://www.pewresearch.org/wp-content/uploads/sites/9/2015/04/PI_TeensandTech_Update2015_0409151.pdf
Lindsay, C. (2005). Employability, services for unemployed job seekers and the digital divide. Urban Studies, 42, 325–339. https://doi.org/10.1080/0042098042000316173
Lodge, J. M., Alhadad, S. S. J., Lewis, M. J., & Gašević, D. (2017). Inferring learning from big data: The importance of a transdisciplinary and multidimensional approach. Technology, Knowledge and Learning, 22, 385–400. https://doi.org/10.1007/s10758-017-9330-3
Lüdtke, O., Roberts, B. W., Trautwein, U., & Nagy, G. (2011). A random walk down university avenue: Life paths, life events, and personality trait change at the transition to university life. Journal of Personality and Social Psychology, 101, 620–637. https://doi.org/10.1037/a0023743
Mackert, M., Mabry-Flynn, A., Champlin, S., Donovan, E. E., & Pounders, K. (2016). Health literacy and health information technology adoption: The potential for a new digital divide. Journal of Medical Internet Research, 18, 211–226. https://doi.org/10.2196/jmir.6349
Mansour, R. F. (2016). Understanding how big data leads to social networking vulnerability. Computers in Human Behavior, 57, 348–351. https://doi.org/10.1016/j.chb.2015.12.055
Mardonova, M., & Choi, Y. (2018). Review of wearable device technology and its applications to the mining industry. Energies, 11, 1–14. https://doi.org/10.3390/en11030547
Marler, W. (2018). Mobile phones and inequality: Findings, trends, and future directions. New Media & Society, 20, 3498–3520. https://doi.org/10.1177/1461444818765154
Marsh, H. W., Lüdtke, O., Muthén, B., Asparouhov, T., Morin, A. J. S., Trautwein, U., & Nagengast, B. (2010). A new look at the big five factor structure through exploratory structural equation modeling. Psychological Assessment, 22, 471–491. https://doi.org/10.1037/a0019227
McCrae, R. R., & John, O. P. (1992). An introduction to the five-factor model and its applications. Journal of Personality, 60, 175–215. https://doi.org/10.1111/j.1467-6494.1992.tb00970.x
Meade, A. W., Michels, L. C., & Lautenschlager, G. J. (2007). Are internet and paper-and-pencil personality tests truly comparable?: An experimental design measurement invariance study. Organizational Research Methods, 10, 322–345. https://doi.org/10.1177/1094428106289393
Meers, K., Dejonckheere, E., Kalokerinos, E. K., Rummens, K., & Kuppens, P. (2019, June 12). mobileQ: A free user-friendly application for collecting experience sampling data. https://doi.org/10.31234/osf.io/ynj7e
Millsap, R. E. (2007). Invariance in measurement and prediction revisited. Psychometrika, 72, 461–473. https://doi.org/10.1007/s11336-007-9039-7
Millsap, R. E. (2011). Statistical approaches to measurement invariance. New York, NY: Taylor & Francis.
Minbashian, A., Wood, R. E., & Beckmann, N. (2010). Task-contingent conscientiousness as a unit of personality at work. Journal of Applied Psychology, 95, 793–806. https://doi.org/10.1037/a0020016
Molenaar, P. C., & Campbell, C. G. (2009). The new person-specific paradigm in psychology. Current Directions in Psychological Science, 18, 112–117. https://doi.org/10.1111/j.1467-8721.2009.01619.x
Mønsted, B., Mollgaard, A., & Mathiesen, J. (2018). Phone-based metric as a predictor for basic personality traits. Journal of Research in Personality, 74, 16–22. https://doi.org/10.1016/j.jrp.2017.12.004
Murdoch, T. B., & Detsky, A. S. (2013). The inevitable application of big data to health care. JAMA, 309, 1351–1352. https://doi.org/10.1001/jama.2013.393
Nosek, B. A., & Errington, T. M. (2020). What is replication? PLoS Biology, 18, 1–8. https://doi.org/10.1371/journal.pbio.3000691
Ones, D. S., & Wiernik, B. M. (2018, October 10). On "new" personality types. Retrieved from https://www.siop.org/Research-Publications/Items-of-Interest/ArtMID/19366/ArticleID/1698/On-%E2%80%9CNew%E2%80%9D-Personality-Types
Perrin, A., & Anderson, M. (2019, April 10). Share of U.S. adults using social media, including Facebook, is mostly unchanged since 2018. Retrieved May 27, 2019, from Pew Research Center website: https://www.pewresearch.org/fact-tank/2019/04/10/share-of-u-s-adults-using-social-media-including-facebook-is-mostly-unchanged-since-2018/
Quercia, D., Kosinski, M., Stillwell, D., & Crowcroft, J. (2011). Our Twitter profiles, our selves: Predicting personality with Twitter. 2011 IEEE Third International Conference on Privacy, Security, Risk and Trust and 2011 IEEE Third International Conference on Social Computing, 180–185. https://doi.org/10.1109/PASSAT/SocialCom.2011.26
Rentfrow, P. J., & Gosling, S. D. (2003). The do re mi's of everyday life: The structure and personality correlates of music preferences. Journal of Personality and Social Psychology, 84, 1236–1254. https://doi.org/10.1037/0022-3514.84.6.1236
Ribeiro, M. T., Singh, S., & Guestrin, C. (2016a). Model-agnostic interpretability of machine learning. In B. Kim, D. M. Malioutov,
Schwartz, H. A., Eichstaedt, J. C., Kern, M. L., Dziurzynski, L., Ramones, S. M., Agrawal, M., … Ungar, L. H. (2013). Personality, gender, and age in the language of social media: The open-vocabulary approach. PLoS ONE, 8, 1–16. https://doi.org/10.1371/journal.pone.0073791
Settanni, M., Azucar, D., & Marengo, D. (2018). Predicting individual characteristics from digital traces on social media: A meta-analysis. CyberPsychology, Behavior & Social Networking, 21, 217–228. https://doi.org/10.1089/cyber.2017.0384
Simons, D. J., Shoda, Y., & Lindsay, D. S. (2017). Constraints on generality (COG): A proposed addition to all empirical papers. Perspectives on Psychological Science, 12, 1123–1128. https://doi.org/10.1177/1745691617708630
Society for Industrial and Organizational Psychology (2019). Principles for the validation and use of personnel selection procedures. Industrial and Organizational Psychology, 11, 1–97. https://doi.org/10.1017/iop.2018.195
Soini, S., Mustajoki, P., Eriksson, J. G., & Lahti, J. (2018). Personality traits associated with weight maintenance among successful weight losers. American Journal of Health Behavior, 42, 78–84. https://doi.org/10.5993/AJHB.42.6.8
Stillwell, D. J., & Kosinski, M. (2020, May 1). myPersonality project website. Retrieved from http://mypersonality.org/
Storey, V. C., & Song, I.-Y. (2017). Big data technologies and management: What conceptual modeling can do. Data & Knowledge Engineering, 108, 50–67. https://doi.org/10.1016/j.datak.2017.01.001
Strickhouser, J., Zell, E., & Krizan, Z. (2017). Does personality predict health and well-being? A metasynthesis. Health Psychology, 36, 797–810. https://doi.org/10.1037/hea0000475
Sumner, C., Byers, A., Boochever, R., & Park, G. J. (2012). Predicting dark triad personality traits from Twitter usage and a linguistic analysis of tweets. 2012 11th International Conference on Machine Learning and Applications, 2, 386–393. https://doi.org/10.1109/ICMLA.2012.218
Sumner, C., Byers, A., & Shearing, M. (2011). Determining personality traits & privacy concerns from Facebook activity. Black Hat Briefings, 11, 197–221.
& K. R. Varshney (Eds.), Proceedings of the 2016 ICML Work- Sutin, A. R., Ferrucci, L., Zonderman, A. B., & Terracciano, A.
shop on Human Interpretability in Machine Learning (2011). Personality and obesity across the adult life span. Journal
(pp. 91–95). https://doi.org/10.1145/2939672.2939778 of Personality and Social Psychology, 101, 579–592. https://doi.
Ribeiro, M. T., Singh, S., & Guestrin, C. (2016b). “Why should I org/10.1037/a0024286
trust you?”: Explaining the predictions of any classifier. Proceed- Tay, L., Woo, S. E., Hickman, L., & Saef, R. (in press). Psychomet-
ings of the 22nd ACM SIGKDD International Conference on ric and validity issues in machine learning approaches to person-
Knowledge Discovery and Data Mining, 1135–1144. https:// ality assessment: A focus on social media text mining. European
doi.org/10.1145/2939672.2939778 Journal of Personality. https://doi.org/10.1002/per.2290
Roberts, B. W., Kuncel, N. R., Shiner, R., Caspi, A., & Goldberg, L. Thorndike, E. L. (1904). An introduction to the theory of mental and
R. (2007). The power of personality: The comparative validity of social measurements. New York, NY: Columbia University
personality traits, socioeconomic status, and cognitive ability for Press.
predicting important life outcomes. Perspectives on Psychologi- Tibshirani, R. (1996). Regression shrinkage and selection via the
cal Science, 2, 313–345. https://doi.org/10.1111/j.1745- lasso. Journal of the Royal Statistical Society. Series B Methodo-
6916.2007.00047.x logical, 58, 267–288. https://doi.org/10.1111/j.2517-6161.1996.
Rocher, L., Hendrickx, J. M., & de Montjoye, Y.‐A. (2019). Esti- tb02080.x
mating the success of re‐identifications in incomplete datasets Tskhay, K. O., & Rule, N. O. (2014). Perceptions of personality in
using generative models. Nature Communications, 10, 1–9. text‐based media and OSN: A meta‐analysis. Journal of Research
https://doi.org/10.1038/s41467-019-10933-3 in Personality, 49, 25–30. https://doi.org/10.1016/j.
Roethlisberger, F. J., & Dickson, W. J. (1939). Management and the jrp.2013.12.004
worker. Oxford, England: Harvard University Press. Umaki, T. M., Umaki, M. R., & Cobb, C. M. (2012). The psychol-
Rowsell, J., Morrell, E., & Alvermann, D. E. (2017). Confronting ogy of patient compliance: A focused review of the literature.
the digital divide: Debunking brave new world discourses. Read- Journal of Periodontology, 83, 395–400. https://doi.org/
ing Teacher, 71, 157–165. https://doi.org/10.1002/trtr.1603 10.1902/jop.2011.110344
Rudin, C. (2019). Stop explaining black box machine learning Van Deursen, A. J., & Van Dijk, J. A. (2019). The first‐level digital
models for high stakes decisions and use interpretable models in- divide shifts from inequalities in physical access to inequalities in
stead. Nature Machine Intelligence, 1, 206–215. Retrieved from. material access. New Media & Society, 21, 354–375. https://doi.
https://arxiv.org/abs/1811.10154v3 org/10.1177/1461444818797082
Salgado, J. F., & Moscoso, S. (2003). Internet‐based personality Van Volkom, M., Stapley, J. C., & Amaturo, V. (2014). Revisiting
testing: Equivalence of measures and assesses’ perceptions and the digital divide: Generational differences in technology use in
reactions. International Journal of Selection and Assessment, everyday life. North American Journal of Psychology, 16,
11, 194–205. https://doi.org/10.1111/1468-2389.00243 557–574.
Schwartz, H. A., Eichstaedt, J. C., Kern, M. L., Dziurzynski, L., Wald, R., Khoshgoftaar, T., & Sumner, C. (2012). Machine predic-
Ramones, S. M., Agrawal, M., Shah, A., … Ungar, L. H. tion of personality from Facebook profiles. 2012 IEEE 13th
International Conference on Information Reuse Integration, Yost, A. B., Behrend, T. S., Howardson, G., Darrow, J. B., &
109–115. https://doi.org/10.1109/IRI.2012.6302998 Jensen, J. M. (2019). Reactance to electronic surveillance:
Woo, S. E., Tay, L., Jebb, A. T., Ford, M. T., & Kern, M. L. (2020). A test of antecedents and outcomes. Journal of Business and
Big data for enhancing measurement quality. In S. E. Woo, L. Psychology, 34, 71–86. https://doi.org/10.1007/s10869-018-
Tay, & R. W. Proctor (Eds.), Big Data in Psychological Research 9532-2
(pp. 59–85). American Psychological Association. https://doi. Youyou, W., Kosinski, M., & Stillwell, D. (2015). Computer‐based
org/10.1037/0000193-004 personality judgments are more accurate than those made by
Wright, A. G. C. (2014). Current directions in personality science humans. Proceedings of the National Academy of Sciences, 112,
and the potential for advances through computing. IEEE Transac- 1036–1040. https://doi.org/10.1073/pnas.1418680112
tions on Affective Computing, 5, 292–296. https://doi.org/ Youyou, W., Stillwell, D., Schwartz, H. A., & Kosinski, M. (2017).
10.1109/TAFFC.2014.2332331 Birds of a feather do flock together: Behavior‐based personality‐
Yan, Y., Nie, J., Huang, L., Li, Z., Cao, Q., & Wei, Z. (2015). Is assessment method reveals personality similarity among couples
your first impression reliable? Trustworthy analysis using facial and friends. Psychological Science, 28, 276–284. https://doi.
traits in portraits. In X. He, S. Luo, D. Tao, C. Xu, J. Yang, & org/10.1177/0956797616678187
M. A. Hasan (Eds.), Multimedia modeling (pp. 148–158). Zimmermann, J., Woods, W. C., Ritter, S., Happel, M., Masuhr, O.,
https://doi.org/10.1007/978-3-319-14442-9_13 Jaeger, U., … Wright, A. G. C. (2019). Integrating structure and
Yarkoni, T., & Westfall, J. (2017). Choosing prediction over expla- dynamics in personality assessment: First steps toward the devel-
nation in psychology: Lessons from machine learning. Perspec- opment and validation of a personality dynamics diary. Psycho-
tives on Psychological Science, 12, 1100–1122. https://doi.org/ logical Assessment, 31, 516–531. https://doi.org/10.1037/
10.1177/1745691617693393 pas0000625