Caliskan et al. - 2017 - Semantics Derived Automatically From Language Corpora Contain Human-Like Biases
We show that standard machine learning can acquire stereotyped biases from textual data that reflect everyday human culture. The general idea that text corpora capture semantics, including cultural stereotypes and empirical associations, has long been known in corpus linguistics (1, 2), but our findings add to this knowledge in three ways. First, we used word embeddings (3), a powerful tool to extract associations captured in text corpora; this method substantially amplifies the signal found in raw statistics. Second, our replication of documented human biases may yield tools and insights for studying prejudicial attitudes and behavior in humans. Third, since we performed our experiments on off-the-shelf machine learning components [primarily the Global Vectors for Word Representation (GloVe) word embedding], we show that cultural stereotypes propagate to artificial intelligence (AI) technologies in widespread use.

Before presenting our results, we discuss key terms and describe the tools we use. Terminology varies by discipline; these definitions are intended for clarity of the present article. In AI and machine learning, bias refers generally to prior information, a necessary prerequisite for intelligent action (4). Yet bias can be problematic where such information is derived from aspects of human culture known to lead to harmful behavior. Here, we will call such biases "stereotyped" and actions taken on their basis "prejudiced."

We used the Implicit Association Test (IAT) as our primary source of documented human biases (5). The IAT demonstrates enormous differences in response times when subjects are asked to pair two concepts they find similar, in contrast to two concepts they find different. We developed our first method, the Word-Embedding Association Test (WEAT), a statistical test analogous to the IAT, and applied it to a widely used semantic representation of words in AI: word embeddings. Word embeddings represent each word as a vector in a vector space of about 300 dimensions, based on the textual context in which the word is found. We used the distance between a pair of vectors (more precisely, their cosine similarity score, a measure of correlation) as analogous to reaction time in the IAT. The WEAT compares these vectors for the same sets of words used by the IAT; we describe the WEAT in more detail below.

Most closely related to this paper is concurrent work by Bolukbasi et al. (6), who propose a method to "debias" word embeddings. Our work is complementary, as we focus instead on rigorously demonstrating human-like biases in word embeddings. Further, our methods do not require an algebraic formulation of bias, which may not be possible for all types of bias. Additionally, we studied the relationship between stereotyped associations and empirical data concerning contemporary society.

Using the measure of semantic association described above, we were able to replicate every stereotype that we tested. We selected IATs that studied general societal attitudes, rather than those of subpopulations, and for which lists of target and attribute words (rather than images) were available. The results are summarized in Table 1.

Greenwald et al. introduced and validated the IAT by studying biases that they consider nearly universal in humans and about which there is no social concern (5). We began by replicating these inoffensive results for the same purposes. Specifically, they demonstrated that flowers are significantly more pleasant than insects, based on the ease with which subjects pair them with pleasant rather than unpleasant terms; the same held for musical instruments versus weapons (Table 1, rows 1 and 2). Moving to stereotypes, a bundle of names associated with being European American was found to be significantly more easily associated with pleasant than unpleasant terms, compared with a bundle of African-American names.

In replicating this result, we were forced to slightly alter the stimuli because some of the original African-American names did not occur in the corpus with sufficient frequency to be included. We therefore also deleted the same number of European-American names, chosen at random, to balance the number of elements in the two sets of target concepts. Omissions and deletions are indicated in our list of keywords (see the supplementary materials).

In another widely publicized study, Bertrand and Mullainathan (7) sent nearly 5000 identical résumés in response to 1300 job advertisements, varying only the names of the candidates. They found that European-American candidates were 50% more likely to be offered an opportunity to be interviewed. In follow-up work, they argued that implicit biases help account for these effects (8). We provide additional evidence for this hypothesis using word embeddings: we tested the names used in their study for pleasantness associations. As before, we had to delete some low-frequency names. We confirmed the association using two different sets of "pleasant/unpleasant" stimuli: those from the original IAT paper and a shorter, revised set published later (9).

Turning to gender biases, we replicated a finding that female names are more associated with family words than career words, compared with male names (9). This IAT was conducted online and thus had a vastly larger subject pool but far fewer keywords; we replicated the IAT results even with these reduced keyword sets. We also replicated an online IAT finding that female words (e.g., "woman" and "girl") are more associated with the arts than with mathematics, compared with male words (9). Finally, we replicated a laboratory study showing that female words are more associated with the arts than with the sciences (10).

1Center for Information Technology Policy, Princeton University, Princeton, NJ, USA. 2Department of Computer Science, University of Bath, Bath BA2 7AY, UK.
*Corresponding author. Email: [email protected] (A.C.); [email protected] (J.J.B.); [email protected] (A.N.)
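To make the WEAT's core computation concrete, here is a minimal Python sketch of the quantities just described. It is an illustration rather than the authors' archived implementation: it assumes word vectors (e.g., from GloVe) have already been loaded into a dict `vectors` mapping each word to a NumPy array, and it uses a Cohen's-d-style effect size over per-word association scores, matching the form of the d values reported in Table 1.

```python
import numpy as np

def cosine(u, v):
    """Cosine similarity between two word vectors, the paper's
    analog of reaction time in the IAT."""
    return np.dot(u, v) / (np.linalg.norm(u) * np.linalg.norm(v))

def association(w, A, B, vectors):
    """Differential association of word w with attribute word sets
    A (e.g., pleasant terms) and B (e.g., unpleasant terms)."""
    return (np.mean([cosine(vectors[w], vectors[a]) for a in A]) -
            np.mean([cosine(vectors[w], vectors[b]) for b in B]))

def weat_effect_size(X, Y, A, B, vectors):
    """Effect size d: standardized difference in mean association
    between the two target word sets X and Y."""
    x_scores = [association(x, A, B, vectors) for x in X]
    y_scores = [association(y, A, B, vectors) for y in Y]
    return ((np.mean(x_scores) - np.mean(y_scores)) /
            np.std(x_scores + y_scores, ddof=1))
```

For the first row of Table 1, for example, X and Y would be the flower and insect target lists, and A and B the pleasant and unpleasant attribute lists of Greenwald et al. (5).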
[Fig. 1. Occupation-gender association. y axis: strength of association; x axis: percentage of workers in occupation who are women. Pearson's correlation coefficient r = 0.90 with P < 10^−18.]

[Fig. 2. Name-gender association. y axis: strength of association; x axis: percentage of people with name who are women. Pearson's correlation coefficient r = 0.84 with P < 10^−13.]
Table 1. Summary of Word-Embedding Association Tests. We replicated eight well-known IAT findings using word embeddings (rows 1 to 3 and 6 to 10); we also help explain prejudiced human behavior concerning hiring in the same way (rows 4 and 5). Each result compares two sets of words from target concepts about which we are attempting to learn with two sets of attribute words. In each case, the first target is found compatible with the first attribute, and the second target with the second attribute. Throughout, we use word lists from the studies we seek to replicate. N, number of subjects; NT, number of target words; NA, number of attribute words. We report the effect sizes (d) and P values (P, rounded up) to emphasize that the statistical and substantive significance of both sets of results is uniformly high; we do not imply that our numbers are directly comparable with those of human studies. For the online IATs (rows 6, 7, and 10), P values were not reported but are known to be below the significance threshold of 10^−2. Rows 1 to 8 are discussed in the text; for completeness, this table also includes the two other IATs for which we were able to find suitable word lists (rows 9 and 10). We found similar results with word2vec, another algorithm for creating word embeddings, trained on a different corpus, Google News (see the supplementary materials). Columns 4 to 7 give the original finding (Ref., N, d, P); columns 8 to 11 give our finding (NT, NA, d, P).

| Row | Target words | Attribute words | Ref. | N | d | P | NT | NA | d | P |
|-----|--------------|-----------------|------|---|---|---|----|----|---|---|
| 1 | Flowers vs. insects | Pleasant vs. unpleasant | (5) | 32 | 1.35 | 10^−8 | 25 × 2 | 25 × 2 | 1.50 | 10^−7 |
| 2 | Instruments vs. weapons | Pleasant vs. unpleasant | (5) | 32 | 1.66 | 10^−10 | 25 × 2 | 25 × 2 | 1.53 | 10^−7 |
| 3 | European-American vs. African-American names | Pleasant vs. unpleasant | (5) | 26 | 1.17 | 10^−5 | 32 × 2 | 25 × 2 | 1.41 | 10^−8 |
| 4 | European-American vs. African-American names | Pleasant vs. unpleasant from (5) | (7) | Not applicable | | | 16 × 2 | 25 × 2 | 1.50 | 10^−4 |
| 5 | European-American vs. African-American names | Pleasant vs. unpleasant from (9) | (7) | Not applicable | | | 16 × 2 | 8 × 2 | 1.28 | 10^−3 |
| 6 | Male vs. female names | Career vs. family | (9) | 39k | 0.72 | <10^−2 | 8 × 2 | 8 × 2 | 1.81 | 10^−3 |
| 7 | Math vs. arts | Male vs. female terms | (9) | 28k | 0.82 | <10^−2 | 8 × 2 | 8 × 2 | 1.06 | 0.018 |
| 8 | Science vs. arts | Male vs. female terms | (10) | 91 | 1.47 | 10^−24 | 8 × 2 | 8 × 2 | 1.24 | 10^−2 |
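The significance test behind the embedding P values above is specified precisely in the supplementary materials. As an illustration only (not necessarily the paper's exact procedure), a standard way to attach a one-sided P value to this kind of statistic is a permutation test over equal-size re-partitions of the pooled target words. The sketch below reuses the `association` helper from the earlier snippet; the summed-association statistic and the number of samples are assumptions made for the example.

```python
import numpy as np

def test_statistic(X, Y, A, B, vectors):
    """Total differential association of target set X minus that of Y."""
    return (sum(association(x, A, B, vectors) for x in X) -
            sum(association(y, A, B, vectors) for y in Y))

def permutation_p_value(X, Y, A, B, vectors, n_samples=10_000, seed=0):
    """One-sided P value: the fraction of random equal-size
    re-partitions of the pooled target words whose statistic is at
    least as large as the observed one."""
    rng = np.random.default_rng(seed)
    observed = test_statistic(X, Y, A, B, vectors)
    pooled = list(X) + list(Y)
    hits = 0
    for _ in range(n_samples):
        shuffled = list(rng.permutation(pooled))
        Xi, Yi = shuffled[:len(X)], shuffled[len(X):]
        if test_statistic(Xi, Yi, A, B, vectors) >= observed:
            hits += 1
    return hits / n_samples
```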
The statistic associated with each word vector is a normalized association score of the word with the attribute:

$$ s(w, A, B) = \frac{\text{mean}_{a \in A} \cos(\vec{w}, \vec{a}) - \text{mean}_{b \in B} \cos(\vec{w}, \vec{b})}{\text{std-dev}_{x \in A \cup B} \cos(\vec{w}, \vec{x})} $$

The null hypothesis is that there is no association between s(w, A, B) and p_w, the real-world property paired with word w (such as the percentage of women in an occupation; see Figs. 1 and 2). We tested the null hypothesis using a linear regression analysis to predict the latter from the former.
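Figs. 1 and 2 pair exactly this normalized association score with external, real-world data. The sketch below illustrates the occupation case of Fig. 1 under stated assumptions: the attribute lists, occupation words, and p_w values are placeholders rather than the paper's data, and `cosine` and `vectors` are carried over from the first snippet. The final step mirrors the linear regression analysis described above.

```python
import numpy as np
from scipy import stats

def normalized_association(w, A, B, vectors):
    """s(w, A, B): difference of mean cosines with A and B, divided by
    the standard deviation of cosines over the union of A and B."""
    cos_a = [cosine(vectors[w], vectors[a]) for a in A]
    cos_b = [cosine(vectors[w], vectors[b]) for b in B]
    cos_all = [cosine(vectors[w], vectors[x]) for x in list(A) + list(B)]
    return (np.mean(cos_a) - np.mean(cos_b)) / np.std(cos_all, ddof=1)

# Placeholder inputs, not the paper's word lists or statistics.
female_terms = ["female", "woman", "girl", "she", "her"]
male_terms = ["male", "man", "boy", "he", "him"]
occupations = ["nurse", "librarian", "engineer", "carpenter"]
pct_women = [88.0, 79.0, 13.0, 2.0]  # p_w, e.g., from labor statistics

scores = [normalized_association(w, female_terms, male_terms, vectors)
          for w in occupations]
# Predict p_w from s(w, A, B); r is a Pearson correlation
# coefficient of the kind reported in Fig. 1.
slope, intercept, r, p_value, stderr = stats.linregress(scores, pct_women)
```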
We elaborate on further implications of our results. In psychology, our results add to the credence of the IAT by replicating its results in such a different setting. Further, our methods may yield an efficient way to explore previously unknown implicit associations. Researchers who conjecture implicit associations might first test them using the WEAT on a suitable corpus before testing human subjects. Similarly, our methods could be used to quickly find differences in bias between demographic groups, given large corpora authored by members of the respective groups. If substantiated through testing and replication, the WEAT may also give us access to the implicit associations of groups not available for testing, such as historic populations.

We have demonstrated that word embeddings encode not only stereotyped biases but also other knowledge, such as the visceral pleasantness of flowers or the gender distribution of occupations. These results lend support to the distributional hypothesis in linguistics, namely that the statistical contexts of words capture much of what we mean by meaning (16). Our findings are also sure to contribute to the debate concerning the Sapir-Whorf hypothesis (17), because our work suggests that behavior can be driven by cultural history embedded in a term's historic use. Such histories can evidently vary between languages.

We stress that we replicated every association documented via the IAT that we tested. The number, variety, and substantive importance of our results raise the possibility that all implicit human biases are reflected in the statistical properties of language. Further research is needed to test this hypothesis and to compare language with other modalities, especially the visual, to see if they have similarly strong explanatory power.

Our results also suggest a null hypothesis for explaining the origins of prejudicial behavior in humans, namely, the implicit transmission of ingroup/outgroup identity information through language. That is, before providing an explicit or institutional explanation for why individuals make prejudiced decisions, one must show that the decisions were not a simple outcome of unthinking reproduction of statistical regularities absorbed with language. Similarly, before positing complex models for how stereotyped attitudes perpetuate from one generation to the next or from one group to another, we must check whether simply learning language is sufficient to explain (some of) the observed transmission of prejudice.

Our work has implications for AI and machine learning because of the concern that these technologies may perpetuate cultural stereotypes (18). Our findings suggest that if we build an intelligent system that learns enough about the properties of language to be able to understand and produce it, in the process it will also acquire historical cultural associations, some of which can be objectionable. Already, popular online translation systems incorporate some of the biases we study (see the supplementary materials). Further concerns may arise as AI is given agency in our society. If machine-learning technologies used for, say, résumé screening were to imbibe cultural stereotypes, prejudiced outcomes may result. We recommend addressing this through the explicit characterization of acceptable behavior. One such approach is seen in the nascent field of fairness in machine learning, which specifies and enforces mathematical formulations of nondiscrimination in decision-making (19, 20). Another approach can be found in modular AI architectures, such as cognitive systems, in which the implicit learning of statistical regularities can be compartmentalized and augmented with explicit instruction of rules of appropriate conduct (21, 22). Certainly, caution must be used in incorporating modules constructed via unsupervised machine learning into decision-making systems.

REFERENCES AND NOTES

1. M. Stubbs, Text and Corpus Analysis: Computer-Assisted Studies of Language and Culture (Blackwell, Oxford, 1996).
2. J. A. Bullinaria, J. P. Levy, Behav. Res. Methods 39, 510–526 (2007).
3. T. Mikolov, J. Dean, Adv. Neural Inf. Process. Syst. 2013, 3111–3119 (2013).
4. C. M. Bishop, Pattern Recognition and Machine Learning (Springer, London, 2006).
5. A. G. Greenwald, D. E. McGhee, J. L. Schwartz, J. Pers. Soc. Psychol. 74, 1464–1480 (1998).
6. T. Bolukbasi, K.-W. Chang, J. Y. Zou, V. Saligrama, A. T. Kalai, Adv. Neural Inf. Process. Syst. 2016, 4349–4357 (2016).
7. M. Bertrand, S. Mullainathan, Am. Econ. Rev. 94, 991–1013 (2004).
8. M. Bertrand, D. Chugh, S. Mullainathan, Am. Econ. Rev. 95, 94–98 (2005).
9. B. A. Nosek, M. Banaji, A. G. Greenwald, Group Dyn. 6, 101–115 (2002).
10. B. A. Nosek, M. R. Banaji, A. G. Greenwald, J. Pers. Soc. Psychol. 83, 44–59 (2002).
11. B. A. Nosek et al., Proc. Natl. Acad. Sci. U.S.A. 106, 10593–10597 (2009).
12. P. D. Turney, P. Pantel, J. Artif. Intell. Res. 37, 141 (2010).
13. J. Pennington, R. Socher, C. D. Manning, EMNLP 14, 1532–1543 (2014).
14. T. MacFarlane, Extracting semantics from the Enron corpus, University of Bath, Department of Computer Science Technical Report Series, CSBU-2013-08 (2013); http://opus.bath.ac.uk/37916/
15. W. Lowe, S. McDonald, The direct route: Mediated priming in semantic space, Proceedings of the Twenty-Second Annual Conference of the Cognitive Science Society (LEA, 2000), pp. 806–811.
16. M. Sahlgren, Ital. J. Linguist. 20, 33 (2008).
17. G. Lupyan, Lang. Learn. 66, 516–553 (2016).
18. S. Barocas, A. D. Selbst, Calif. Law Rev. 104, 671 (2016).
19. C. Dwork, M. Hardt, T. Pitassi, O. Reingold, R. Zemel, Fairness through awareness, Proceedings of the 3rd Innovations in Theoretical Computer Science Conference (ACM, 2012), pp. 214–226.
20. M. Feldman, S. A. Friedler, J. Moeller, C. Scheidegger, S. Venkatasubramanian, Certifying and removing disparate impact, Proceedings of the 21st ACM SIGKDD International Conference on Knowledge Discovery and Data Mining (ACM, 2015), pp. 259–268.
21. K. R. Thórisson, Minds Mach. 17, 11–25 (2007).
22. M. Hanheide et al., Artif. Intell., 10.1016/j.artint.2015.08.008 (2015).
23. L. L. Monteith, J. W. Pettit, J. Soc. Clin. Psychol. 30, 484–505 (2011).

ACKNOWLEDGMENTS

We are grateful to W. Lowe for substantial assistance in the design of our significance tests; T. Macfarlane for pilot research as a part of his undergraduate dissertation; and S. Barocas, M. Brundage, K. Crawford, C. Lai, and M. Salganik for extremely useful comments on a draft of this paper. We have archived the code and data on Harvard Dataverse (doi: 10.7910/DVN/DX4VWP).

SUPPLEMENTARY MATERIALS

www.sciencemag.org/content/356/6334/183/suppl/DC1
Materials and Methods
Supplementary Text
Table S1
References

17 November 2016; accepted 9 March 2017
10.1126/science.aal4230