Mixing Memory: Gender, Math, Stereotype Threat, and Testosterone

Tuesday, July 19, 2005

Gender, Math, Stereotype Threat, and Testosterone

The recent debate of the remarks of a certain university president about gender differences in mathematical ability has focused on the role of innate factors in those differences. I've argued previously that there is no direct evidence for innate differences, though innate factors cannot be ruled out. There is, however, a great deal of evidence for social/environmental factors that influence the observed gender differences in performance on standardized math tests. Much of the recent research on social factors has focused on Claude Steele's concept of stereotype threat¹. According to Steele (1999), stereotype threat involves "the threat of being viewed through the lens of a negative stereotype or the fear of doing something that would inadvertently confirm that stereotype" (p. 46). More technically, in contexts in which particular stereotypes are active, individuals who are members of the negatively stereotyped groups will be conscious of the content of those stereotypes, and this may negatively affect their performance. In the context of performance on standardized math tests, women who are currently aware of gender stereotypes related to math ability may experience anxiety related to the confirmation of those stereotypes, and as a result, their performance on the math tests will suffer.

There is a lot of evidence that stereotype threat can affect intellectual performance. Steele's early work on the topic looked at the affect of racial stereotypes on the intellectual performance of African Americans. However, he's found that even the performance of white males can be affected by stereotype threat if task-related stereotypes are active. Specifically, when the stereotype that Asian males have better math skills than white males is active, white men perform worse on standardized math tests than when the stereotype is not activated². More recently, researchers (at Harvard, even) have shown that female undergrads in male-dominated majors (math, science, and engineering majors) experience high levels of stereotype threat compared to women in majors that are not male-dominated, to the extent that they are much more likely than males to consider changing majors³. If women experience stereotype threat in the context of math education, and given that stereotype threat has been shown to affect intellectual performance, it would not be surprising to find that stereotype threat affects women's performance on standardized math tests, and thus plays a role in the much-discussed gender differences in math ability.

The only study on the role of stereotype threat in gender differences in math ability that I remember seeing cited during the debate over the Summers remarks was one by Jason Osborne. Using data from more than 15,000 individuals drawn from a national study of high school seniors, Osborne found that anxiety (not necessarily stereotype-related anxiety) mediated the gender differences in scores on standardized math tests, though the effect of anxiety was fairly small. In other words, anxiety played a role in the gender differences Osborne observed in math performance, but the role was a small one. This studies strength is that it used an extremely large and diverse sample. It's weakness, however, is that it did not look at the role of stereotypes specifically, and that its only measure of anxiety was post-test self-report.

Other studies, which as far as I can tell have been neglected in the discussion of Summers' remarks, have looked at the role of stereotype threat more directly. Spencer et al., for example, found that for a sample of men and women who were highly and equally qualified, gender differences in mathematical performance could be eliminated by reducing stereotype threat⁴. After showing that their math test produced the frequently observed gender differences, they had participants perform the same test after being told either that the test does not produce gender differences, or that it does produce gender differences. Women who were told that it does not produce gender differences performed as well as men, while women who were told that it does produce them performed significantly worse than men.

Brown and Josephs conducted a similar study, with similar results⁵. In their first experiment, they either told participants that the math test they were about to take would show whether they were weak in math, or whether they were strong in math. Consistent with the hypothesis that stereotype threat would affect women's performance, female participants who had been told that the test would determine whether they were weak at math (the stereotype-consistent test description) performed worse than female participants who had been told that it twould test whether they were strong in math. Men showed the opposite pattern. They performed better when they were told that the test measured math strength than when they were told that it measured math weakeness. This indicates that the effects of gender-related math stereotypes can be positive or negative. If you are a member of a group that is negatively stereotyped, then stereotype threat will hurt your performance, whereas the performance of members of the group that is positively stereotyped will perform better when the stereotype is active.

Past research (see the paper for citations) has shown that the presence of an "external handicap," i.e., some external factor that may hurt performance, can alleviate stereotype threat. In experiments 2 and 3, Brown and Josephs allowed participants to practice math problems prior to completing the math test. However, for some participants, they produced an external handicap by having the computer crash at the beginning of the practice session. If stereotype threat is hurting women's performance on math tests, then alleviating that threat by providing an external handicap should reduce the effect of the stereotype, and increase performance. Consistent with this prediction, they found that emale participants who experienced the external handicap and were told that the test measured math weakness performed significantly better than participants in the "weakness" condition than women who did not experience the external handicap. Thus, experiments two and three provide another piece of evidence that gender-related math stereotypes are affecting women's math performance.

Another factor that influences the effect of stereotypes on performance is group identification. Individuals who highly identify with a stereotyped group will be more susceptible to the negative effects of that group's stereotypes. Thus, we would expect that women who place more importance on their gender identity will perform worse on math tests when math-related gender stereotypes are activated than women who place less importance on their gender identity. To test this, Schmader conducted a study using participants with either high or low gender identification, and found that women for whom their gender identity was important performed worse on a math test when they were told that the test produced gender differences than men, while women who placed little importance on their gender identity performed as well as men on the same test⁶.

In another set of studies, Josephs et al. looked at the relationship between stereotype threat and social status⁷. They hypothesized that for individuals who view math ability as important, high levels of social status concern will increase the efffects of stereotype threat. Josephs and others (see paper for citations) have shown that for both men and women, high levels of testosterone (high-T) are correlated with high levels of status concern. Thus, they predicted that high-T women (women with high levels of testosterone compared to other women),would be more affected by the stereotype, and thus their performance would suffer more than women who had low levels of testosterone (low-T). To test this prediction, they first took saliva samples from participants who rated math ability as important, to measure their current testosterone levels, and then had them complete a short questionaire. Half of the participants completed a questionaire that had previously been shown to prime math-related gender stereotypes, while the other half completed a control questionaire. The participants then completed 20 questions from the quantiative section of the GRE. In their fist study, they found that high-T women performed worse when the stereotype was activated (when they had completed the stereotype-activating questionaire) than when it was not, whereas low-T women performed equally well in both conditions. In fact, high-T women in the stereotype-prime condition performed worse than low-T women in both the stereotype-prime and control conditions, indicating that the effect of the stereotype was quite large for high-T women. Interestingly, in their second experiment, they showed that, consistent with the findings of Brown and Josephs, the effects of testosterone on men are reversed. High-T men actually performed better when gender stereotypes were activated than when they weren't, and better than low-T men in either condition, while low-T men's performance did not differ in the two conditions. High-T men who have had the gender stereotype primed apparently view the test as a way to show off their mathematical abilities, and thus increase their social status.

Finally, researchers have begun to look at the ways in which stereotype threat might affect women's performance on math tests. In one study, Johannes Keller used methods similar to those of Steele et al. (2002), in which participants complete a math test either after having the gender stereotype primed or without having it primed⁸. Consistent with the other research, he found that females (he used high school students) performed worse than men when the stereotype was active, but as well as men when it was not. He also found that the decrease in performance was largely due to self-handicapping, which is common in the face of stereotype threat. Self-handicapping involves things like decreased effort and attention, procrastination, and similar performance-reducing behaviors. In Keller's study, female participants form who the stereotype was primed were much more likely to perform self-handicapping behaviors.

In another study, Quinn and Spencer primed math-related gender stereotypes, and again showed that women in the primed condition performed worse than women for whom the stereotypes were not active, and that for the latter group, performance was equal to that of men⁹. In addition to completing the math test, participants in their study also described their problem-solving strategies. Quinn and Spencer then coded these strategies, and found that women in the stereotype-primed condition, women produced fewer and less effective problem-solving strategies. This indicates that stereotype-threat made it more difficult for women to produce problem-solving strategies, and that this reduced their performance on the test. Another interesting result from their study that is not directly related to the role of stereotype threat was that in their first experiment, in which stereotypes were not activated, women performed as well as men on numerical problems, but worse on word problems.

The point of all of this is that several studies have shown that stereotype threat influences women's performance on math tests, and thus is likely responsible for at least part of the observed gender differences in math ability. In fact, given that in most of the studies, gender differences were eliminated when stereotype-threat was absent or reduced, either through indicating that the test does not produce gender differences or producing an external handicap, or in participants for whom stereotypes are not as relevant due to the low importance of gender identity or low levels of social status concern, the role of stereotype-threat in gender differences may be quite large. The fact that Osborne found only a small, though statistically significant effect of anxiety is interesting, but it doesn't speak directly to the role of stereotype-threat, and is overshadowed by the long list of studies that directly demonstrated large effects of stereotype-threat. It would be difficult for innate factors, even those relating to the influence of spatial reasoning ability, to account for much of the data on gender differences in math ability. As of yet, no one has a theory explaining how innate differences account for the fact that gender differences disappear in untimed tests, in numerical problems vs. word problems, and when stereotype threat is alleviated. This doesn't mean that there aren't innate differences, but it does mean that for now, the best evidence we have indicates that social factors play a strong role in gender differences in math, and it would be a mistake to overlook them, particularly in the search for innate differences that cannot explain the data.

¹ See e.g., Steele, C. M. (1997). A threat in the air: How stereotypes shape intellectual identity and Performance. American Psychologist, 52, 613-629, or Steele, C.M. (1999, August). Thin ice: “Stereotype threat” and Black college students. Atlantic Monthly, 44-54.
² Aronson, J., Lustina, M., Good, C., Keough, K., Steele, C., & Brown, J. (1999). When white men can't do math: Necessary factors in stereotype threat. Journal of Experimental Social Psychology, 35, 29-46.
³ Steele, J., James, J.B., & Barnett, R.C. (2002). Learning in a man’s world: Examining the perceptions of undergraduate women in male-dominated academic areas. Psychology of Women Quarterly, 26, 46-50.
⁴ Spencer, S.J., Steele, C.M., & Quinn, D.M. (1999). Stereotype threat and women's math performance. Journal of Experimental Social Psychology, 35(1), 4-28. See also Walsh, M., Crystal, H., & Duffy, J. (1999). Influence of item content and stereotype situation on gender differences in mathematical problem solving. Sex Roles, 41(3-4), 219-240, for a study using similar methods and producing similar results in a sample of middle school-aged participants.
⁵ Brown, R.B., & Josephs, R.A. (1999). A burden of proof: Stereotype relevance and gender differences in math performance. Journal of Personality and Social Psychology, 76(2), 246-257.
⁶ Schmader, T. (2002). Gender identification moderates stereotype threat effects on women's math performance. Journal of Experimental Social Psychology. 38(2), 194-201.
⁷ Josephs, R.A., Newman, M.L, Brown, R.P., & Beer, J.M. (2003). Status, testosterone, and human intellectual performance: Stereotype threat as status concern. Psychological Science, 14, 158-163.
⁸ Keller, J. (2002). Blatant stereotype threat and women's math performance: Self-handicapping as a strategic means to cope with obtrusive negative performance expectations. Sex Roles, 47(3-4), 193-198.
⁹ Quinn, D.M., & Spencer, S.J. (2001). The interference of stereotype threat with women's generation of mathematical problem-solving strategies. Journal of Social Issues, 57(1), 55-71.

15 comments:

sherifffruitfly said...: All the arrgle-bargle aside, I can say from my years of teaching university math that girls tended to perform noticeably better at lower levels (the "science" of math) than the boys. I gave it a little bit of thought at the time, and concluded that the difference was mostly due to the fact that girls tended to follow the instructors' instructions more often than the boys did.

How did they compare at the upper levels in math (the "art" of mathematics)? Don't know - there were too few girls there to draw any serious conclusion - which is a pity.; 7/21/2005 2:16 PM
Anonymous said...: ...female participants who had been told that the test would determine whether they were weak at math (the stereotype-consistent test description) performed worse than female participants who had been told that it twould test whether they were strong in math. Men showed the opposite pattern. They performed better when they were told that the test measured math strength than when they were told that it measured math weakeness.

Do you mean men performed better when they were told the test measured math weakness than when told it measured math strength? That would be the opposite pattern, wouldn't it?; 7/21/2005 4:08 PM
Anonymous said...: Oh! Never mind, I understand now. I should have kept reading before asking.; 7/21/2005 4:10 PM
Steve Sailer said...: As an old marketing researcher, I'd like to point out the alternative explanation to Steele's theory. In the marketing research business, a major problem is that survey respondents try to figure out what the answer is that you'd like to hear and give it to you -- if their are no costs to them.

The widespread enthusiasm for Claude Steele's "stereotype threat" theory is particularly odd because its assumption that blacks collapse under a pressure would seem racially derogatory. Back in the bad old days, it was bigoted whites who jeered that black sports pioneers like Joe Louis, Jesse Owens, and Jackie Robinson would choke as soon as the spotlight was on them. They didn't. Similarly, Paul Robeson didn't suddenly forget his lines when the curtain came up on Othello, nor did Marian Anderson sing off-key at the Lincoln Memorial. In fact, they all seemed to experience the opposite of stereotype threat: "stereotype stimulation," a burning desire to prove their naysayers wrong.

So eventually, that old stereotype died out.

In reality, however, nobody cares about these logical implications because nobody truly believes in stereotype theory.

Stereotype theory's fans just want to use it to wish away the white-black test score gap.

Unfortunately for them, the January 2004 issue of the scientific journal American Psychologist, the publication of the American Psychology Association, ran a pointed article by Paul R. Sackett, Chaitra M. Hardison, and Michael J. Cullen documenting that Steele's research is

"[w]idely misinterpreted in both popular and scholarly publications as showing that eliminating stereotype threat eliminates the African American-White difference in test performance.”

The psychologists’ point:

"[R]ather than showing that eliminating threat eliminates the large score gap on standardized tests, the research actually shows something very different. Specifically, absent stereotype threat, the African American–White difference is just what one would expect based on the African American–White difference in SAT scores, whereas in the presence of stereotype threat, the difference is larger than would be expected based on the difference in SAT scores."

In other words, Steele only showed he could persuade black students to do worse than they did on the SAT. He did not show he could make black Stanford students score better than they had on the Verbal SAT—which was about a half-standard deviation below the white Stanford students in the study.

Far from than debunking the SAT, Steele tacitly relied on the SAT as a fair measure of ability. (Curtis Crawford of the www.DebatingRacialPreference.org website has examined the new critique in detail for the National Association of Scholars.)

What Steele's fans have failed to grasp is that Steele was not investigating how the SAT was too hard on blacks, but how it was too easy on them. Blacks at elite colleges tend to get worse grades than their SAT or ACT scores (or high school GPA) would predict.

In a 1992 Atlantic article, Steele dealt frankly with this little-known fact:

"This pattern has been documented so broadly across so many regions of the country, and by so many investigations (literally hundreds), that it is virtually a social law in this society--as well as a racial tragedy."; 7/23/2005 5:37 PM
Richard Y Chappell said...: "Men showed the opposite pattern. They performed better when they were told that the test measured math strength than when they were told that it measured math weakeness. This indicates that the effects of gender-related math stereotypes can be positive or negative. If you are a member of a group that is negatively stereotyped, then stereotype threat will hurt your performance, whereas the performance of members of the group that is positively stereotyped will perform better when the stereotype is active."

I'm puzzled by this reasoning. Let me clarify how I'm reading it (and please do correct me if I've misunderstood):

1. DATA: Both males and females do better in positively ("strength") than negatively ("weakness") framed conditions.

2. The stereotype treats men and women differently -- associating the former with math strength, and the latter with weakness.

3. EXPLANATION: This inversion can be cancelled out by suggesting that stereotype threat will respectively help and hinder those positively and negatively stereotyped.

4. By this double inversion, we get back to the original result. Thus it is explained.

Now, wouldn't it be far more logical to skip the whole double inversion thing, and simply stop at step 1? This would yield the sensible conclusion that everybody is helped by positive framing and hindered by negative. Different stereotypes (plus different reactions to those stereotypes, in order to cancel the inversion) don't seem to have anything to do with it. It just seems like a pointless complication: like adding "+ 2 - 2" to a maths equation.

(That doesn't speak to the rest of the post at all. I'm just suspicious of this specific bit of reasoning, is all.); 7/25/2005 6:54 AM
Chris said...: Richard, that's one possible interpretation, but I don't think it explains the whole of the data. Since I didn't include numbers, or all of the relative differences, it's really my fault. Men in the have the strongest performance overall. In other words, their performance increases relative to women in all conditions and men in the weak condition. This is why they conclude that there is a bump. It could be said that the "strength" condition improves women's performance, but it doesn't affect their performance to the extent that it affects the men's performance.; 7/25/2005 2:23 PM
Richard Y Chappell said...: Ahkay, thanks for clarifying that.; 7/25/2005 11:16 PM
profanity said...: Also check out Barb Fredrickson's study which demonstrated that self-objectification of one's body uses up cognitive resources and hurts women's math performance:

Fredrickson, B. L. Roberts, T., Noll, S. M., Quinn, D. M., & Twenge, J. M. (1998). That swimsuit becomes you: Sex differences in self-objectification, restrained eating and math performance. Journal of Personality and Social Psychology, 75, 269-284.

http://socialpsych.uconn.edu/Fredrickson_et_al_JPSP1998.pdf; 3/19/2008 11:46 AM
Anonymous said...: 1)
cheap hair straighteners
chi hair straightener
chi flat iron

new polo shirts
cheap Lacoste polo shirts
cheap Lacoste polo shirts

cheap handbags
cheap bags
puma chaussures
chaussures puma
chaussure puma

Men's North Face
Women's North Face

hair straighteners
sexy lingerie store
cheap ugg boots
tattoo wholesale
men's clothing
women's clothing

2009 nike shoes
new nike shoes
Women's max
Men's max 93
nike shox
Nike air force
Nike air max 2003
nike air max ltd
nike air max tn
Nike air rift
Nike air Yeezy
nike airmax
Nike air max 90
Nike air max 97
nike birds nest shoes
nike dunk
nike RT1 shoes
nike SB
nike shox shoes
Nike shox OZ shoes
Nike shox R2 shoes
Nike shox R3 shoes
Nike shox R4 shoes
Nike shox R5 shoes
Nike shox TL3
nike trainers lovers

tennis rackets
Wilson tennis rackets
HEAD tennis rackets
Babolat tennis rackets; 6/10/2009 12:17 AM
kinggool said...: Interesting article on science! I know of an exciting website which sells men’s and women’s apparel online. Also fashionable accessories, health and fitness gadgets and beauty kits are available for sale. As designer clothes showcases your tastes, the modern-day lifestyle depends on the factors of style, sophistication and compatibility.; 9/26/2009 4:49 AM
Anonymous said...: Anothre set of ralph lauren polo in three colors. Ralph

Lauren Polo Shirts come with a graduation of colors on the chest. The new shiki Cheap polo

shirts logo is on the chest. Comes in all layres and can be worned tucked in or out Wholesale Polo Shirts -50% OFF.
Also comes with sculpted collar and arm cuffs.Flat knit collar, Contrast placket and half

moon, Mothre-of-pearl buttons and tennis tail, Soft double knit piqué,100% cotton ralph lauren

polo shirts. If you lend your embroidrere a sample cheapralph lauren polos that will help you get

the closest match to the original embroidreed design. The artwork you provide will sreve as a

template for your embroidrey set-up but discount ralph lauren polos can't be used to identify some

stitch types, cheap polo shirts Cheap Polo Shirtsgive

precise sizing of embroidreed elements and show thread polo clothing colors.; 12/28/2009 7:58 PM
Anonymous said...: Lucyliu IS Lucyliu; 1/28/2010 7:36 PM
Anonymous said...: Cheap Sunglasses or Discount Sunglasses, Armani Sunglasses, Burberry Sunglasses , Cartier Sunglasses , D&G Sunglasses ( cheap D&G sunglasses or discount D&G sunglasses), Dior Sunglasses and Fendi Sunglasses， Gucci Sunglasses ( discount gucci sunglasses or cheap gucci sunglasses)， Oakley Sunglasses ( in other words cheap oakley sunglasses or discount oakley sunglasses ), Police Sunglasses , ED Hardy Sunglasses , LV Sunglasses , Versace Sunglasses ， Ray Ban Sunglasses, Roberto Cavalli Sunglasses, Prada Sunglasses( cheap prada sunglasses or discount prada sunglasses， Chanel Sunglasses , Coach Sunglasses , Sunglasses Outlet; 3/12/2010 2:58 AM
Anonymous said...: jordan 1
jordan 2
jordan 3
jordan 4
jordan 5
jumpmen23; 4/02/2010 10:16 PM
ai said...: http://mixingmemory.blogspot.com/2005/07/gender-math-stereotype-threat-and.html; 4/19/2010 11:16 PM