(C) PLOS One

(C) PLOS One
This story was originally published by PLOS One and is unaltered.
. . . . . . . . . .

Impact of gender on the formation and outcome of formal mentoring relationships in the life sciences [1]

['Leah P. Schwartz', 'Oregon Hearing Research Center', 'Oregon Health', 'Science University', 'Portland', 'Oregon', 'United States Of America', 'Jean F. Liénard', 'Stephen V. David']

Date: 2022-09

We examined training relationships with end dates between 2000 and 2020, excluding data from training areas focused on business and clinical medicine (2.9% of training relationships excluded). The resulting dataset included 109,784 mentors, 23,721 postdocs, and 365,446 students from a wide range of research areas in science, technology, engineering, and mathematics (STEM), humanities, and the social sciences ( S2A Fig ). For training at institutions in the United States, the gender composition of graduate students and postdocs across research areas was consistent with demographic data collected by the National Science Foundation (NSF) ( S2B and S2C Fig ).

We inferred mentor and trainee gender solely from first names. Gender inference was performed using Genderize, an algorithm that estimates the probability that a typical user of the name identifies as a man or a woman based on social media data recording how the name is commonly used [ 22 ]. Gender probabilities were available for 93.7% of individuals in AFT. Among this group, we excluded data from 4.3% of individuals whose first names did not have high probability of association with one gender (see Methods ; S1 Fig and S2 Table ).

We analyzed data from Academic Family Tree (AFT; available at www.academictree.org ), a crowdsourced database of academic genealogy [ 3 , 21 ]. The database integrates user-contributed and public data on academic training relationships and publications. A mentoring relationship was either explicitly indicated by database users or inferred from authorship and supervision of a dissertation listed in ProQuest’s collection of dissertations and theses.

The temporal trends observed at the level of research areas can be observed at the level of mentors grouped by academic seniority. We examined the subset of mentors with at least 2 trainees and independent career start dates after 1970 (n = 37,962 mentors). We quantified the fraction of women mentors, fraction of women students, and homophily as a function of both mentor’s career start date (1970 to 2015) and student’s graduation date (2000 to 2015). The fraction of women beginning careers as mentors increased over time from 1970 to 2015 ( S7A Fig ). After controlling for mentor’s training end date, there was no relationship between the fraction of women mentors and trainee’s training end date. This result suggests that the increase in women mentors was not driven by mentor retirements between 2000 and 2015. The fraction of women students trained by the mentors also increased ( S7B Fig ). The decrease in homophily during this period was related to time (i.e., graduation year), but not mentors’ academic age ( S7C Fig ).

Gender homophily is decreasing over time in some fields. In 6 of the 7 broad research areas, there was a significant linear decrease in homophily between 2000 and 2020 ( Fig 1B , p<0.05, t test on linear regression with time as independent variable and homophily as dependent variable). At the level of narrow research areas with more than 1,000 students, 10/29 showed a significant decrease and 19/29 showed no significant temporal trends ( S6 Fig and S1 Table ).

The degree of homophily varied considerably across research areas, with the strongest homophily in humanities and social sciences and the least in physical sciences and engineering ( S4A Fig ). The degree of homophily within a research area was uncorrelated with its gender composition ( S4B and S4C Fig ). However, comparing narrower research areas with at least 1,000 students sampled showed a trend toward correlation between homophily and the fraction of women mentors or students ( S5 Fig , n = 29 research areas, Pearson’s correlation coefficient, homophily versus fraction women students, r = 0.36, p = 0.06, homophily versus fraction women mentors, r = 0.37, p = 0.05), consistent with recent work on gender homophily in coauthorship [ 25 ].

(A) Gender ratio of all PhD students from 2000 to 2020, split across broad academic fields and mentor gender. Colors indicate mentor’s gender. Field abbreviations are reported in the legend. (B) Temporal trends in homophily, the tendency toward same-gender pairing between trainees and mentors. Each panel shows data from one major field. Lines show a linear regression of homophily as a function of graduation year. P values indicate significance of temporal trend. (C) Homophily within narrow research areas (n = 73 areas). Solid line indicates median. The data and code needed to generate this figure are available on Zenodo (DOI: 10.5281/zenodo.4722020 ).

Gender homophily occurs among all general research areas we examined ( Fig 1A and 1B ). In all fields and all years, homophily was positive, indicating a tendency for mentors and students of the same gender to associate (median homophily across all fields and years = 20.5%). This trend is also apparent at the level of narrower fields (Figs 1C and S4 and S1 Table , median homophily across 73 fields with any women mentors = 20.3%).

Homophily, the tendency for individuals to form relationships with those similar to themselves, occurs to varying degrees for many aspects of social life (race, class, gender, age, education, behavior, attitudes and beliefs, etc.) [ 23 ]. To quantify gender homophily in mentoring relationships, we calculated the degree to which same-gender mentoring relationships exceeded the proportion expected if trainees matched to mentors randomly. Distinguishing effects of individual preferences from constraints imposed by population structure is a perennial issue in studies of homophily [ 23 – 25 ]. When mentors of one gender are scarce relative to students of that gender, complete homophily is impossible: the greater the scarcity, the lower the maximum level of homophily attainable ( S3A Fig ). We therefore normalized the value of homophily so that 0% indicates random trainee–mentor gender pairing and 100% indicates the maximum possible value, given the gender composition of the mentor and trainee pools ( S3B Fig and Eq 3 ).

Gender inequity in mentor status and trainee continuation to academic mentorship roles

Consistent with previous investigations into the attrition of women across the academic career track (sometimes known as the “leaky pipeline”) [14,26–28], our results show that the proportion of women in social science and STEM fields is lower at progressively later stages of the academic career track, from graduate student to postdoc to academic mentor (S11A Fig). This result indicates the population of academic mentors remains skewed toward men, even in research areas with student populations close to gender parity. However, it does not in itself indicate whether women graduate students continue on to academic mentorship positions at the same rate as men graduate students. In addition, it does not indicate whether structural gender biases that affect women as mentors indirectly affect retention of their students in academia as formal mentors.

To address these questions, we examined the proportion of graduate students and postdocs in the life sciences that continued on to academic mentorship, accounting for factors that may impact continuation (see S1 Appendix). We hypothesized that if men and women mentors differ in status (defined here as access to funding, labor, and prestige markers such as citations) due to gender bias, these disparities might lead to differences in trainees’ continuation to academic mentorship roles. We therefore compiled several widely used metrics to quantify mentor’s status: h-index (a measurement of citation rate and publication production [29] based on data from the National Library of Medicine and Semantic Scholar), trainee count (total number of PhD students and postdocs mentored, a metric closely related to laboratory size [3]), the rate of funding granted by the US governmental agencies NSF and National Institutes of Health (NIH), and the rank of the mentor’s academic institution in the Quacquarelli Symonds World University Rankings, an annual assessment that heavily weights the institution’s reputation among academics. Funding rate, h-index, trainee count, and institutional prestige were all correlated with one another, suggesting that all 4 metrics measured a common trait of “aggregate status” (S8 Fig). To compare mentors of the same status, we sorted mentors of all genders by each status metric, then grouped them into up to 10 bins of approximately equal size, such that mentors with the same value for a status metric were never placed in different bins.

We limited our analysis of mentor status to the life sciences. Sampling was more complete in these fields for 2 reasons. First, data on publication and funding were drawn from sources specific to this field (including funding data from the NIH and publication data from the National Library of Medicine). Second, because the AFT began as an effort to crowdsource the academic genealogy of neuroscience, its sampling of mentorship data is most dense for the life sciences. To avoid false negatives in our identification of trainees that continued to academic mentorship roles, we further limited the analysis to the subset of training relationships with stop dates before 2010 and whose records had been manually edited by AFT users (final n = 11,112 mentors, 26,420 trainees, 26% of life science training relationships meeting other criteria for analysis). Although these criteria reduced the size of the dataset, they minimized the chance of false negatives in our identification of progression to academic mentorship. Due to our strict definition of continuation as progress to academic mentorship, it is likely that the continuation rates reported here (see S1 Appendix) underestimate the actual proportion of trainees that remain in academia.

Compared to women mentors, men mentors had higher mean rates of funding, trainee count, and h-index, but not institution rank (Fig 2A, p<0.00, Welch’s unequal variances t test). Consistent with this finding, men mentors were overrepresented at the highest status deciles for funding, trainee count, and h-index, while women mentors were overrepresented in lower status deciles (Fig 2B).

PPT PowerPoint slide

PNG larger image

TIFF original image Download: Fig 2. Relationship between mentor gender and mentor status. (A) Distribution of mentor-status metrics among men and women mentors in life sciences (n: total number of mentors in the continuation dataset with data for the status metric, p: p-value of Welch’s unequal variances t test for difference in mean between men and women mentors). For improved visualization, 2 outliers are not included in the histograms (trainee count = 75, h-index = 230). (B) Gender distribution of mentors after binning by status (smaller numbers higher rank). Solid line indicates percentage of men mentors within each bin. Dashed line indicates percentage of men mentors across all bins. The data and code needed to generate this figure are available on Zenodo (DOI: 10.5281/zenodo.4722020). https://doi.org/10.1371/journal.pbio.3001771.g002

To test the hypothesis that structural gender bias among mentors indirectly affects the rate at which trainees continue to academic mentorship positions, we fit logistic regression models that predicted student and postdoc continuation based on each mentor status metric individually, trainee and mentor gender, mentor seniority, and training end date (Fig 3, left). Including training end date as an independent variable accounted for long-term changes in the number of trainees, status variables and continuation rates [30]. Mentor seniority was included to control for the possibility that phenomena apparently related to gender disparities in mentor status could be explained instead by gender differences in mentors’ academic age. We also fit a model that included the first principal component (PC) of all 4 status metrics as a single “aggregate status” variable (Fig 3, right). To quantify the degree to which differences in mentors’ status account for differences in trainee continuation rates, we compared each model to one in which mentor status had been shuffled across trainees (Figs 3 and 4).

PPT PowerPoint slide

PNG larger image

TIFF original image Download: Fig 3. Association between mentor status, mentor gender, and trainee continuation in academia. (A) Mean continuation rate in academia for PhD students and postdocs in life sciences. Each panel shows data sorted according to a different measure of mentor status. Points show mean continuation rate of trainees of mentors with a given status (smaller numbers indicate higher status), grouped by trainee and mentor gender. Lines show prediction of logistic regression model, fit to these variables as well as training end date and mentor seniority. Titles indicate number of mentors with respective status data available. (B) Marginal effects of each independent variable on trainee continuation rate, predicted by logistic regression models incorporating the status variables in (A). Marginal effects of gender show the impact of the mentor or trainee being a man relative to a woman. Marginal effect of mentor status shows impact of a 1-decile increase (worsening) in rank compared to others in the field. Marginal effect of mentor seniority shows impact of a 10% increase in the variable. Error bars show 95% confidence intervals. Light gray bars show marginal effects for same model fit to data with mentor status shuffled across trainees. The data and code needed to generate this figure are available on Zenodo (DOI: 10.5281/zenodo.4722020). https://doi.org/10.1371/journal.pbio.3001771.g003

PPT PowerPoint slide

PNG larger image

TIFF original image Download: Fig 4. Reduction in mentor-gender effects after controlling for mentor status. (A) Performance of logistic regression model predicting trainee continuation based on training end date, mentor seniority, trainee and mentor gender, and mentor status before (solid line) and after shuffling (dashed) mentor status across trainees. Models are fit to the subset of life sciences data with information available for the corresponding status metric (see Fig 3). Performance of the status shuffled models varied because the pool of mentors with available data varied between metrics. (B) Marginal effect of gender and temporal variables for models in which mentor status is shuffled (dashed line) or is not (solid lines), quantified as in Fig 3B. (C) Percent decrease in marginal effect of mentor gender after incorporating each mentor status variable into the logistic regression model (p indicated for t test between marginal effect for models with and without each status metric shuffled. The data and code needed to generate this figure are available on Zenodo (DOI: 10.5281/zenodo.4722020). https://doi.org/10.1371/journal.pbio.3001771.g004

For all measures of mentor status, higher rank was associated with greater rates of trainee continuation to roles as academic mentors (note consistently sloping lines in Fig 3). Being a man or the trainee of a man was also associated with greater continuation rates. However, this disparity was substantially reduced if one considered the overrepresentation of men in higher mentor status ranks (Fig 2). Including a measure of mentor status in the model substantially reduced the effect of mentor gender. For all measures of mentor status, the magnitude of the mentor-gender effect was reduced relative to a model in which mentor status was randomized. This randomization had minimal impact on trainee-gender and temporal effects (Figs 3B and 4). Thus, controlling for mentor status reduces the apparent effect of mentor gender on trainee retention as academic mentors by up to 49% (p<0.002, t test, for all metrics except trainee count, Fig 4C). The maximum reduction occurred in the model that included h-index, and the aggregate status metric did not have greater predictive power than h-index.

Incorporating status metrics also accounted for some effects of mentor seniority (Fig 4B). In a stepwise comparison, randomizing data for both status and seniority resulted in a greater reduction of mentor-gender effects than randomizing either alone, suggesting that gender differences in seniority do not account for effects of gender differences in status (S9 Fig).

Separately analyzing data for graduate students and postdocs (S10 Fig) showed consistent effects for mentor status, mentor seniority, and trainee gender. Mentor-gender effects did not reach significance among all subsets of the data, possibly because of the reduced statistical power available in these smaller datasets.

[END]
---
[1] Url: https://journals.plos.org/plosbiology/article?id=10.1371/journal.pbio.3001771

Published and (C) by PLOS One
Content appears here under this condition or license: Creative Commons - Attribution BY 4.0.

via Magical.Fish Gopher News Feeds:
gopher://magical.fish/1/feeds/news/plosone/