- Split View
-
Views
-
Cite
Cite
Hagai Levine, Niels Jørgensen, Anderson Martino-Andrade, Jaime Mendiola, Dan Weksler-Derri, Maya Jolles, Rachel Pinotti, Shanna H Swan, Temporal trends in sperm count: a systematic review and meta-regression analysis of samples collected globally in the 20th and 21st centuries, Human Reproduction Update, Volume 29, Issue 2, March-April 2023, Pages 157–176, https://doi.org/10.1093/humupd/dmac035
- Share Icon Share
Abstract
Numerous studies have reported declines in semen quality and other markers of male reproductive health. Our previous meta-analysis reported a significant decrease in sperm concentration (SC) and total sperm count (TSC) among men from North America–Europe–Australia (NEA) based on studies published during 1981–2013. At that time, there were too few studies with data from South/Central America–Asia–Africa (SAA) to reliably estimate trends among men from these continents.
The aim of this study was to examine trends in sperm count among men from all continents. The broader implications of a global decline in sperm count, the knowledge gaps left unfilled by our prior analysis and the controversies surrounding this issue warranted an up-to-date meta-analysis.
We searched PubMed/MEDLINE and EMBASE to identify studies of human SC and TSC published during 2014–2019. After review of 2936 abstracts and 868 full articles, 44 estimates of SC and TSC from 38 studies met the protocol criteria. Data were extracted on semen parameters (SC, TSC, semen volume), collection year and covariates. Combining these new data with data from our previous meta-analysis, the current meta-analysis includes results from 223 studies, yielding 288 estimates based on semen samples collected 1973–2018. Slopes of SC and TSC were estimated as functions of sample collection year using simple linear regression as well as weighted meta-regression. The latter models were adjusted for predetermined covariates and examined for modification by fertility status (unselected by fertility versus fertile), and by two groups of continents: NEA and SAA. These analyses were repeated for data collected post-2000. Multiple sensitivity analyses were conducted to examine assumptions, including linearity.
Overall, SC declined appreciably between 1973 and 2018 (slope in the simple linear model: –0.87 million/ml/year, 95% CI: –0.89 to –0.86; P < 0.001). In an adjusted meta-regression model, which included two interaction terms [time × fertility group (P = 0.012) and time × continents (P = 0.058)], declines were seen among unselected men from NEA (–1.27; –1.78 to –0.77; P < 0.001) and unselected men from SAA (–0.65; –1.29 to –0.01; P = 0.045) and fertile men from NEA (–0.50; –1.00 to –0.01; P = 0.046). Among unselected men from all continents, the mean SC declined by 51.6% between 1973 and 2018 (–1.17: –1.66 to –0.68; P < 0.001). The slope for SC among unselected men was steeper in a model restricted to post-2000 data (–1.73: –3.23 to –0.24; P = 0.024) and the percent decline per year doubled, increasing from 1.16% post-1972 to 2.64% post-2000. Results were similar for TSC, with a 62.3% overall decline among unselected men (–4.70 million/year; –6.56 to –2.83; P < 0.001) in the adjusted meta-regression model. All results changed only minimally in multiple sensitivity analyses.
This analysis is the first to report a decline in sperm count among unselected men from South/Central America–Asia–Africa, in contrast to our previous meta-analysis that was underpowered to examine those continents. Furthermore, data suggest that this world-wide decline is continuing in the 21st century at an accelerated pace. Research on the causes of this continuing decline and actions to prevent further disruption of male reproductive health are urgently needed.
Introduction
In 2017, ‘Temporal Trends in Sperm Count: A Systematic Review and Meta-Regression Analysis’ was published by this journal (Levine et al., 2017). That article, which was widely discussed and highly cited, includes all eligible English-language publications in 1981–2013 that contained data on sperm count. We reported a very strong decline in sperm concentration (SC) and total sperm count (TSC) in North America, Europe, Australia/New Zealand (hereafter NEA) but too few studies have been published in South/Central America, Asia and Africa (hereafter SAA) to draw a conclusion about trends in those continents. We examined mean SC and TSC as a function of collection year, as approximated by the mid-year of the sample collection period. Because sample collection preceded the year of publication by an average of 6 years, our results were already somewhat dated by the time we published our analysis in 2017. Therefore, we conducted a new literature search in the spring of 2020 to identify eligible studies published between 1 January 2014 and 31 December 2019. Here, we report on global trends in SC and TSC in publications 1981 through 2019, which combines results of both searches and analyses. This expanded analysis addresses two important questions. With increased sample size, was a trend seen in South America, Africa and Asia? Did the trends we reported continue post-2011?
Recognition of the importance of male reproductive function, and sperm count, has increased since 2017 (Levine et al., 2018; United Nations Human Rights Office of the High Commissioner, 2019). The economic and societal burden of male infertility is now widely recognized, as is the unequal burden of male infertility which falls most heavily on low-income populations (Winters and Walsh, 2014; Hauser et al., 2015; Dupree, 2016; Skakkebaek et al., 2016). Increasingly strong evidence links reduced sperm count and concentration to increases in all-cause mortality and morbidity (Latif et al., 2017; Del Giudice et al., 2021; Ferlin et al., 2021). Links between sperm count and infertility are well-recognized (Bonde et al., 1998; Skakkebæk et al., 2022). Furthermore, the decline in sperm count is paralleled by declines in testosterone and increases in testicular cancer and male genital anomalies (Skakkebæk et al., 2022). In fact, the decline in semen quality and male reproductive health has recently been described as a crisis (De Jonge and Barratt, 2019). Relative to declines in sperm counts, these latter trends are far more difficult to document. There is currently no systematic collection of such data, making the examination of those trends difficult. Therefore, an international group of scientists, including several of the authors, has suggested the formation of a multidisciplinary monitoring system for reproductive health indicators that would provide ongoing surveillance of reproductive health outcomes (Le Moal et al., 2016).
The broad implications of a global decline in sperm count, the data gaps left unfilled by our prior analysis and the controversies surrounding these issues warrant an up-to-date meta-analysis. This meta-analysis was conducted to address these aims.
Methods
This systematic review and meta-regression analysis was conducted, and the results reported, in accordance with Meta-analysis in Observational Studies in Epidemiology (Stroup et al., 2000) (Supplementary Table SI) and Preferred Reporting Items for Systematic reviews and Meta-Analysis guidelines (Liberati et al., 2009; Moher et al., 2009). Our research team included epidemiologists, andrologists and a medical librarian. Our predefined protocol was developed following best practices (Borenstein et al., 2009; Higgins and Green 2011; National Toxicology Program 2015) and informed by our previous study. Throughout, unless otherwise noted, the methods of the current study are those employed and published in the previous study (Levine et al., 2017), including keywords and databases searched, eligibility criteria and statistical methods.
Systematic review and study selection
A comprehensive search of the PubMed/MEDLINE and EMBASE was conducted, to identify English-language publications that reported primary data on human sperm count, published in 2014–2019, i.e. from the last date included in our prior search through 2019. On 15 May 2020, we searched MEDLINE/PubMed and Embase (Excerpta Medica database) for peer-reviewed publications meeting our inclusion criteria. Following the recommendation of the Cochrane Handbook for Systematic Reviews, we searched in title and abstract for both index (MeSH) terms and keywords and filtered out animal-only studies. We used the MeSH term ‘sperm count’, which includes seven additional terms, and to increase sensitivity, we added 13 related keywords (e.g. ‘sperm density’, ‘sperm concentration’).
All English-language studies that reported primary data on human SC were considered eligible for abstract screening. We evaluated the eligibility of all subgroups within a study. For example, in a case–control study, the control group might have been eligible for inclusion even though, based on our exclusion criteria, the case group was not.
We divided eligible studies into two fertility-defined groups: (i) studies of men unselected by fertility status, hereafter ‘Unselected’ (e.g. young men unlikely to be aware of their fertility, such as young men being screened for military service or college students) and (ii) studies of men whose partners had born a child or whose partners were pregnant regardless of pregnancy outcome, hereafter ‘Fertile’. ‘Total’ refers to both groups.
A study was excluded if participants were selected based on: (i) infertility or sub-fertility; (ii) their range of semen parameters (e.g. studies selecting normospermic men); and (iii) genital abnormalities, diseases or medications. We also excluded studies limited to men with exposures that may affect fertility, such as an occupational exposure, post-clinical trial intervention or smoking. Studies of candidates for vasectomy or semen donation were included only if semen quality was not a criterion for men’s study participation. We excluded studies that used non-standard methods for sperm collection (e.g. methods other than masturbation) or counting methods other than hemocytometer and studies with fewer than 10 men.
First, the publication was either excluded or advanced to full-text screening based on the title and abstract. Any publication without an abstract was automatically referred for full-text screening. Second, we reviewed the full text and assigned it to exclusion (and categorized the reason for exclusion) or to data extraction. We then confirmed study eligibility and identified multiple publications from the same study to ensure that estimates from the same population were not used more than once.
Data extraction
For each estimate, we extracted summary statistics on SC and TSC (mean, standard deviation, minimum, maximum, median and percentiles), semen volume (mean and measured method), sample size, sample collection years, data on covariates (fertility group, country, age, ejaculation abstinence time reported and per protocol, method of semen collection, method of assessing of SC and semen volume, selection of population, study exclusion criteria and number of samples per man). The range of permissible values, both for categorical and numerical variables and information on data completeness were recorded. Data were extracted on the total population as well as on all eligible subgroups.
Quality control
The current study followed the same protocol as used in the previous study. For the new search, we used Covidence systematic review software for the process of screening the articles that were not available for our prior search. In addition, one member of the research team was replaced. Screening of this extensive systematic review was conducted by a team of eight reviewers (H.L., N.J., A.M.-A., J.M., D.W.-D., M.J., R.P., S.H.S.). As in our previous analysis, the screening protocol was piloted and reviewers were trained by screening of 50 abstracts by all reviewers followed by a comparison of results, resolution of any inconsistencies and clarification of the protocol as needed. The same quality control process was followed for full-text screening and data extraction by all reviewers. All data were entered into digital spreadsheets with explicit permissible values (no open-ended entries) to increase consistency. After data extraction, an additional round of data editing and quality control of all studies was conducted by H.L. and M.J. The process ensured that each study was evaluated by at least two different trained reviewers.
Statistical analysis
We ran all models both on the data used in our 2017 study (for quality control) and for the dataset including all years. In all models, the midpoint of the sample collection period was the independent variable (‘collection year’) and mean SC (or TSC) was the dependent variable(s). Units were million/ml for SC and million for TSC (reported or defined as SC × sample volume). All slopes denote unit change per calendar year, reported with 95% confidence intervals (CI). We also reported values for the first and last years, and the % change/year.
We first used simple linear regression models to estimate SC and TSC as functions of collection year, with each study weighted by sample size. Beside a model for all men (Unselected plus Fertile), we also ran a model stratified by fertility status.
Then, we used random-effects meta-regression to model both SC and TSC as linear functions of time, weighting estimates by the standard error (SE).
In the meta-regression models, we controlled for a predetermined set of potential confounders: fertility group, age, abstinence time, whether semen collection and counting methods were reported, number of samples per man and indicators for exclusion criteria (Supplementary Table SII). The method used to determine sample volume was included when modeling TSC. In all meta-regression analyses, we included indicator variables to denote studies with more than one SC/TSC estimate.
Several variables were imputed including mean SC (and TSC), as described in our 2017 study. For example, for studies that reported median (not mean) SC or TSC, we estimated the mean by adding the average difference between the mean and median in studies for which both were reported.
We included indicators in all models for imputed values. We included continental group (NEA or SAA) and fertility group (Unselected or Fertile), as variables in the model when applicable.
We ran several meta-regression models for both SC and TSC: (i) basic, unadjusted model; (ii) adjusted model for all men; (iii) stratified by fertility; (iv) unselected men only with time × continent interaction; (v) all men with two interactions: time × fertility and time × continent.
These analyses were repeated for subsets of data collection to examine recent trends (>1972, >1985, >1995 and >2000).
We conducted several sensitivity analyses: (i) adding cubic and quadratic terms for collection year in meta-regression analyses to assess non-linearity; (ii) removing covariates one at a time from the model; (iii) excluding a specific group, i.e. the group with no information on age, for each covariate; (iv) replacing age group by mean age, excluding studies that did not report mean age; (v) adding a covariate for high smoking prevalence (>30%); (iv) removing each continent one at a time; (vii) excluding studies with five or more data points to examine the influence of large studies; and (viii) removing studies with SEs >20 million/ml.
All analyses were conducted using STATA version 14.1 (StataCorp).
Results
Systematic review and summary statistics
Using PubMed/MEDLINE and Embase searches, we identified 2936 new publications meeting our criteria for abstract screening (Fig. 1). Of these, 151 duplicate records were removed and 1917 were excluded based on title or abstract screening. Full texts of the remaining 868 articles were reviewed for eligibility and 743 studies were excluded. Of the remaining 125 articles, 87 were excluded during data extraction through the second round of full-text screening; 44 of them were multiple publications. The remaining 38 studies of semen samples from 14 233 men included 44 unique mean SC estimates that met the protocol criteria. Combining these new data with data from our previous meta-analysis, the current meta-analysis includes results from 223 studies, yielding 288 estimates based on semen samples collected 1973–2018 provided by 57 168 men. Data were available from 6 continents and 53 countries (complete list in Supplementary Table SIII and Supplementary Fig. S1). Of the 288 estimates, 118 (41%) were Unselected NEA, 35 (12%) were Unselected SAA, 81 (28%) were Fertile NEA and 54 (19%) were Fertile SAA. The number of estimates from SAA increased by 29%.
Data from the 223 publications included in the meta-analysis are available upon reasonable request through contact with corresponding author (Abyholm, 1981; Fariss et al., 1981; Leto and Frensilli, 1981; Wyrobek et al., 1981a,b; Aitken et al., 1982; Nieschlag et al., 1982; Obwaka et al., 1982; Albertsen et al., 1983; Fowler and Mariano, 1983; Sultan Sheriff, 1983; Wickings et al., 1983; Asch et al., 1984; de Castro and Mastrorocco, 1984; Fredricsson and Sennerstam, 1984; Freischem et al., 1984; Ward et al., 1984; Ayers et al., 1985; Heussner et al., 1985; Rosenberg et al., 1985; Aribarg et al., 1986; Comhaire et al., 1987; Kirei, 1987; Giblin et al., 1988; Kjaergaard et al., 1988; Mieusset et al., 1988, 1995; Jockenhovel et al., 1989; Sobowale and Akiwumi, 1989; Svanborg et al., 1989; Zhong et al., 1990; Culasso et al., 1991; Dunphy et al., 1991; Gottlieb et al., 1991; Nnatu et al., 1991; Pangkahila, 1991; Weidner et al., 1991; Levine et al., 1992; Sheriff and Legnain, 1992; Ali et al., 1993; Arce et al., 1993; Bartoov et al., 1993; Fedder et al., 1993; Noack-Füller et al., 1993; World Health Organization and Task Force on Methods for the Regulation of Male Fertility, 1993; Hill et al., 1994; Rehan, 1994; Rendon et al., 1994; Taneja et al., 1994; Vanhoorne et al., 1994; Auger et al., 1995; Cottell and Harrison, 1995; Figà-Talamanca et al., 1996; Fisch et al., 1996; IrVine et al., 1996; Van Waeleghem et al., 1996; Vierula et al., 1996; Vine et al., 1996; Auger and Jouannet, 1997; Jensen et al., 1997; Lemcke et al., 1997; Handelsman, 1997a,b; Chia et al., 1998; Muller et al., 1998; Naz et al., 1998; Gyllenborg et al., 1999; Kolstad et al., 1999; Kuroki et al., 1999; Larsen et al., 1999; Purakayastha et al., 1999; Reddy and Bordekar, 1999; De Celis et al., 2000; Glazier et al., 2000; Mak et al., 2000; Selevan et al., 2000; Wiltshire et al., 2000; Zhang et al., 2000; Foppiani et al., 2001; Guzick et al., 2001; Hammadeh et al., 2001; Jørgensen et al., 2001, 2002, 2011, 2012; Kelleher et al., 2001; Lee and Coughlin, 2001; Patankar et al., 2001; Tambe et al., 2001; Xiao et al., 2001; Costello et al., 2002; Junqing et al., 2002; Kukuvitis et al., 2002; Luetjens et al., 2002; Punab et al., 2002; Richthoff et al., 2002; Danadevi et al., 2003; de Gouveia Brazao et al., 2003; Firman et al., 2003; Liu et al., 2003; Lundwall et al., 2003; Roste et al., 2003; Serra-Majem et al., 2003; Uhler et al., 2003; Xu et al., 2003; Ebesunun et al., 2004; Rintala et al., 2004; Toft et al., 2004, 2005; Bang et al., 2005; Mahmoud et al., 2005; Muthusami and Chinnaswamy, 2005; O’Donovan, 2005; Tsarev et al., 2005, 2009; Durazzo et al., 2006; Fetic et al., 2006; Giagulli and Carbone, 2006; Haugen et al., 2006; Iwamoto et al., 2006, 2013a,b; Pal et al., 2006; Yucra et al., 2006; Aneck-Hahn et al., 2007; Garcia et al., 2007; Multigner et al., 2007; Plastira et al., 2007; Rignell-Hydbom et al., 2007; Wu et al., 2007; Akutsu et al., 2008; Bhattacharya, 2008; Gallegos et al., 2008; Goulis et al., 2008; Jedrzejczak et al., 2008; Kobayashi et al., 2008; Korrovits et al., 2008; Li and Gu, 2008; Lopez-Teijon et al., 2008; Paasch et al., 2008; Peters et al., 2008; Recabarren et al., 2008; Recio-Vega et al., 2008; Saxena et al., 2008; Shine et al., 2008; Andrade-Rocha, 2009; Kumar et al., 2009, 2011; Rylander et al., 2009; Stewart et al., 2009; Vani et al., 2009, 2012; Verit et al., 2009; Engelbertz et al., 2010; Hossain et al., 2010; Ortiz et al., 2010; Rubes et al., 2010; Tirumala Vani et al., 2010; Al Momani et al., 2011; Auger and Eustache, 2011; Axelsson et al., 2011; Brahem et al., 2011; Jacobsen et al., 2011; Khan et al., 2011; Linschooten et al., 2011; Venkatesh et al., 2011; Vested et al., 2011; Absalan et al., 2012; Al-Janabi et al., 2012; Katukam et al., 2012; Mostafa et al., 2012; Nikoobakht et al., 2012; Rabelo-Junior et al., 2012; Splingart et al., 2012; Bujan et al., 2013; Girela et al., 2013; Halling et al., 2013; Ji et al., 2013; Mendiola et al., 2013; Redmon et al., 2013; Thilagavathi et al., 2013; Valsa et al., 2013; Zalata et al., 2013; Zareba et al., 2013; Huang et al., 2014; Castiglione et al., 2014; Giagulli et al., 2014; Kavitha and Malini, 2014; Liu et al., 2014; Mendiola et al., 2014; Tainio et al., 2014; Evgeni et al., 2015; Franken, 2015; Hosen et al., 2015; Layali et al., 2015; Mohammed et al., 2015; Ramzan et al., 2015; Romero-Otero et al., 2015; Tsao et al., 2015; Valsa et al., 2015; Altintas et al., 2016; Karimian and Colagar, 2016; Malić Vončina et al., 2016; Shirota et al., 2016; Malini, 2017; Mínguez-Alarcón et al., 2017; Pullar et al., 2017; Azad et al., 2018; Fanny et al., 2018; Inih et al., 2018; López-Espín et al., 2018; Lotti et al., 2018; Palani, 2018; Priskorn et al., 2018; Recio-Vega et al., 2018; Ahmed et al., 2019; Bassey et al., 2019; Dhawan et al., 2019; García Rodríguez et al., 2019; Lazzarino et al., 2019; Rodprasert et al., 2019; Antonio et al., 2020; Dias et al., 2020).
Simple linear models
Combining results from all men, SC declined steeply (slope per year –0.87 million/ml; 95% CI: –0.89 to –0.86; P < 0.001) between 1973 and 2018 when using simple linear models, unadjusted and weighted by sample size (Supplementary Table SIV, Supplementary Fig. S2). For all men combined, SC declined by 0.93% per year and overall, by 41.5% between 1973 and 2018. In a model stratified by fertility group, the slope was much steeper for Unselected (–1.23; –1.25 to –1.20; P < 0.001) than for Fertile men (–0.30; –0.33 to –0.27; P < 0.001) (Supplementary Table SIV). A similar trend was seen for TSC when combining the two fertility groups (slope per year = –2.80 million; –2.86 to –2.74; P < 0.001), and the slope was steeper for the Unselected group (–3.77; –3.83 to –3.71; P < 0.001) (Supplementary Table SIV, Supplementary Fig. S2). Semen volume did not change over the study period (slope per year = 0.0002 ml; –0.0001 to 0.0005; P = 0.249).
Meta-regression models for SC
In a basic meta-regression model for SC, in which estimates were weighted by their SE but without covariate adjustment, slopes were slightly less steep than for the simple regression model, and with wider CIs (slope per year –0.66 million/ml; 95% CI: –0.92 to –0.40; P < 0.001). Covariate adjustment did not appreciably alter the slope but widened the CI further (–0.59; –0.90 to –0.27; P < 0.001) (Table I, betas for covariates in Supplementary Table SII).
Model . | Category . | N (estimates) . | First year . | First year SC (million/ml) . | Last year . | Last year SC (million/ml) . | %change/year . | Slope (95% CI), million/ml/year . |
---|---|---|---|---|---|---|---|---|
Total | All men | 288 | 1973 | 83.5 | 2018 | 57.1 | –0.71 | –0.59 (–0.90 to –0.27) |
Stratified | Unselected | 153 | 1973 | 101.2 | 2018 | 49.0 | –1.16 | –1.17 (–1.66 to –0.68) |
Fertile | 135 | 1977 | 77.3 | 2017 | 72.8 | –0.14 | –0.11 (–0.54 to 0.32) | |
Unselected with interaction | Unselected NEAb | 118 | 1973 | 103.7 | 2015 | 49.1 | –1.25 | –1.30 (–1.89 to –0.71) |
Unselected SAAb | 35 | 1986 | 88.3 | 2018 | 61.2 | –0.96 | –0.84 (–1.82 to 0.13) | |
Two interactions | Unselected NEAb | 118 | 1973 | 100.3 | 2015 | 46.8 | –1.27 | –1.27 (–1.78 to –0.77) |
Unselected SAAb | 35 | 1986 | 75.8 | 2018 | 54.9 | –0.86 | –0.65 (–1.29 to –0.01) | |
Fertile NEA | 81 | 1977 | 85.5 | 2017 | 65.1 | –0.59 | –0.50 (–1.00 to –0.01) | |
Fertile SAA | 54 | 1978 | 71.5 | 2016 | 76.4 | 0.18 | 0.13 (–0.42 to 0.67) |
Model . | Category . | N (estimates) . | First year . | First year SC (million/ml) . | Last year . | Last year SC (million/ml) . | %change/year . | Slope (95% CI), million/ml/year . |
---|---|---|---|---|---|---|---|---|
Total | All men | 288 | 1973 | 83.5 | 2018 | 57.1 | –0.71 | –0.59 (–0.90 to –0.27) |
Stratified | Unselected | 153 | 1973 | 101.2 | 2018 | 49.0 | –1.16 | –1.17 (–1.66 to –0.68) |
Fertile | 135 | 1977 | 77.3 | 2017 | 72.8 | –0.14 | –0.11 (–0.54 to 0.32) | |
Unselected with interaction | Unselected NEAb | 118 | 1973 | 103.7 | 2015 | 49.1 | –1.25 | –1.30 (–1.89 to –0.71) |
Unselected SAAb | 35 | 1986 | 88.3 | 2018 | 61.2 | –0.96 | –0.84 (–1.82 to 0.13) | |
Two interactions | Unselected NEAb | 118 | 1973 | 100.3 | 2015 | 46.8 | –1.27 | –1.27 (–1.78 to –0.77) |
Unselected SAAb | 35 | 1986 | 75.8 | 2018 | 54.9 | –0.86 | –0.65 (–1.29 to –0.01) | |
Fertile NEA | 81 | 1977 | 85.5 | 2017 | 65.1 | –0.59 | –0.50 (–1.00 to –0.01) | |
Fertile SAA | 54 | 1978 | 71.5 | 2016 | 76.4 | 0.18 | 0.13 (–0.42 to 0.67) |
Model . | Category . | N (estimates) . | First Year . | First year TSC (million) . | Last year . | Last year TSC (million) . | %change/year . | Slope (95% CI), million/year . |
---|---|---|---|---|---|---|---|---|
Total | All men | 288 | 1973 | 297.4 | 2018 | 205.6 | –0.69 | –2.06 (–3.25 to –0.87) |
Stratified | Unselected | 153 | 1973 | 335.7 | 2018 | 126.6 | –1.40 | –4.70 (–6.56 to –2.83) |
Fertile | 135 | 1977 | 305.8 | 2017 | 296.1 | –0.08 | –0.24 (–1.99 to 1.52) | |
Unselected with interaction | Unselected NEAb | 118 | 1973 | 337.9 | 2015 | 125.9 | –1.49 | –5.05 (–7.31 to –2.79) |
Unselected SAAb | 35 | 1986 | 263.2 | 2018 | 141.7 | –1.44 | –3.79 (–7.58 to –0.01) | |
Two interactions | Unselected NEAb | 118 | 1973 | 350.9 | 2015 | 153.3 | –1.34 | –4.71 (–6.53 to –2.88) |
Unselected SAAb | 35 | 1986 | 229.8 | 2018 | 173.0 | –0.77 | –1.78 (–4.10 to 0.55) | |
Fertile NEA | 81 | 1977 | 303.8 | 2017 | 219.3 | –0.69 | –2.09 (–3.86 to –0.32) | |
Fertile SAA | 54 | 1978 | 216.6 | 2016 | 250.2 | 0.40 | 0.87 (–1.11 to 2.85) |
Model . | Category . | N (estimates) . | First Year . | First year TSC (million) . | Last year . | Last year TSC (million) . | %change/year . | Slope (95% CI), million/year . |
---|---|---|---|---|---|---|---|---|
Total | All men | 288 | 1973 | 297.4 | 2018 | 205.6 | –0.69 | –2.06 (–3.25 to –0.87) |
Stratified | Unselected | 153 | 1973 | 335.7 | 2018 | 126.6 | –1.40 | –4.70 (–6.56 to –2.83) |
Fertile | 135 | 1977 | 305.8 | 2017 | 296.1 | –0.08 | –0.24 (–1.99 to 1.52) | |
Unselected with interaction | Unselected NEAb | 118 | 1973 | 337.9 | 2015 | 125.9 | –1.49 | –5.05 (–7.31 to –2.79) |
Unselected SAAb | 35 | 1986 | 263.2 | 2018 | 141.7 | –1.44 | –3.79 (–7.58 to –0.01) | |
Two interactions | Unselected NEAb | 118 | 1973 | 350.9 | 2015 | 153.3 | –1.34 | –4.71 (–6.53 to –2.88) |
Unselected SAAb | 35 | 1986 | 229.8 | 2018 | 173.0 | –0.77 | –1.78 (–4.10 to 0.55) | |
Fertile NEA | 81 | 1977 | 303.8 | 2017 | 219.3 | –0.69 | –2.09 (–3.86 to –0.32) | |
Fertile SAA | 54 | 1978 | 216.6 | 2016 | 250.2 | 0.40 | 0.87 (–1.11 to 2.85) |
Meta-regression model, adjusted for continents, age, abstinence time, semen collection method reported, counting method reported, having more than one sample per men, indicators for study selection of population and exclusion criteria (some vasectomy candidates, some semen donor candidates, exclusion of men with chronic diseases, exclusion by other reasons not related to fertility, selection by occupation not related to fertility), whether collection year was estimated, whether arithmetic mean of SC was estimated, whether SE of SC was estimated and indicator variable to denote studies with more than one estimate. Sperm concentration (SC) meta-regression models weighted by SC SE, adjusted for similar covariates and method used to assess semen volume. Total sperm count (TSC) meta-regression models weighted by TSC SE, adjusted for similar covariates and method used to assess semen volume.
NEA, North America–Europe–Australia; SAA, South/Central America–Asia–Africa.
Model . | Category . | N (estimates) . | First year . | First year SC (million/ml) . | Last year . | Last year SC (million/ml) . | %change/year . | Slope (95% CI), million/ml/year . |
---|---|---|---|---|---|---|---|---|
Total | All men | 288 | 1973 | 83.5 | 2018 | 57.1 | –0.71 | –0.59 (–0.90 to –0.27) |
Stratified | Unselected | 153 | 1973 | 101.2 | 2018 | 49.0 | –1.16 | –1.17 (–1.66 to –0.68) |
Fertile | 135 | 1977 | 77.3 | 2017 | 72.8 | –0.14 | –0.11 (–0.54 to 0.32) | |
Unselected with interaction | Unselected NEAb | 118 | 1973 | 103.7 | 2015 | 49.1 | –1.25 | –1.30 (–1.89 to –0.71) |
Unselected SAAb | 35 | 1986 | 88.3 | 2018 | 61.2 | –0.96 | –0.84 (–1.82 to 0.13) | |
Two interactions | Unselected NEAb | 118 | 1973 | 100.3 | 2015 | 46.8 | –1.27 | –1.27 (–1.78 to –0.77) |
Unselected SAAb | 35 | 1986 | 75.8 | 2018 | 54.9 | –0.86 | –0.65 (–1.29 to –0.01) | |
Fertile NEA | 81 | 1977 | 85.5 | 2017 | 65.1 | –0.59 | –0.50 (–1.00 to –0.01) | |
Fertile SAA | 54 | 1978 | 71.5 | 2016 | 76.4 | 0.18 | 0.13 (–0.42 to 0.67) |
Model . | Category . | N (estimates) . | First year . | First year SC (million/ml) . | Last year . | Last year SC (million/ml) . | %change/year . | Slope (95% CI), million/ml/year . |
---|---|---|---|---|---|---|---|---|
Total | All men | 288 | 1973 | 83.5 | 2018 | 57.1 | –0.71 | –0.59 (–0.90 to –0.27) |
Stratified | Unselected | 153 | 1973 | 101.2 | 2018 | 49.0 | –1.16 | –1.17 (–1.66 to –0.68) |
Fertile | 135 | 1977 | 77.3 | 2017 | 72.8 | –0.14 | –0.11 (–0.54 to 0.32) | |
Unselected with interaction | Unselected NEAb | 118 | 1973 | 103.7 | 2015 | 49.1 | –1.25 | –1.30 (–1.89 to –0.71) |
Unselected SAAb | 35 | 1986 | 88.3 | 2018 | 61.2 | –0.96 | –0.84 (–1.82 to 0.13) | |
Two interactions | Unselected NEAb | 118 | 1973 | 100.3 | 2015 | 46.8 | –1.27 | –1.27 (–1.78 to –0.77) |
Unselected SAAb | 35 | 1986 | 75.8 | 2018 | 54.9 | –0.86 | –0.65 (–1.29 to –0.01) | |
Fertile NEA | 81 | 1977 | 85.5 | 2017 | 65.1 | –0.59 | –0.50 (–1.00 to –0.01) | |
Fertile SAA | 54 | 1978 | 71.5 | 2016 | 76.4 | 0.18 | 0.13 (–0.42 to 0.67) |
Model . | Category . | N (estimates) . | First Year . | First year TSC (million) . | Last year . | Last year TSC (million) . | %change/year . | Slope (95% CI), million/year . |
---|---|---|---|---|---|---|---|---|
Total | All men | 288 | 1973 | 297.4 | 2018 | 205.6 | –0.69 | –2.06 (–3.25 to –0.87) |
Stratified | Unselected | 153 | 1973 | 335.7 | 2018 | 126.6 | –1.40 | –4.70 (–6.56 to –2.83) |
Fertile | 135 | 1977 | 305.8 | 2017 | 296.1 | –0.08 | –0.24 (–1.99 to 1.52) | |
Unselected with interaction | Unselected NEAb | 118 | 1973 | 337.9 | 2015 | 125.9 | –1.49 | –5.05 (–7.31 to –2.79) |
Unselected SAAb | 35 | 1986 | 263.2 | 2018 | 141.7 | –1.44 | –3.79 (–7.58 to –0.01) | |
Two interactions | Unselected NEAb | 118 | 1973 | 350.9 | 2015 | 153.3 | –1.34 | –4.71 (–6.53 to –2.88) |
Unselected SAAb | 35 | 1986 | 229.8 | 2018 | 173.0 | –0.77 | –1.78 (–4.10 to 0.55) | |
Fertile NEA | 81 | 1977 | 303.8 | 2017 | 219.3 | –0.69 | –2.09 (–3.86 to –0.32) | |
Fertile SAA | 54 | 1978 | 216.6 | 2016 | 250.2 | 0.40 | 0.87 (–1.11 to 2.85) |
Model . | Category . | N (estimates) . | First Year . | First year TSC (million) . | Last year . | Last year TSC (million) . | %change/year . | Slope (95% CI), million/year . |
---|---|---|---|---|---|---|---|---|
Total | All men | 288 | 1973 | 297.4 | 2018 | 205.6 | –0.69 | –2.06 (–3.25 to –0.87) |
Stratified | Unselected | 153 | 1973 | 335.7 | 2018 | 126.6 | –1.40 | –4.70 (–6.56 to –2.83) |
Fertile | 135 | 1977 | 305.8 | 2017 | 296.1 | –0.08 | –0.24 (–1.99 to 1.52) | |
Unselected with interaction | Unselected NEAb | 118 | 1973 | 337.9 | 2015 | 125.9 | –1.49 | –5.05 (–7.31 to –2.79) |
Unselected SAAb | 35 | 1986 | 263.2 | 2018 | 141.7 | –1.44 | –3.79 (–7.58 to –0.01) | |
Two interactions | Unselected NEAb | 118 | 1973 | 350.9 | 2015 | 153.3 | –1.34 | –4.71 (–6.53 to –2.88) |
Unselected SAAb | 35 | 1986 | 229.8 | 2018 | 173.0 | –0.77 | –1.78 (–4.10 to 0.55) | |
Fertile NEA | 81 | 1977 | 303.8 | 2017 | 219.3 | –0.69 | –2.09 (–3.86 to –0.32) | |
Fertile SAA | 54 | 1978 | 216.6 | 2016 | 250.2 | 0.40 | 0.87 (–1.11 to 2.85) |
Meta-regression model, adjusted for continents, age, abstinence time, semen collection method reported, counting method reported, having more than one sample per men, indicators for study selection of population and exclusion criteria (some vasectomy candidates, some semen donor candidates, exclusion of men with chronic diseases, exclusion by other reasons not related to fertility, selection by occupation not related to fertility), whether collection year was estimated, whether arithmetic mean of SC was estimated, whether SE of SC was estimated and indicator variable to denote studies with more than one estimate. Sperm concentration (SC) meta-regression models weighted by SC SE, adjusted for similar covariates and method used to assess semen volume. Total sperm count (TSC) meta-regression models weighted by TSC SE, adjusted for similar covariates and method used to assess semen volume.
NEA, North America–Europe–Australia; SAA, South/Central America–Asia–Africa.
After stratifying by fertility group and adjusting for all covariates including continental group, there was a strong decline in SC among unselected men (–1.17; –1.66 to –0.68; P < 0.001) but not among fertile men (–0.11; –0.54 to 0.32; P = 0.615) (Table I, Fig. 2). Using SC model estimates of 101.2 million/ml in 1973 and 49.0 million/ml in 2018, SC declined among unselected men by 1.16% per year and 51.6% overall (Table I).
In an adjusted meta-regression model among unselected men that included interaction by geographic group (P for interaction = 0.44), the slope for Unselected NEA was –1.30 (–1.89 to –0.71; P < 0.001) and the slope for Unselected SAA was –0.84 (–1.82 to 0.13, P = 0.088) (Table I, Fig. 3).
In an adjusted meta-regression model, which included all men and two interaction terms [time × fertility group (P = 0.012) and time × continents (P = 0.058)], declines were seen among Unselected NEA (–1.27; –1.78 to –0.77; P < 0.001), Unselected SAA (–0.65; –1.29 to –0.01; P = 0.045) and Fertile NEA (–0.50; –1.00 to –0.01; P = 0.046) (Table I, Supplementary Fig. S3).
Meta-regression models for TSC
Overall, TSC trends were similar to those for SC. In an adjusted meta-regression model for all men, there was a steep decline in TSC (slope per year –2.06 million, –3.25 to –0.87; P = 0.001) (Table I).
After stratifying by fertility group and adjusting for all covariates including continent, there was a strong decline in TSC among unselected men (–4.70; –6.56 to –2.83; P < 0.001) but not among fertile men (–0.24; –1.99 to 1.52; P = 0.788) (Table I, Fig. 2). Using TSC model estimates of 335.7 million in 1973 and 126.6 million in 2018, TSC declined among unselected men by 1.40% per year and 62.3% overall (Table I).
In an adjusted meta-regression model among unselected men, including interaction by geographic group (P for interaction = 0.44), the slope for Unselected NEA was –5.05 (–7.31 to –2.79; P < 0.001) and the slope for Unselected SAA was –3.79 (–7.58 to –0.01, P = 0.049) (Table I, Fig. 3).
In an adjusted meta-regression model, which included all men and two interaction terms [time × fertility group (P = 0.013) and time × continents (P = 0.015)], declines were seen among Unselected NEA (–4.71; –6.53 to –2.88; P < 0.001), Unselected SAA (–1.78; –4.10 to 0.55; P = 0.133) and Fertile NEA (–2.09; –3.86 to –0.32; P = 0.021) (Table I, Supplementary Fig. S3).
Meta-regression models for recent periods
We also restricted the analysis of unselected men (in all continents) to recent time intervals (Table II, Fig. 2). Post-1995, the slope for SC was somewhat steeper (–1.33; –2.41 to –0.26; P = 0.016) and was steeper still post-2000 (–1.73; –3.23 to –0.24; P = 0.024). There was a marked increase in the percent decline in SC per year in the recent period, from 1.16% post-1972 to 2.64% post-2000 (Fig. 4). Post-2000, the slope for TSC (–5.26, –10.72 to 0.19; P = 0.058) was also steeper than that for post-1972 (Table II, Fig. 2).
Period (years) . | N (estimates) . | First year . | First year SC (million/ml) . | Last year . | Last year SC (million/ml) . | % change/year . | Slope (95% CI), million/ml/year . |
---|---|---|---|---|---|---|---|
>1972 | 153 | 1973 | 101.2 | 2018 | 49.0 | –1.16 | –1.17 (–1.66 to –0.68) |
>1985 | 131 | 1985 | 82.3 | 2018 | 47.1 | –1.31 | –1.08 (–1.68 to –0.49) |
>1995 | 89 | 1995 | 70.1 | 2018 | 40.1 | –1.90 | –1.33 (–2.41 to –0.26) |
>2000 | 60 | 2000 | 65.6 | 2018 | 35.3 | –2.64 | –1.73 (–3.23 to –0.24) |
Period (years) . | N (estimates) . | First year . | First year SC (million/ml) . | Last year . | Last year SC (million/ml) . | % change/year . | Slope (95% CI), million/ml/year . |
---|---|---|---|---|---|---|---|
>1972 | 153 | 1973 | 101.2 | 2018 | 49.0 | –1.16 | –1.17 (–1.66 to –0.68) |
>1985 | 131 | 1985 | 82.3 | 2018 | 47.1 | –1.31 | –1.08 (–1.68 to –0.49) |
>1995 | 89 | 1995 | 70.1 | 2018 | 40.1 | –1.90 | –1.33 (–2.41 to –0.26) |
>2000 | 60 | 2000 | 65.6 | 2018 | 35.3 | –2.64 | –1.73 (–3.23 to –0.24) |
Period (years) . | N (estimates) . | First year . | First year TSC (million) . | Last year . | Last year TSC (million) . | % change/year . | Slope (95% CI), million/year . |
---|---|---|---|---|---|---|---|
>1972 | 153 | 1973 | 335.7 | 2018 | 126.6 | –1.40 | –4.70 (–6.56 to –2.83) |
>1985 | 131 | 1985 | 275.2 | 2018 | 105.6 | –1.90 | –5.22 (–7.62 to –2.82) |
>1995 | 89 | 1995 | 231.1 | 2018 | 138.5 | –1.78 | –4.11 (–8.21 to –0.02) |
>2000 | 60 | 2000 | 212.1 | 2018 | 120.0 | –2.48 | –5.26 (–10.72 to 0.19) |
Period (years) . | N (estimates) . | First year . | First year TSC (million) . | Last year . | Last year TSC (million) . | % change/year . | Slope (95% CI), million/year . |
---|---|---|---|---|---|---|---|
>1972 | 153 | 1973 | 335.7 | 2018 | 126.6 | –1.40 | –4.70 (–6.56 to –2.83) |
>1985 | 131 | 1985 | 275.2 | 2018 | 105.6 | –1.90 | –5.22 (–7.62 to –2.82) |
>1995 | 89 | 1995 | 231.1 | 2018 | 138.5 | –1.78 | –4.11 (–8.21 to –0.02) |
>2000 | 60 | 2000 | 212.1 | 2018 | 120.0 | –2.48 | –5.26 (–10.72 to 0.19) |
Stratified meta-regression model, adjusted for continents, age, abstinence time, semen collection method reported, counting method reported, having more than one sample per men, indicators for study selection of population and exclusion criteria (some vasectomy candidates, some semen donor candidates, exclusion of men with chronic diseases, exclusion by other reasons not related to fertility, selection by occupation not related to fertility), whether collection year was estimated, whether arithmetic mean of SC was estimated, whether SE of SC was estimated and indicator variable to denote studies with more than one estimate. Sperm concentration (SC) meta-regression models weighted by SC SE, adjusted for similar covariates and method used to assess semen volume. Total sperm count (TSC) meta-regression models weighted by TSC SE, adjusted for similar covariates and method used to assess semen volume.
Period (years) . | N (estimates) . | First year . | First year SC (million/ml) . | Last year . | Last year SC (million/ml) . | % change/year . | Slope (95% CI), million/ml/year . |
---|---|---|---|---|---|---|---|
>1972 | 153 | 1973 | 101.2 | 2018 | 49.0 | –1.16 | –1.17 (–1.66 to –0.68) |
>1985 | 131 | 1985 | 82.3 | 2018 | 47.1 | –1.31 | –1.08 (–1.68 to –0.49) |
>1995 | 89 | 1995 | 70.1 | 2018 | 40.1 | –1.90 | –1.33 (–2.41 to –0.26) |
>2000 | 60 | 2000 | 65.6 | 2018 | 35.3 | –2.64 | –1.73 (–3.23 to –0.24) |
Period (years) . | N (estimates) . | First year . | First year SC (million/ml) . | Last year . | Last year SC (million/ml) . | % change/year . | Slope (95% CI), million/ml/year . |
---|---|---|---|---|---|---|---|
>1972 | 153 | 1973 | 101.2 | 2018 | 49.0 | –1.16 | –1.17 (–1.66 to –0.68) |
>1985 | 131 | 1985 | 82.3 | 2018 | 47.1 | –1.31 | –1.08 (–1.68 to –0.49) |
>1995 | 89 | 1995 | 70.1 | 2018 | 40.1 | –1.90 | –1.33 (–2.41 to –0.26) |
>2000 | 60 | 2000 | 65.6 | 2018 | 35.3 | –2.64 | –1.73 (–3.23 to –0.24) |
Period (years) . | N (estimates) . | First year . | First year TSC (million) . | Last year . | Last year TSC (million) . | % change/year . | Slope (95% CI), million/year . |
---|---|---|---|---|---|---|---|
>1972 | 153 | 1973 | 335.7 | 2018 | 126.6 | –1.40 | –4.70 (–6.56 to –2.83) |
>1985 | 131 | 1985 | 275.2 | 2018 | 105.6 | –1.90 | –5.22 (–7.62 to –2.82) |
>1995 | 89 | 1995 | 231.1 | 2018 | 138.5 | –1.78 | –4.11 (–8.21 to –0.02) |
>2000 | 60 | 2000 | 212.1 | 2018 | 120.0 | –2.48 | –5.26 (–10.72 to 0.19) |
Period (years) . | N (estimates) . | First year . | First year TSC (million) . | Last year . | Last year TSC (million) . | % change/year . | Slope (95% CI), million/year . |
---|---|---|---|---|---|---|---|
>1972 | 153 | 1973 | 335.7 | 2018 | 126.6 | –1.40 | –4.70 (–6.56 to –2.83) |
>1985 | 131 | 1985 | 275.2 | 2018 | 105.6 | –1.90 | –5.22 (–7.62 to –2.82) |
>1995 | 89 | 1995 | 231.1 | 2018 | 138.5 | –1.78 | –4.11 (–8.21 to –0.02) |
>2000 | 60 | 2000 | 212.1 | 2018 | 120.0 | –2.48 | –5.26 (–10.72 to 0.19) |
Stratified meta-regression model, adjusted for continents, age, abstinence time, semen collection method reported, counting method reported, having more than one sample per men, indicators for study selection of population and exclusion criteria (some vasectomy candidates, some semen donor candidates, exclusion of men with chronic diseases, exclusion by other reasons not related to fertility, selection by occupation not related to fertility), whether collection year was estimated, whether arithmetic mean of SC was estimated, whether SE of SC was estimated and indicator variable to denote studies with more than one estimate. Sperm concentration (SC) meta-regression models weighted by SC SE, adjusted for similar covariates and method used to assess semen volume. Total sperm count (TSC) meta-regression models weighted by TSC SE, adjusted for similar covariates and method used to assess semen volume.
Sensitivity analyses
We performed multiple analyses to examine the sensitivity of results to assumptions about our model, linearity, influence of covariates and imputation of missing data. Results from sensitivity analyses are only shown here for SC in unselected men (slope = –1.17 million/ml/year in the main model). In all sensitivity analyses (except one elaborated hereafter), there was a strong (>1.0 million/ml/year) decline in SC in the Unselected group, with P < 0.01.
Adding a quadratic or cubic function of year to meta-regression model did not substantially change the association between year and SC or improve the model fit: coefficient for the quadratic term: 0.04; 95% CI: –0.07 to 0.08, P = 0.135; for the cubic term 0.0005; 95% CI: –0.0007 to 0.001, P = 0.086).
For each covariate, we conducted two sensitivity analyses: (i) removing the covariate and (ii) by excluding a specific group, for each covariate (Supplementary Table SV). Excluding 47 estimates with no data on mean age and adjusting for mean age instead of age group, yielded a slope of –1.27 million/ml/year (–1.86 to –0.68; P < 0.001), which is similar to the main model. The sensitivity analysis which showed a more than minimal change in slope was the exclusion of 39 estimates with no information on age, which yielded a slightly diminished slope of –0.94 million/ml/year (–1.51 to –0.37, P = 0.002).
The proportion of smokers was reported in only 26.0% of studies and in 18.1% of studies of unselected men. To examine this variable, we ran a sensitivity analysis including a covariate for ‘high proportion of smokers’ (> 30%), and the slopes changed only slightly (–1.20 million/ml, –1.70 to –0.71; P < 0.001).
Results for Unselected men did not materially change with additional sensitivity analysis, by exclusion of estimates from any specific continent (Supplementary Table SVI). Slopes were also robust after excluding the four studies with five or more data points (–1.04, –1.55 to –0.53; P < 0.001), or excluding five estimates with a SE of SC > 20 million/ml (−1.11, –1.61 to –0.62; P < 0.001).
Due to a typo in the value extracted for Rubes et al. (2010) in the previous meta-analysis, we repeated the analysis without this study as well as with the corrected value. The results did not materially change.
Discussion
The results of the present study extend those of our 2017 meta-analysis. As further elaborated below, the new data allow for new analyses and new results. We provide strong evidence, for the first time, of a decline in sperm counts among men from South/Central America, Asia, and Africa, as well as a world-wide decline in the 21st century, with data suggesting that the pace of this decline has accelerated.
Key findings
In our prior systematic review and meta-analysis (Levine et al., 2017), we reported a marked, continuing decline in both SC and TSC in North America, Europe and Australia based on samples collected between 1973 and 2011. What is new in the current analysis?
Our current analysis, the largest ever to examine temporal trends in sperm counts, extends both the study period and the number of estimates. This new analysis includes seven additional years of sample collection and adds 44 estimates to the 244 included in the earlier analysis. It is therefore both more robust and more temporally relevant.
The distribution of contributing countries has changed since our 2017 analysis. The new analysis appreciably increases the number of studies from SAA. With this increase in sample size, there is now adequate power to examine trends in SC and TSC in those continents. This analysis provides strong evidence, for the first time, of an appreciable decline in sperm counts among unselected men from SAA. Importantly, this analysis also demonstrates an accelerated decline in SC and TSC post-2000. In summary, this update confirms, extends and strengthens the results of our 2017 analysis.
Comparison to prior studies
Table III compares basic characteristics and results of the current study with those of Carlsen et al. (1992), Swan et al. (2000) and Levine et al. (2017), studies that together include data collected over more than 80 years. It is notable that although search and statistical analysis methods have become more sophisticated, and the distribution of studies has changed (with the proportion from SAA increasing), these slopes are remarkably consistent.
First author (publication year) . | Carlsen et al. (1992) . | Swan et al. (2000) . | Levine et al. (2017) . | Levine et al. (2022), current paper . |
---|---|---|---|---|
Publication years | 1938–1990 | 1934–1996 | 1981–2013 | 1981–2019 |
Number of studies | 61 | 101 | 185 | 223 |
Number of countries | 20 | 28 | 50 | 53 |
Fertility group: N (%) | ||||
Unselected | 22 (36%) | 50 (50%) | 140 (57%) | 153 (53%) |
Fertile | 39 (64%) | 51 (50%)a | 104 (43%) | 135 (47%) |
Continents: N (%) | ||||
NEAb | 45 (74%) | 78 (77%) | 175 (72%) | 199 (69%) |
SAA | 16 (26%) | 23 (23%) | 69 (28%) | 89 (31%) |
Slope | –0.93 | –0.94 | –0.70 | –0.87 |
P-value | <0.001 | <0.001 | <0.001 | <0.001 |
First author (publication year) . | Carlsen et al. (1992) . | Swan et al. (2000) . | Levine et al. (2017) . | Levine et al. (2022), current paper . |
---|---|---|---|---|
Publication years | 1938–1990 | 1934–1996 | 1981–2013 | 1981–2019 |
Number of studies | 61 | 101 | 185 | 223 |
Number of countries | 20 | 28 | 50 | 53 |
Fertility group: N (%) | ||||
Unselected | 22 (36%) | 50 (50%) | 140 (57%) | 153 (53%) |
Fertile | 39 (64%) | 51 (50%)a | 104 (43%) | 135 (47%) |
Continents: N (%) | ||||
NEAb | 45 (74%) | 78 (77%) | 175 (72%) | 199 (69%) |
SAA | 16 (26%) | 23 (23%) | 69 (28%) | 89 (31%) |
Slope | –0.93 | –0.94 | –0.70 | –0.87 |
P-value | <0.001 | <0.001 | <0.001 | <0.001 |
Wife pregnant or post-partum or at least 90% of men with proven fertility.
NEA includes studies from North America–Europe–Australia. SAA includes studies from South/Central America–Asia–Africa.
First author (publication year) . | Carlsen et al. (1992) . | Swan et al. (2000) . | Levine et al. (2017) . | Levine et al. (2022), current paper . |
---|---|---|---|---|
Publication years | 1938–1990 | 1934–1996 | 1981–2013 | 1981–2019 |
Number of studies | 61 | 101 | 185 | 223 |
Number of countries | 20 | 28 | 50 | 53 |
Fertility group: N (%) | ||||
Unselected | 22 (36%) | 50 (50%) | 140 (57%) | 153 (53%) |
Fertile | 39 (64%) | 51 (50%)a | 104 (43%) | 135 (47%) |
Continents: N (%) | ||||
NEAb | 45 (74%) | 78 (77%) | 175 (72%) | 199 (69%) |
SAA | 16 (26%) | 23 (23%) | 69 (28%) | 89 (31%) |
Slope | –0.93 | –0.94 | –0.70 | –0.87 |
P-value | <0.001 | <0.001 | <0.001 | <0.001 |
First author (publication year) . | Carlsen et al. (1992) . | Swan et al. (2000) . | Levine et al. (2017) . | Levine et al. (2022), current paper . |
---|---|---|---|---|
Publication years | 1938–1990 | 1934–1996 | 1981–2013 | 1981–2019 |
Number of studies | 61 | 101 | 185 | 223 |
Number of countries | 20 | 28 | 50 | 53 |
Fertility group: N (%) | ||||
Unselected | 22 (36%) | 50 (50%) | 140 (57%) | 153 (53%) |
Fertile | 39 (64%) | 51 (50%)a | 104 (43%) | 135 (47%) |
Continents: N (%) | ||||
NEAb | 45 (74%) | 78 (77%) | 175 (72%) | 199 (69%) |
SAA | 16 (26%) | 23 (23%) | 69 (28%) | 89 (31%) |
Slope | –0.93 | –0.94 | –0.70 | –0.87 |
P-value | <0.001 | <0.001 | <0.001 | <0.001 |
Wife pregnant or post-partum or at least 90% of men with proven fertility.
NEA includes studies from North America–Europe–Australia. SAA includes studies from South/Central America–Asia–Africa.
Comparing the current analysis with Levine et al. (2017), we note that the methods for searching and screening the literature, which are well documented in both, have not changed, nor have the analytic methods.
In both our current and past analyses, we excluded studies that selected men based on criteria that were likely to affect sperm count (e.g. requiring a minimum sperm count, or men’s participation in a sperm bank) with one exception. Studies of fertile men were included as a separate stratum (denoted Fertile). This group of studies includes fathers or partners of pregnant women. Thus, these men had either themselves helped to conceive a pregnancy, or the pregnancy was the result of in vitro fertilization (IVF). The proportion of IVF births has increased over the study period of this analysis, with eight million IVF babies born worldwide since the world’s first IVF birth in 1978 (European Society of Human Reproduction and Embryology, 2018). Among the 135 studies categorized as Fertile in this analysis, only 27.4% explicitly excluded pregnancies conceived by IVF.
Here, as in our 2017 analysis, we stratified countries into two groups, because of the potential for confounding of trends by geography. In the past, we referred to these two groups of countries as ‘Western’ and ‘Other’. Though not our intent, it became apparent that these terms had the potential to be misinterpreted and become politicized. Therefore, we now refer to these two groups of studies by the continent in which data were collected: ‘NEA’ (North America, Europe, and Australia) and ‘SAA’ (South/Central America, Africa, and Asia). We present results stratified by continental group, as well as combined.
Other issues
Could the declines we report be simply the result of a random decrease in a pattern of fluctuations (termed ‘sperm variability’) (Boulicault et al., 2021)? The continued decline demonstrated in this, and earlier, meta-analyses provide strong evidence that this is not the case. If, in fact, the declines we are reporting were merely the result of random fluctuation in sperm count, we would expect, on average, some percent of studies to report no change and the remainder to report (approximately) an equal number of increases and decreases. The literature does not support this (Jørgensen et al., 2021; Aitken, 2022).
While sperm count is an imperfect proxy for fertility, SC and TSC are closely linked to fertility chances (Guzick et al., 2001). The relationship between SC and time to conception is nonlinear. Thus, past a threshold of 40–50 million/ml, a higher SC does not necessarily imply a higher probability of conception. On the other hand, below that threshold, the probability of conception drops off rapidly as SC declines (Bonde et al., 1998). On a population level, the drop in mean SC from 104 to 49 million/ml that we report here implies a substantial increase in the proportion of men with delayed time to conception. Thus, SC provides the most stable and reliable measurement for comparisons within and among populations and over time.
Strengths
In this study, as in Levine et al. (2017), we used written protocols and extensive quality control procedures that minimized potential information and selection bias in all steps of the study. Further strengths of this study include our complete and documented literature search, the review of all retrieved articles by two members of the study team, and the use of current meta-analytic methods. All estimates were weighted by the SE of the measurement and all assumptions were examined in sensitivity analyses. In this study, we re-ran all steps of the prior meta-analyses on the larger, combined data set, as well as on the newly retrieved publications.
Our large number of studies and data points allowed us to control for a pre-determined set of covariates as well as for modification by fertility status and geographic group and variables indicating data completeness and study exclusion criteria.
The methods we used for the systematic literature review and meta-analysis are the most current and widely accepted by the scientific community.
Limitations
We analyzed sperm count (SC and TSC) but not sperm motility and morphology. Interpreting trends in sperm motility and morphology is difficult, since methods have changed markedly over the study period, However, methods for measuring SC have remained largely unchanged. Counting by hemocytometer is the classical way to assess SC and has been recommended by the World Health Organization in all versions of organizations semen analysis manuals (Wang et al., 2022).
If no counting method was stated or use of a method other than hemocytometer was reported, the study was excluded. Overall, 334 studies were excluded because a method other than hemocytometer was used, while 128 studies were excluded because no sperm count method was provided. Of the 288 estimates for SC and TSC included, about half (146) were from studies that stated explicitly that a hemocytometer was used. The remaining 142 estimates were from studies that used World Health Organization methods without naming the particular type of hemocytometer used. We included both groups of studies in our analyses together with a variable indicating whether hemocytometer had been named explicitly as the counting method. In the sensitivity analyses, none of the slopes changed appreciably if we restricted the analysis to studies in which hemocytometer was named as the counting method or if this indicator variable was removed from the model.
Complete elimination of all selection/recruitment bias is impossible, since it is not possible to collect semen samples at random. However, we minimized recruitment bias by evaluating recruitment methods in all included studies. As stated in Methods, studies which selected men based on any variable known to affect sperm count were not eligible. This includes studies that selected men based on a semen parameter, a condition associated with a semen parameter (such as varicocele), or an exposure or occupation associated with semen quality. However, we did include ‘Fertile’ men as a separate group, even though this is a selected group. Compared to unselected men, the slope for fertile men was more modest than that for unselected men from NEA, and no decline was seen among fertile men from SAA. Men classified as ‘fertile’ are problematic in several ways. First, they include both those whose partner has conceived without medical assistance and with medical assistance, the fraction of which varied by time and location. Second, men with lower semen quality are underrepresented among fertile men. Changes in the proportion of fertile men in the population over time could lead to a selection bias in the Fertile group. In contrast, the Unselected group is not prone to such selection bias.
It is also possible that men providing a semen sample differ from those who do not. We previously studied this important question by comparing testosterone and Inhibin B levels in unselected men (potential military recruits undergoing a compulsory routine physical exam to determine their fitness for military service) who agreed to deliver a semen sample to those in men who only agreed to give a blood sample. In both groups of men, the hormone levels were similar (Andersen et al., 2000). Therefore, recruitment bias is unlikely in studies of unselected men.
As in our prior analysis, we included only English-language publications, which was unavoidable given the size of the task and the limited size of our study team. However, of the 2936 publications identified through our database searches in 2020, only 49 were excluded because of language.
In addition, it would be interesting to explore trends in sperm count in a specific continent or even within countries and sub-populations. However, we had inadequate statistical power to examine this question at a finer geographic level. Repeated studies on semen quality in specific populations would complement the current study by providing information about local trends.
Conclusion and wider implications
Our new data and analyses confirm our prior findings of an appreciable decline in sperm count between 1973 and 2018 among men from North America, Europe and Australia and support a decline among unselected men from South/Central America, Africa and Asia. This decline has continued, as predicted by our prior analysis, and has become steeper since 2000. This substantial and persistent decline is now recognized as a significant public health concern. In 2018, a group of leading clinicians and scientists called for governments to acknowledge decreased male fertility as a major public health problem and to recognize the importance of male reproductive health for the survival of the human (and other) species (Levine et al., 2018). Research on the causes of this continuing decline and an immediate focused response to prevent further disruption of male reproductive health are needed.
We hope that the new evidence provided here will receive attention not only from clinicians and scientists, but also from decision-makers and the general public.
Supplementary data
Supplementary data are available at Human Reproduction Update online.
Data availability
The data underlying this article will be shared on reasonable request to the corresponding author.
Acknowledgements
We thank Alexandria Albert for her assistance in the literature review.
Authors’ roles
H.L. had full access to all the data and takes responsibility for the integrity of the data and the accuracy of the data analysis. Study concept and design: H.L. and S.H.S. Search strategy design and execution: H.L. and R.P. Acquisition, analysis or interpretation of data: H.L., N.J., A.M.-A., J.M., D.W.-D., M.J., R.P. and S.H.S. Drafting of the manuscript: H.L., S.H.S. and M.J. Critical revision of the manuscript for important intellectual content: H.L., N.J., A.M.-A, J.M., D.W.-D., M.J., R.P. and S.H.S. Statistical analysis: H.L. and M.J. Administrative, technical or material support: H.L., R.P., S.H.S. and M.J. Study supervision: H.L. and S.H.S.
Funding
We gratefully acknowledge financial support from The Grantham Foundation for the Protection of the Environment. None of the funding sources played a role in the design and conduct of the study; collection, management, analysis and interpretation of the data; preparations, review or approval of the manuscript; or decision to submit the manuscript for publication. Anderson Martino-Andrade is a scholarship recipient from the National Council for Scientific and Technological Development (CNPq, Brazil).
Conflict of interest
All authors declare they have no conflict of interest.
References
Layali I, Tahmasbpour E, Joulaei M, Jorsaraei SGA, Farzanegi P.
Swan SH, Elkin EP, Fenster L.