Why women are not small men: sex-related differences in perioperative cardiopulmonary exercise testing
Perioperative Medicine volume 9, Article number: 18 (2020)
The use of preoperative cardiopulmonary exercise testing (CPET) to evaluate the risk of adverse perioperative outcomes is increasingly prevalent. CPET-derived information enables personalised perioperative care and enhances shared decision-making. Sex-related differences in physical fitness are reported in non-perioperative literature. However, little attention has been paid to sex-related differences in the context of perioperative CPET.
We explored differences in the physical fitness variables reported in a recently published multi-centre study investigating CPET before colorectal surgery. We also report the inclusion rate of females in published perioperative CPET cohorts that are shaping guidelines and clinical practice.
We performed a post hoc analysis of the trial data of 703 patients who underwent CPET prior to major elective colorectal surgery. We also summarised the female inclusion rate in peer-reviewed published reports of perioperative CPET.
Fitness assessed using commonly used perioperative CPET variables—oxygen consumption at anaerobic threshold (AT) and peak exercise—was significantly higher in males than in females both before and after correction for body weight. In studies contributing to the development of perioperative CPET, 68.5% of the participants were male.
To our knowledge, this is the first study to describe differences between males and females in CPET variables used in a perioperative setting. Furthermore, there is a substantial difference between the inclusion rates of males and females in this field. These findings require validation in larger cohorts and may have significant implications for both sexes in the application of CPET in the perioperative setting.
The use of clinical exercise testing is increasingly prevalent in the field of perioperative risk assessment (Huddart et al., 2013). Cardiopulmonary exercise testing (CPET) is the most objective and precise means of assessing physical fitness in surgical candidates and has found utility for the prediction and stratification of surgical risk in various clinical fields including thoracic, vascular and abdominal surgery (Moran et al., 2016; Tew et al., 2018; Goodyear et al., 2013; Brunelli et al., 2009). Current guidelines therefore recommend clinical exercise testing before major surgery (Levett et al., 2018). Nevertheless, although substantial progress is being made in the development of such guidelines for perioperative physicians, these guidelines have not addressed sex-related differences in exercise capacity or other CPET-derived variables (Levett et al., 2018). Amongst the most notable differences between males and females is their body composition and exercise capacity: males generally have less fat tissue, more lean mass and a higher aerobic capacity (Sparling, 1980; Loe et al., 2013). Furthermore, it is known that clinical risk profiles and the response to surgical injury differ between males and females. For example, the incidence of infectious complications, as well as cardiovascular risk profiles, differs between sexes (Foxman, 2002; Oberholzer et al., 2000; Falagas et al., 2007; Appelman et al., 2015). In the specific case of CPET, it has been shown that VO2 peak, a measure for maximum oxygen uptake, is lower in females than males in a healthy population, even when matched for age, weight and body mass index (Koch et al., 2009). Furthermore, in cardiovascular disease, it has been shown that these differences in CPET-derived variables are related to differences in prognosis. A reduction of 5 ml/kg/min in the VO2 peak from the “classical cut-off value” (14 ml/kg/min) has been proposed for females in order to establish an accurate prognosis in cases of heart failure (Corrà et al., 2013).
Non-sex-specific reference values established by Jones et al. and Hansen et al. have traditionally been used in perioperative practice to assess physical fitness and estimate surgical risk for females and males alike (Jones et al., 1985; Hansen et al., 1984; Balady et al., 2010). More males than females were included in these two studies, with the second including male participants only (Hansen et al., 1984). A recently updated systematic review showed that, in studies establishing CPET reference values in healthy adults, females are underrepresented and account for 38% of the total number of included study participants (Takken et al., 2019). Although an increasing number of hospitals in the UK use CPET to assess surgical risk, separate reference values for males and females have not been reported in recent perioperative CPET publications (Levett et al., 2018; Reeves et al., 2018).
This phenomenon is not unique to the field of perioperative risk assessment: it is seen in a broad range of research fields. More than 25 years after the introduction of the US National Institute of Health Revitalization Act of 1993, which required the enrolment of female participants in federally supported Phase III clinical trials (Labor USCSCo, Resources H, 1993), sex-related differences are disregarded in a large proportion of clinical research (Rochon et al., 1998; Gupta & Wenger, 2012). Epidemiological and clinical studies have often shown differences between males and females in terms of disease incidence, aetiology and response to therapy (Light et al., 2006; Siegel et al., 2017; National Institutes of Health (NIH), 2018). Females are more likely to suffer adverse drug reactions (Patel et al., 2007; Pirmohamed et al., 2004) and a study from 2005 reported that eight out of ten prescription drugs were taken off the market due to health issues in females (Simon, 2005). Despite these issues, both the inclusion of females and female-specific subgroup analyses are underreported in the literature. In the era of precision medicine and personalised treatment, this is a surprising observation with striking and significant implications.
In summary, despite the rapid increase in the prevalence of the clinical application of perioperative CPET, little research is taking place to elucidate differences between the sexes and whether those differences have any bearing on the applicability of CPET in the prediction of surgical risk (Balady et al., 2010). The aim of the present analysis is to highlight this gap in the current literature and to perform a brief analysis of sex-related differences in CPET-derived variables from a recently published multi-centre observational study in patients scheduled to undergo perioperative CPET and major colorectal surgery.
We performed a post hoc analysis of a cohort of 703 consecutive patients, of whom 428 were male, in multiple centres in the UK. The design and results of this study have been described elsewhere (West et al., 2016). In short, all patients underwent CPET before major elective surgery in line with American Thoracic Society/American College of Chest Physicians recommendations (Society, 2003). Clinical outcomes including complications, length of stay and mortality were recorded. The primary outcomes of our analysis were oxygen uptake (VO2) at anaerobic threshold (AT) and at peak exercise for males and females. Several secondary outcome variables were also assessed, including ventilatory equivalents for carbon dioxide (VE/VCO2) at AT and heart rate (HR) at peak exercise. Kolmogorov-Smirnov tests were used to assess the normality of the distribution of variables. Variables were reported as mean ± SD or as frequency (percentage). ANOVA was used for normally distributed variables and Bonferroni correction was applied for multiple variable testing. P values < 0.05 were considered significant.
In addition, studies investigating the use of CPET for surgical risk assessment and including over one hundred patients were identified and screened for the proportion of female study participants. These studies were selected in a non-systematic way, using expert opinion and lists of studies included in recent review articles. As this is a timeless issue, no studies were excluded on the basis of publication year.
Table 1 shows the patient characteristics and CPET-derived variables before colorectal surgery analysed separately for males and females. The majority of patients (n = 428) were male. In our cohort, there were no significant differences between the groups in terms of age, BMI, tumour stage, laparoscopic approach or clinical outcomes. The primary CPET-derived outcomes, VO2 at AT and VO2 at peak, were 1.2 L/min and 1.6 L/min, respectively, for males. In females, VO2 was 0.8 L/min at AT and 1.1 L/min at peak. The male-female differences were therefore 33% and 25%, respectively. When adjusted for body weight, the differences between males and females were 14% and 18% at AT and peak, respectively. All differences were statistically significant (see Table 1). There was no statistically significant difference between males and females in terms of VE/VCO2 at AT or heart rate (HR) at peak exercise.
A total of seventeen studies were screened to determine the numbers of male and female participants. The results can be found in Table 2. Four studies did not report the sex of their participants. In the remaining studies, which included 5117 patients in all, 68.5% of participants were male. Sex-specific analyses of outcome data were performed in two studies (West et al., 2013; Bernal et al., 2014). One study noted that the proportion of females who were classified as high-risk on the basis of oxygen uptake at AT was much larger than the proportion of males. However, no separate outcome analyses were made for both sexes in that study (Wilson et al., 2010).
To our knowledge, this is the first study to report differences between male and female CPET-derived exercise-testing variables in a perioperative setting. VO2 at AT and peak are currently the CPET variables most commonly used to assess surgical risk (Moran et al., 2016; Levett et al., 2018). We found remarkable differences for both these variables (male-female differences of 33 and 25%, respectively) in our dataset, even when corrected for body weight (14 and 18%, respectively). Although these observations are not surprising—given the known differences in strength and fitness between males and females—this is the first study to report of such differences in the context of surgical risk assessment.
Known differences in sex-based physiology
Several factors may contribute to sex-based differences in the assessment of cardiopulmonary fitness in the context of surgical risk assessment. Firstly, and most obviously, male and female body composition is not the same. The differences in the distribution of fat and muscle tissue may be one of the causes of the sex-related difference in physical fitness we observed (Geer & Shen, 2009). However, this is not adequately resolved by correcting for body weight, and studies show that, even when correcting specifically for body fat, there is still a difference in oxygen uptake between the sexes (Sparling, 1980). Secondly, the female body reacts differently to physical stress and stressors related to treatment and disease. Females generally have a lower stroke volume, higher heart rate and lower cardiac output (Hart et al., 2009). When CPET is used for surgical risk stratification, a physical stressor is provoked which increases oxygen demand. Adaptive mechanisms, such as increased heart rate and respiratory rate, are the consequence of sympathetic nervous system activation, increasing oxygen uptake. In males, increasing vascular resistance and therefore blood pressure is the primary response to physical exertion, whereas the main response in females is an increase in heart rate (Hart et al., 2009). It is also known that, at all ages, females have a lower sympathetic drive and lower blood levels of norepinephrine than males, and this is a possible contributory factor to lower maximum oxygen uptake. So, even though current guidelines highlight the need for further research to address the interpretation of perioperative CPET and also discuss the difficulty of interpreting results for obese and cachectic individuals, differences between males and females have not previously been addressed in the perioperative setting (Levett et al., 2018).
Sex-based differences, CPET and perioperative risk
Oxygen uptake at peak or at AT is used to predict how well patients will adapt to a different stressor in their near future: surgery. Surgery also leads to increased oxygen demand, leading to a number of metabolic and endocrine changes that are collectively known as the surgical stress response.
Although females have a lower maximum oxygen uptake, they do not seem to have a less efficient surgical stress response than males. As seen in our cohort, average outcome after surgical treatment for colorectal cancer seems similar and, in some studies, more in favour of females than of males (West et al., 2016; González et al., 2005; De Angelis et al., 2014). Females seem to adapt to surgical stressors similarly to males, despite having a lower maximum oxygen uptake as a group. When predicting surgical risk, then, the average oxygen uptake capacity of sexes should be taken in account and it would therefore seem illogical to use the same reference values for males and females when assessing their risk of surgical complications and adverse outcome.
In addition, long-known sex-related differences in energy metabolism and physical performance may play a role (Björntorp PAJTAjocn, 1989). During moderate exercise, females primarily utilise fatty acids, preserving muscle glycogen reserves. This enables them to sustain low to moderate exercise for a long time. By contrast, males use glycogen reserves quickly, possibly enabling performance of shorter-lasting exercise with relatively high levels of intensity. An example of such a short, high-intensity, exercise is the steep-ramp protocol, which is mostly used in perioperative CPET (Society, 2003). A different protocol with a stronger focus on moderately intense exercise capacity of longer duration may produce contrasting sex-related differences and merits evaluation for the assessment of oxygen uptake capacity in both female and male surgical candidates.
The observations, and mechanisms, explored above suggest that simply correcting for body weight when applying CPET for surgical risk stratification does not account for the observed difference between sexes. Our assessment of recent studies investigating perioperative fitness using CPET showed that the current pool of research participants in published studies comprises more males (68.5%) than females. Sex-specific analysis is rarely performed in these studies (two out of seventeen studies), and one study even used a completely male reference population to compare with a mixed-sex study population (Rose et al., 2018). There seems to be little awareness of the possibility that not considering the sex of surgical candidates may lead to less effective risk assessment.
In heart failure, where one of the applications of CPET has been the timing of heart transplantation, studies show that the relationship between CPET-derived variables and prognosis is different between males and females, although the same reference values have been used historically for both sexes (Ehrman et al., 2018). Furthermore, a similar inclusion bias is seen in these studies comparable to our findings: more males were included than females and reference values were therefore skewed towards male standards. Accordingly, in addition to inaccurate information about their prognosis, females were given heart transplants too soon, and this may have led to the inefficient use of scarce donor hearts, higher mortality in male patients on waiting lists and higher health care costs. The implication is that assessing female patients on the basis of a predominantly or exclusively male standard can have adverse outcomes for both sexes.
Preoperative CPET suffers from similar problems: applying the current risk stratification models—with their lack of discrimination for sex—in clinical decision-making will lead to females being assessed on the basis of male-oriented standards and therefore the overestimation of surgical risk for females (Fig. 1). As a result, females may receive different perioperative interventions or have consultations in which shared decision-making is biased and they may even be advised to forego surgical treatment. Another clinical implication is triage to more intensive post-operative care environments. The misrepresentation of perioperative risk due to sex-specific imbalances could result in the misallocation of level-two post-operative resources, with obvious cost implications as well as the risk of unintended consequences of excess treatment. On the other hand, the surgical risk for males could be underestimated, putting them at a higher risk of surgical complications and emergency ICU admission. However, due to the large numbers of male participants in studies investigating surgical risk assessment, it is not unreasonable to assume that this effect may be less pronounced than the effect described above for female patients. Nevertheless, adequate reference values for both patient groups are required in order to deliver personalised and cost-effective care.
A stronger focus on generating and exploring sex-specific data, which is currently lacking in a large number of scientific fields, will provide researchers with the tools to further elucidate these differences and the mechanisms behind them. Since we are the first to highlight these sex-related differences in the context of perioperative risk stratification, and using a post hoc analysis of observational, non-randomised data, the results of this study should be interpreted with caution and validated in other, larger cohorts. Given the results described here, a heightened awareness of sex-related differences and more research into male and female performance during preoperative exercise testing is urgently needed to enhance the clinical applicability of CPET before major surgery.
Conclusions and recommendations
Although we recognise the limitations of the results described above, which are observational in nature, we did find a striking difference between the aerobic capacity of male and female surgical candidates in our study, that has previously gone underreported.
It is clear that there are sex differences to be taken into account in the clinical application of CPET, especially given the low number of female research participants in this field at present. To further interrogate the mechanisms and quantification of these differences in perioperative CPET, the authors recommend that a larger, adequately powered study is needed. Our results, however, resonate with a broader, persistent problem in all scientific fields: a lack of sex-specific research and a smaller number of female research participants, skewing results towards male-specific outcomes and subsequent clinical practice. We therefore recommend that researchers in the perioperative field consider these differences in the design and analysis of future clinical studies and undertake adequately powered studies that allow sufficient patient numbers to perform subgroup sex-specific analyses to assess surgical risk. A better understanding of sex-related differences in physical fitness will allow for sex-specific reference values, leading to further refinement and increased accuracy in perioperative risk assessment, truly personalised care and ultimately a better outcome for both sexes.
Availability of data and materials
Data sharing is not applicable to this article as no datasets were generated or analysed during the current study.
Cardiopulmonary exercise testing
- VE/VCO2 :
Ventilatory equivalents for carbon dioxide
- VO2 :
Volume of oxygen
Appelman Y, van Rijn BB, Monique E, et al. Sex differences in cardiovascular risk factors and disease prevention. Atherosclerosis. 2015;241(1):211–8.
Balady GJ, Arena R, Sietsema K, et al. Clinician’s guide to cardiopulmonary exercise testing in adults: a scientific statement from the American Heart Association. Circulation. 2010;122(2):191–225.
Bernal W, Martin-Mateos R, Lipcsey M, et al. Aerobic capacity during cardiopulmonary exercise testing and survival with and without liver transplantation for patients with chronic liver disease. Liver Transpl. 2014;20(1):54–62.
Björntorp PAJTAjocn. Sex differences in the regulation of energy balance with exercise. 1989;49(5):958-61.
Brunelli A, Belardinelli R, Refai M, et al. Peak oxygen consumption during cardiopulmonary exercise test improves risk stratification in candidates to major lung resection. Chest. 2009;135(5):1260–7.
Corrà U, Mezzani A, Giordano A, et al. Peak oxygen consumption and prognosis in heart failure: 14 mL/kg/min is not a “gender-neutral” reference. Int J Cardiol. 2013;167(1):157–61.
De Angelis R, Sant M, Coleman MP, et al. Cancer survival in Europe 1999–2007 by country and age: results of EUROCARE-5—a population-based study. Lancet Oncol. 2014;15(1):23–34.
Ehrman JK, Brawner CA, Shafiq A, et al. Cardiopulmonary exercise measures of men and women with HFrEF differ in their relationship to prognosis: the Henry ford hospital cardiopulmonary exercise testing (FIT-CPX) project. J Card Fail. 2018;24(4):227–33.
Falagas ME, Mourtzoukou EG, Vardakas KZ. Sex differences in the incidence and severity of respiratory tract infections. Respir Med. 2007;101(9):1845–63.
Foxman B. Epidemiology of urinary tract infections: incidence, morbidity, and economic costs. Am J Med. 2002;113(1):5–13.
Geer EB, Shen W. Gender differences in insulin resistance, body composition, and energy balance. Gender Med. 2009;6:60–75.
González JR, Fernandez E, Moreno V, et al. Sex differences in hospital readmission among colorectal cancer patients. J Epidemiol Community Health. 2005;59(6):506–11.
Goodyear SJ, Yow H, Saedon M, et al. Risk stratification by pre-operative cardiopulmonary exercise testing improves outcomes following elective abdominal aortic aneurysm surgery: a cohort study. Perioper Med. 2013;2(1):10.
Gupta D, Wenger NK. Guidelines for the prevention of cardiovascular disease in women: international challenges and opportunities. Expert Rev Cardiovasc Ther. 2012;10(3):379–85.
Hansen JE, Sue DY, Wasserman K. Predicted values for clinical exercise testing. Am Rev Respir Dis. 1984;129(2P2):S49–55.
Hart EC, Charkoudian N, Wallin BG, et al. Sex differences in sympathetic neural-hemodynamic balance: implications for human blood pressure regulation. Hypertension. 2009;53(3):571–6.
Huddart S, Young EL, Smith R-L, et al. Preoperative cardiopulmonary exercise testing in England–a national survey. Perioper Med. 2013;2(1):4.
Jones NL, Makrides L, Hitchcock C, et al. Normal standards for an incremental progressive cycle ergometer test. Am Rev Respir Dis. 1985;131(5):700–8.
Koch B, Schäper C, Ittermann T, et al. Reference values for cardiopulmonary exercise testing in healthy volunteers: the SHIP study. Eur Respir J. 2009;33(2):389–97.
Labor USCSCo, Resources H. National Institutes of Health revitalization act of 1993 1993.
Levett D, Jack S, Swart M, et al. Perioperative cardiopulmonary exercise testing (CPET): consensus clinical guidelines on indications, organization, conduct, and physiological interpretation. Br J Anaesth. 2018;120(3):484–500.
Light KP, Lovell AT, Butt H, et al. Adverse effects of neuromuscular blocking agents based on yellow card reporting in the UK: are there differences between males and females? Pharmacoepidemiol Drug Saf. 2006;15(3):151–60.
Loe H, Rognmo Ø, Saltin B, et al. Aerobic capacity reference data in 3816 healthy men and women 20–90 years. PLoS One. 2013;8(5):e64319.
Moran J, Wilson F, Guinan E, et al. Role of cardiopulmonary exercise testing as a risk-assessment method in patients undergoing intra-abdominal surgery: a systematic review. BJA. 2016;116(2):177–91.
National Institutes of Health (NIH) OoRoWsHO. 2018 [Available from: https://orwh.od.nih.gov/sex-gender accessed 03-10 2018.
Oberholzer A, Keel M, Zellweger R, et al. Incidence of septic complications and multiple organ failure in severely injured patients is sex specific. J Trauma Acute Care Surg. 2000;48(5):932–7.
Patel H, Bell D, Molokhia M, et al. Trends in hospital admissions for adverse drug reactions in England: analysis of national hospital episode statistics 1998–2005. BMC Clin Pharmacol. 2007;7(1):9.
Pirmohamed M, James S, Meakin S, et al. Adverse drug reactions as cause of admission to hospital: prospective analysis of 18 820 patients. Bmj. 2004;329(7456):15–9.
Reeves T, Bates S, Sharp T, et al. Cardiopulmonary exercise testing (CPET) in the United Kingdom—a national survey of the structure, conduct, interpretation and funding. Perioper Med. 2018;7(1):2.
Rochon PA, Clark JP, Binns MA, et al. Reporting of gender-related information in clinical trials of drug therapy for myocardial infarction. Can Med Assoc J. 1998;159(4):321–7.
Rose G, Davies R, Appadurai I, et al. Cardiorespiratory fitness is impaired and predicts mid-term postoperative survival in patients with abdominal aortic aneurysm disease. Exp Physiol. 2018;103(11):1505–12.
Siegel RL, Miller KD, Jemal A. Cancer statistics, 2017. CA Cancer J Clin. 2017;67(1):7–30.
Simon V. Wanted: women in clinical trials: American Association for the Advancement of Science; 2005.
Society AT. ATS/ACCP statement on cardiopulmonary exercise testing. Am J Respir Crit Care Med. 2003;167(2):211.
Sparling PB. A meta-analysis of studies comparing maximal oxygen uptake in men and women. Res Q Exerc Sport. 1980;51(3):542–52.
Takken T, Mylius C, Paap D, et al. Reference values for cardiopulmonary exercise testing in healthy subjects–an updated systematic review. Expert Rev Cardiovasc Ther. 2019;17(6):413–26.
Tew G, Ayyash R, Durrand J, et al. Clinical guideline and recommendations on pre-operative exercise training in patients awaiting major non-cardiac surgery. Anaesthesia. 2018;73(6):750–68.
West M, Asher R, Browning M, et al. Validation of preoperative cardiopulmonary exercise testing-derived variables to predict in-hospital morbidity after major colorectal surgery. Br J Surg. 2016;103(6):744–52.
West M, Lythgoe D, Barben C, et al. Cardiopulmonary exercise variables are associated with postoperative morbidity after major colonic surgery: a prospective blinded observational study. Br J Anaesth. 2013;112(4):665–71.
Wilson R, Davies S, Yates D, et al. Impaired functional capacity is associated with all-cause mortality after major elective intra-abdominal surgery. Br J Anaesth. 2010;105(3):297–303.
We thank Dr S.T. Sims for providing inspiration on the title. We are also very grateful to Dr S.P. Thomas for his proofreading of the manuscript and perpetual moral support.
Ethics approval and consent to participate
Consent for publication
This work was undertaken whilst MW and MPWG were funded by the National Institute of Health Research and the Royal College of Anaesthetists British Oxygen Company Fellowship awarded by the National Institute of Academic Anaesthesia for the Fit-4-Surgery programme of research. MPWG is vice president of CPX-International (International Society of Exercise Intolerance Research and Education), treasurer of the International Prehabilitation Society and deputy chair of the UK national Centre for Perioperative Care, UK. DZHL is president of the Perioperative Exercise Testing and Training Society and serves on the Board of CPX-International. SJ serves on the board of the International Prehabilitation Society. All other authors declare no conflict of interest.
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
About this article
Cite this article
Thomas, G., West, M.A., Browning, M. et al. Why women are not small men: sex-related differences in perioperative cardiopulmonary exercise testing. Perioper Med 9, 18 (2020). https://doi.org/10.1186/s13741-020-00148-2