Background

Breast cancer is a largely hormone-driven disease and the relationships between endogenous sex-steroid hormones – especially estrogens – and postmenopausal breast cancer are thought to be well established [1,2,3]. A recent systematic review and meta-analysis found moderate- to high-quality evidence that higher levels of estrogens and androgens, and lower levels of sex-hormone binding globulin (SHBG), were associated with increased risks of postmenopausal breast cancer [4]. The quality of the evidence in this review was largely determined by dose–response effects and large effect sizes [4]. No extracted result had adjusted for biomarkers from other biological pathways; namely, the insulin/insulin-like growth factor (IGF)-signaling and inflammatory pathways. These pathways may confound the effect of the sex-steroid hormone pathway (Fig. 1). For example, insulin and insulin-like growth factor-1 (IGF-1) can affect the bioavailability of estrogens and androgens via the regulation of aromatase and suppression of hepatic SHBG production [1, 5, 6]. They may also play a role in breast carcinogenesis: insulin and the IGF axis are proposed to have mitogenic and anti-apoptotic properties, and higher systemic concentrations of IGF-1 are associated with increased risks of breast cancer [1, 2, 5,6,7,8]. Further, a state of low-grade chronic inflammation – for example, in the context of physical inactivity and obesity – can foster a pro-carcinogenic environment via the overstimulation and dysregulation of immune cells, cytokines and adipokines [1, 2, 5, 6, 9]. Higher circulating levels of C-reactive protein (CRP) – a non-specific marker of chronic inflammation – are associated with increased risks of breast cancer, but the epidemiological evidence for other inflammatory markers remains uncertain [2, 10, 11]. Higher circulating levels of pro-inflammatory biomarkers including leptin, tumor necrosis factor-alpha (TNF-α) and interleukin-6 (IL-6) are also associated with enhanced aromatase activity and lower circulating levels of SHBG [1, 2, 5, 6, 9, 12].

Fig. 1
figure 1

Diagram of the assumed interrelationships between the inflammatory, insulin/insulin-like growth factor-signaling, and sex-steroid hormone pathways. SHBG sex-hormone binding globulin, CRP C-reactive protein, IGF insulin-like growth factor, IGF-1 Insulin-like growth factor-1

The aim of this study was to estimate risk ratios for sex-steroid hormone biomarkers in relation to postmenopausal breast cancer in a case-cohort of postmenopausal women within the Melbourne Collaborative Cohort Study (MCCS), while accounting for other biomarkers from the insulin/IGF-signaling and inflammatory pathways.

Methods

The Melbourne Collaborative Cohort Study

The MCCS includes 24,469 women aged 40–69 at recruitment from 1990 to 1994 [13]. At baseline and the second follow-up (F2, 2003–7), participants provided information about health status, lifestyle factors, sociodemographics and medical history via structured questionnaires [13]. Anthropometric and clinical measurements were performed at the study center, including the collection of blood samples [13]. At both times, plasma was stored in liquid nitrogen. Data linkages to national and state death and cancer registries—including the Victorian Cancer Registry and Australian Cancer Database—enabled vital status and cancer diagnoses to be determined prospectively [13]. The study protocol was approved by the Cancer Council Victoria Human Research Ethics Committee.

The case-cohort study

Initial eligibility criteria at second follow-up (2003–7)

This case-cohort study was restricted to women who attended F2. At F2, eligible women were postmenopausal, not known to be taking hormone replacement therapy (HRT), had provided a blood sample (within one year of the F2 questionnaire, if completed), had no prior invasive cancer diagnosis (except for keratinocyte cancers); at baseline, they had a body mass index (BMI) ≥ 18.5 kg/m2. Women were considered postmenopausal if they had had no menstrual periods in the past 12 months and met one of the following criteria: had experienced natural cessation of menses; had a bilateral oophorectomy; were age 55 years or older; or had had no periods in the 12 months prior to baseline and, for participants in a previous case-cohort study, measured estradiol concentration below 109 pmol/L at baseline (a threshold from that study [14, 15]). The case-cohort comprised a random sample of the 10,669 eligible women and all eligible women diagnosed with estrogen receptor (ER)-positive postmenopausal breast cancer between blood collection at F2 and 31 October 2020.

An eligible tumor was defined as invasive adenocarcinoma of the breast (International Classification of Diseases, Tenth Revision [ICD-10] code C50) that was ER-positive. Tumors of unknown hormone receptor status were included as 88% of breast cancer diagnoses among eligible women of known ER status were ER-positive. ER-negative and progesterone receptor (PR)-positive cancers were also included as this tumor subtype may be misclassified and accounts for only 1–4% of diagnoses [16,17,18,19]. Unspecified adenocarcinomas and unspecified cancers were presumed to be adenocarcinomas as 99% of breast cancer diagnoses among eligible women of known morphology were adenocarcinomas.

In total, 1,412 women were selected for the case-cohort study, including 999 in the subcohort and 459 cases (46 from the subcohort) (Fig. 2). The subcohort was a random sample of eligible women (Online Resource 1).

Fig. 2
figure 2

Selection of participants into the case-cohort study and analyses. N Number, F2 second follow-up wave, HRT hormone replacement therapy, BMI body mass index (kg/m.2)

Post-hoc criteria

Of the 1,412 selected women, 286 (20%) had unknown menopausal status and/or HRT use. Eligibility was confirmed for all selected women using the distribution of measured estradiol values at F2 for naturally postmenopausal women who were not taking HRT (806, 57% of selected women). Thirty-two women with estradiol values at or above the 99th percentile of this distribution (29.3 pg/mL, equivalent to 107.6 pmol/L) were excluded, regardless of age, menopausal status, or HRT use. Menopausal status and/or HRT use could not be determined for six women missing estradiol measurements. One woman was excluded as she did not participate in F2 despite providing a blood sample.

Four cases outside the subcohort were retrospectively disqualified as cases; three diagnoses were ascertained from death certificate only and one woman was diagnosed with non-adenocarcinoma breast cancer. To minimize the impact of death as a competing risk, follow-up was chosen to end on participants’ 86th birthday (Online Resource 2). Thus, 44 cases outside the subcohort were excluded and eight cases within the subcohort were analyzed as non-cases. Thirteen users of exogenous insulin were excluded so that measured insulin concentrations were of endogenous insulin.

The total study sample after post-hoc exclusions comprised 1,312 women, 969 in the subcohort and 378 cases (35 also in the subcohort) (Fig. 2).

Laboratory analysis of plasma biomarkers

Plasma samples of selected women were randomly ordered and allocated into 21 batches containing approximately equal numbers of cases. The samples were shipped on dry ice in two dispatches to the International Agency for Research on Cancer (IARC).

The plasma concentrations of all biomarkers were measured at the Nutrition Metabolism Branch, IARC. Plasma concentrations of sex-steroid hormones and SHBG were measured as previously described [20]. In brief, sex-steroid hormone concentrations were measured using a liquid chromatography-mass spectrometry system consisting of an ultra-high-performance liquid chromatograph (Agilent 1290, Agilent, Santa Clara, CA) and a QTRAP 5500 mass spectrometer (SCIEX, Framingham, MA). SHBG concentrations were measured by solid-phase “sandwich” enzyme-linked immunoassay (DRG International, Springfield, NJ). Interferon gamma (IFN-γ), IL-6, interleukin-8 (IL-8), interleukin-10 (IL-10), TNF-α, insulin, adiponectin, leptin, and CRP were measured by highly-sensitive and highly-specific electrochemiluminescent methods (Meso Scale Discovery, Rockville, MD). IGF-1 and insulin-like growth factor binding protein-3 (IGFBP-3) were measured by immunoassay methods by R&D Systems (Biotechne, Minneapolis, USA). C-peptide was measured by an enzyme-linked immunosorbent assay by ALPCO (Salem, USA). Further details are included in Online Resource 3. Three quality control samples at different concentration levels were measured in duplicate in each batch of analyses to assess the reliability of biomarker measurements. Reliability was assessed by calculating intra-assay and inter-assay coefficients of variation (CVs), as well as intra-batch and inter-batch intra-class correlation coefficients (ICCs), as described in Online Resource 4. Assay performance for estradiol and testosterone was evaluated by measuring samples created from reference standards with known concentrations. Measured values were compared with true values using validity coefficients and correlation plots, as described in Online Resource 5.

Normalization of biomarker values

Biomarker data were cleaned and normalized to correct for effects of batch, dispatch, and time since last meal (12% of study participants were not fasting at blood collection). The normalization technique was adapted from Viallon et al. [21]. Normalization models were used to estimate residual ICCs for the total proportion of variation attributable to batch for each biomarker. Methods for normalization and estimated ICCs are presented in Online Resource 6.

Calculation of free estradiol and free testosterone

Concentrations of free estradiol and free testosterone (i.e., not bound to SHBG) were calculated from normalized values of estradiol, testosterone and SHBG using the law of mass action assuming a fixed albumin concentration of 40 g/L (5.97 × 10–4 mol/L) and the following association constants: 6 × 104 L/mol (binding of estradiol to albumin); 4 × 104 L/mol (binding of testosterone to albumin); 0.68 × 109 L/mol (binding of estradiol to SHBG); 1.6 × 109 L/mol (binding of testosterone to SHBG) [22,23,24,25].

Statistical analysis

Descriptive statistics were presented as medians and interquartile ranges (IQRs) or as frequencies and percentages, where appropriate. Weighted modified Poisson regression with a robust variance estimator was used to estimate risk ratios (RRs) and 95% confidence intervals (CIs) of postmenopausal ER-positive breast cancer, per doubling plasma concentration of progesterone, androstenedione, DHEA, total and calculated free testosterone, estrone, total and calculated free estradiol, and SHBG. Poisson regression models were weighted to account for the oversampling of cases, which can be considered a stratified form of sampling in which stratification depends on the outcome [26]. Case weights were one, and weights for non-cases were the inverse of the sampling probability for non-cases [26]. The latter was calculated as the number of non-cases in the eligible cohort divided by the number of non-cases in the subcohort ([10,669—459] /953).

Confounders including other biomarkers were identified a priori using causal diagrams informed by expert consensus and literature review. Sociodemographic and lifestyle confounders included: education; country of birth; socioeconomic disadvantage; diet at baseline (dietary intake of carotenoids and dietary intake of calcium); alcohol consumption at baseline; smoking status at baseline; adiposity at baseline; physical activity at F2; age at blood collection; and age at menopause. The identification, measurement and modelling of sociodemographic and lifestyle confounders are described in Online Resource 7. As age at menopause could only be measured for naturally postmenopausal women (821, 63% of the case-cohort after post-hoc exclusions), this variable was not included in the adjustment set for the primary analyses. Sensitivity analyses were conducted, restricting to naturally postmenopausal women with a recorded age at menopause to include this variable in adjustment sets. Biomarkers that were identified as potential confounders a priori but had correlations ≥ 0.50 with the biomarker of interest were not included in the primary analysis (Online Resource 8).

The primary analyses modelled all biomarker concentrations as continuous, normalized values on the log2-scale. A one unit increase on the log2-scale represents a doubling in biomarker concentration. Analyses were repeated without adjustment for other biomarkers (where applicable). Sensitivity analyses excluding ER-negative/PR-positive tumors and tumors of unknown hormone receptor status were conducted to test the assumption that these tumor subtypes were ER-positive. Additional sensitivity analyses excluded all cases diagnosed within one year of blood draw at F2 to assess the potential impact of reverse causation. These analyses also excluded deaths that occurred within this time frame to be consistent with the target trial approach [27]. In addition, analyses that modelled concentrations of each sex-steroid hormone biomarker as quartiles corresponding to the distribution of normalized biomarker values in the subcohort were performed without adjustment for other biomarkers.

All analyses were complete-case analyses. The linearity assumption was tested for the continuous, normalized biomarker values using restricted cubic splines and Wald-tests. All statistical analyses were performed using Stata 16 (StataCorp, College Station, TX).

Results

Of the 1312 women eligible after post-hoc exclusions, 87 were excluded due to missing sociodemographic and lifestyle confounder data (Fig. 2). In addition, 17 women were excluded due to missing measurements for all sex-steroid hormone biomarkers. The characteristics of the remaining 1,208 women are summarized in Table 1. Compared with non-cases, cases were more likely to be educated, have obesity, and experience the menopause at ≥ 53 years, and were less likely to be sufficiently active. The normalized concentrations of DHEA, total estradiol, free estradiol, leptin and CRP were higher, and the normalized concentration of SHBG was lower, for cases compared with non-cases. The characteristics of the 1,312 women eligible after post-hoc exclusions were not appreciably different from the 1,208 women analyzed (Online Resource 9).

Table 1 Characteristics of the analyzed case-cohort (n = 1208)

Of the 342 cases eligible for complete-case analysis, 324 (95%) were ER-positive, 5 were ER-negative/PR-positive, and 13 were of unknown hormone receptor status. Twenty cases were diagnosed within one year of blood draw at F2, and 5 subcohort non-cases had died.

Reliability of biomarker measurements and assay performance

The calculated overall intra-assay and inter-assay CVs were below 10% and 15%, respectively, for most biomarkers (Online Resource Table 4.1). The estimated intra-batch and inter-batch reliability ICCs were above 80% and 70%, respectively, for most biomarkers (Online Resource Table 4.2). The validity coefficients for the true and measured values of estradiol and testosterone were 0.987 and 0.997, respectively. Correlation plots are presented in Online Resource 5.

Risk ratios per doubling of biomarker concentration

For the primary analyses, increased risks of postmenopausal ER-positive breast cancer were observed per doubling plasma concentration of progesterone (RR 1.22, 95% CI 1.03 to 1.44), androstenedione (RR 1.20, 95% CI 0.99 to 1.45), DHEA (RR 1.15, 95% CI 1.00 to 1.34), total testosterone (RR 1.11, 95% CI 0.96 to 1.29), calculated free testosterone (RR: 1.12, 95% CI: 0.98 to 1.28), estrone (RR: 1.21, 95% CI: 0.99 to 1.48), total estradiol (RR: 1.19, 95% CI: 1.02 to 1.39) and calculated free estradiol (RR: 1.22, 95% CI: 1.05 to 1.41) (Table 2). A decreased risk was suggested for SHBG (RR: 0.83, 95% CI: 0.66 to 1.05).

Table 2 Risk ratios for postmenopausal estrogen receptor-positive breast cancer per doubling of biomarker concentration

Results did not appreciably differ in analyses without adjustment for other biomarkers (Table 2), except that the inverse association for SHBG was somewhat weaker (RR: 0.90, 95% CI: 0.73 to 1.11). Point estimates for RR for all sex-steroid hormone biomarkers were slightly stronger when ER-negative/PR-positive tumors and tumors of unknown hormone receptor status were excluded (Online Resource 10). The results of the sensitivity analyses excluding cases and deaths that occurred within one year of blood draw at F2 were similar to the results of the primary analyses (Online Resource 11).

For the sensitivity analyses in the subset of naturally postmenopausal women with a recorded age at menopause (Online Resource 12), the point estimates for RR were closer to the null for progesterone (RR 1.11, 95% CI 0.90 to 1.36) and androstenedione (RR 1.08, 95% CI 0.85 to 1.39), and further away from the null for estrone (RR 1.30, 95% CI 0.99 to 1.69), total estradiol (RR 1.29, 95% CI 1.04 to 1.58) and calculated free estradiol (RR 1.31, 95% CI 1.08 to 1.60). Results with and without adjustment for age at menopause were similar, whereas the point estimates for RR without adjustment for other biomarkers were closer to the null for estrone, free estradiol and SHBG (Online Resource 12).

Risk ratios for quartiles of biomarker concentration

The highest versus lowest levels of biomarker concentrations were associated with increased risks of postmenopausal ER-positive breast cancer for progesterone (RR 1.56, 95% CI 1.09 to 2.24), androstenedione (RR 1.39, 95% CI 0.97 to 2.00), DHEA (RR 1.55, 95% CI 1.06 to 2.25), total estradiol (RR 1.49, 95% CI 1.01 to 2.19) and calculated free estradiol (RR 1.47, 95% CI 0.99 to 2.17) (Table 3). RRs were suggestive of monotonic increases for DHEA, estrone and total estradiol. In contrast, the positive relationship between calculated free estradiol and postmenopausal ER-positive breast cancer plateaued at the third-highest plasma concentration compared to the lowest.

Table 3 Risk ratios for postmenopausal estrogen receptor-positive breast cancer, by quartiles of biomarker concentrations

Discussion

Summary of principal findings

Higher plasma concentrations of progesterone, estrogens and androgens, and decreasing plasma concentration of SHBG, were associated with increased risks of postmenopausal ER-positive breast cancer in this case-cohort of postmenopausal women. Similar results were obtained with and without control for other biomarkers that were identified as potential confounders, suggesting that confounding by the insulin/IGF-signaling and inflammatory pathways was minimal. The exception was SHBG; a somewhat stronger inverse relationship was observed with adjustment for adiponectin, leptin, insulin and IGF-1. Slightly stronger associations were observed when cases that were ER-negative/PR-positive or of unknown hormone receptor status were excluded, indicating that some cases that were assumed to be ER-positive may not have truly been ER-positive. The impact of reverse causation was negligible. Results of the sensitivity analyses in the subset of naturally postmenopausal women with a recorded age at menopause were not sensitive to adjustment for age at menopause. Rather, the deviations observed from the primary analyses could be explained by reduced precision in the subsample, or differences between women who were naturally postmenopausal (with a known age at menopause) and women who were assumed to be postmenopausal for other reasons.

Strengths and limitations

A strength of our study was that careful consideration was given to biomarkers from the insulin/IGF-signaling and inflammatory pathways that may confound the associations between biomarkers of the sex-steroid hormone pathway and risk of postmenopausal ER-positive breast cancer. Biomarkers that may be potential confounders were identified a priori using a causal diagram that was informed by literature review and expert opinion. Causal diagrams can minimize the pitfalls of other confounder selection methods, including overadjustment bias [28,29,30]. However, residual confounding may remain if our assumptions are inaccurate or if important confounders have not been identified or correctly measured [28, 30]. Depicting the true complexity of biomarker interrelationships and their role in breast carcinogenesis is challenging. The current body of causal knowledge is limited, and we could not account for bidirectional relationships as the biomarkers had only been measured at one point in time. Thus, we assumed what the net direction of the effects of the measured biomarkers would be in a relatively older cohort of postmenopausal women in our causal diagram. Our assumptions can be refined with the advancement of causal knowledge over time, ideally in studies that measure biomarkers at multiple points in time.

A potential limitation of our study was that selection bias may have been introduced if there were systematic differences between women who did and did not attend F2 and provide a blood sample. However, the sociodemographic and lifestyle confounders that we have adjusted for in our analyses are likely to have included the critical determinants of participation at F2 (e.g., age, country of birth, socioeconomic disadvantage, education, smoking, alcohol, adiposity) [13]. Restriction to women who had not been diagnosed with breast cancer before F2 could potentially bias the RR for hormones towards the null, since women with high hormone concentrations at baseline were more likely to develop breast cancer [15]. Bias may also arise due to the exclusion of participants with missing data, although in this study, the proportion of eligible women with missing data was small (8%). Considered collectively, we expect selection bias to have minimal impact on our study conclusions. We note that our findings are only generalizable to postmenopausal women who are not taking hormone replacement therapy or exogenous insulin and have no personal history of cancer (including breast cancer).

A major strength of our study was the use of a highly sensitive liquid chromatography-mass spectrometry method to measure the plasma concentrations of sex-steroid hormones in postmenopausal women with high precision and accuracy. We were able to demonstrate the validity of this method using reference standards for estradiol and testosterone. The measured and true values of estradiol and testosterone were highly correlated. Further, intra-assay and inter-assay CVs, as well as intra-batch and inter-batch ICCs, calculated from quality control samples indicated that biomarker measurements were reliable, with few exceptions that may be attributable to batch and dispatch effects (Online Resource 4). We adopted a novel analysis approach to correct for batch effects, dispatch effects and time since last meal, whilst retaining meaningful biological variation in the biomarker measurements [21]. Further, we measured the plasma concentrations of a breadth of biomarkers from the inflammation, insulin/IGF-signaling and sex-steroid hormone pathways, which were selected through expert consultation and literature review. However, plasma concentrations of biomarkers measured at only one point in time will not be perfect proxies of complex and time-varying biological processes that may operate at cellular and systemic levels.

Comparison with other studies

Previous studies that have adjusted for other biomarkers have typically compared results with and without adjustment for other sex-steroid hormones and/or SHBG [3, 15, 31,32,33,34,35,36,37,38,39]. These are often mutual or progressive adjustments to assess “independence” rather than confounding, on the basis that biomarkers share complex interrelationships and correlations. However, this practice can lead to overadjustment bias, which we have attempted to minimize by explicitly considering our assumed underlying causal structure a priori [29]. In addition, only a handful of other studies have measured and adjusted for biomarkers from other biological pathways that may be potential confounders. One study from the Women’s Health Initiative presented results for estradiol with and without adjustment for free IGF-1 and insulin; positive associations with postmenopausal breast cancer appeared stronger with adjustment for both IGF-1 and insulin [40]. In contrast, our results for estradiol were similar with and without adjustment for biomarkers that were identified as potential confounders (adiponectin, leptin, TNF-α, IL-6, IGF-1, insulin and SHBG). Another study from the UK Biobank presented results for total testosterone with and without adjustment for SHBG and IGF-1 that were not appreciably different [41]. Likewise, our results for total testosterone did not change with and without adjustment for SHBG, but IGF-1 was not identified as a potential confounder of this analysis.

Overall, our findings were generally consistent with previous studies, including a recent systematic review by Drummond et al. [4], a previous case-cohort study conducted at baseline (1990–1994) within the MCCS [15], and a pooled analysis of nine prospective studies examining the relationship between endogenous sex-steroid hormones and postmenopausal breast cancer [3]. A notable finding was the estimated risk ratio per doubling plasma concentration of progesterone; we observed the largest increased risk of postmenopausal ER-positive breast cancer for this biomarker (RR 1.22, 95% CI 1.03 to 1.44) compared to any other measured biomarker from the sex-steroid hormone pathway. Previous studies have either not measured endogenous progesterone or have drawn inconclusive results regarding its relationship with breast cancer after the menopause, largely due to insufficient assay sensitivity and low circulating levels in postmenopausal women [42]. Our result is in support of a recent study by Trabert et al. [43], which also used a highly sensitive liquid chromatography-mass spectrometry method and found increased risks of postmenopausal breast cancer per standard deviation increase in circulating endogenous progesterone levels (hazard ratio for invasive breast cancers: 1.24, 95% CI 1.07 to 1.43). Trabert et al. [43] also present evidence for effect modification: reduced risks of postmenopausal breast cancer were observed with higher levels of progesterone among women in the lowest quintile of circulating estradiol (< 6.30 pg/mL), while increased risks were observed among women in the higher quintiles (≥ 6.30 pg/mL). Collectively, these results may challenge the plausibility of our a priori assumption that progesterone does not have a direct effect on postmenopausal ER-positive breast cancer (depicted by no direct arrow from progesterone to postmenopausal breast cancer in our causal diagram, Online Resource Fig. 8.1). This assumption was based on the systematic review by Drummond et al. [4], which found moderate quality evidence of no association between progesterone and breast cancer risk (albeit in both pre- and postmenopausal women combined). The implication of this assumption is that we should interpret the risk ratio for progesterone as an indirect effect, possibly driven by its role as a precursor of androgens and estrogens in steroidogenesis. This finding—in addition to concerns over the sensitivity of progesterone measurements in early studies, as well as studies demonstrating paracrine effects of progesterone via neighboring PR-positive cells [42]—warrants future studies including mediation analyses to determine what dictates the effect of progesterone on postmenopausal ER-positive breast cancer.

Implications and future directions

Our study confirms the causal role that sex-steroid hormones and SHBG play in the etiology of postmenopausal ER-positive breast cancer. We strengthen the causal evidence by demonstrating that potential confounding from other biological pathways implicated in breast carcinogenesis is likely non-substantial. However, the limited state of causal knowledge about biomarker interrelationships and the potential for residual confounding should be considered when interpreting our findings. Of note, two recent systematic reviews found insufficient evidence to establish a causal link between the inflammation and insulin/IGF-signaling pathways and breast cancer [8, 11]. Future research could examine whether adjustment for biomarkers from other biological pathways is more important for pre–menopausal breast cancer or ER-negative postmenopausal breast cancer. In addition, time-varying confounding could be examined in future studies that measure biomarkers at multiple points in time.