Introduction

Outdoor air quality has been recognized as a leading cause of disease and death globally (GBD 2019). As such, strategies to improve air quality have been proposed (Johnston et al. 2012; Liang et al. 2018; Gu et al. 2018; Lelieveld et al. 2019). These strategies are often evaluated in terms of their societal monetary costs and extent to which they improve public health (World Bank 2016; Heo et al. 2017; Muller 2018). Public health improvements are often measured by the number of attributable adverse health events (i.e., incidence of disease or death) associated with a reduction in exposure to outdoor concentrations of pollution over a fixed time period (i.e., year). We can quantitatively represent the attributable number, A N, by the following equation:

$$\textrm{A}N\left({z}_C,{z}_F\right)=P\times \textrm{BR}\times \textrm{PA}{\textrm{F}}_{\beta}\left({z}_C,{z}_F\right)$$

where zC is the current air pollution concentration, zF the future concentration predicted under a specific mitigation strategy, P the target population size for which the concentration applies, BR the baseline rate of disease or death, and PAFβ(zC, zF), the population attributable fraction defined by the following equation:

$${\textrm{PAF}}_{\beta}\left({z}_C,{z}_F\right)=1-\frac{1}{R_{\beta}\left({z}_C,{z}_F\right)}$$

with Rβ(zC, zF) the relative risk between zC and zF, indexed by a vector of parameters β. Here, Rβ(zC, zF) represents the ratio of the probability of an adverse event over a fixed time period for a population which is exposed to zC to the corresponding probability if that population was contrary to the fact exposed to zF.

The magnitude, shape, and uncertainty in Rβ(zC, zF) need to be characterized in order to conduct a health benefits analysis. The relative risk function is usually defined with respect to a counterfactual concentration, zcf say, denoted by Rβ(zC|zcf) with Rβ(zcf|zcf) = 1. We then have

$${R}_{\beta}\left({z}_C,{z}_F\right)=\frac{R_{\beta}\left({z}_C\left|{z}_{cf}\right.\right)}{R_{\beta}\left({z}_F\left|{z}_{cf}\right.\right)}$$

Uncertainty in Rβ(zC|zcf) is induced by uncertainty in the estimate of β given the algebraic form of how R varies with z. Let \(\hat{\beta}\) be the estimate of β with distribution \({F}_{\hat{\beta}}\). Then the uncertainty distribution of \({R}_{\hat{\beta}}\left({z}_C,{z}_F\right)\) is obtained by generating a (usually) large number, N, of realizations of \(\hat{\beta}\) from \({F}_{\hat{\beta}}\), denoted by \(\left\{{\hat{\beta}}^{(i)},i=1,\dots, N\right\}\), thus yielding N estimates

$${R}_{{\hat{\beta}}^{(i)}}\left({z}_C,{z}_F\right)=\frac{R_{{\hat{\beta}}^{(i)}}\left({z}_C\left|{z}_{cf}\right.\right)}{R_{{\hat{\beta}}^{(i)}}\left({z}_F\left|{z}_{cf}\right.\right)}$$

for i = 1, …, N. These N estimates of relative risk are used to calculate N estimates of the PAF. Uncertainty in the estimate of attributable deaths can be determined by the joint uncertainty in P, BR, and PAF (GBD 2019).

Previous relative risk models

The most common form of Rβ(zC, zF) is the following equation

$${R}_{\beta}\left({z}_C,{z}_F\right)=\frac{R_{\beta}\left({z}_C\left|{z}_{cf}\right.\right)}{R_{\beta}\left({z}_F\left|{z}_{cf}\right.\right)}=\frac{\exp \left\{\beta {z}_C-\beta {z}_{cf}\right\}}{\exp \left\{\beta {z}_F-\beta {z}_{cf}\right\}}=\exp \left\{\beta \left({z}_C-{z}_F\right)\right\}$$

for scalar parameter β. This characterization is often termed the Log − Linear model (Cohen et al. 2004) since the logarithm of the relative risk function is linear in concentration. We note that under the Log − Linear model, Rβ(zC, zF) is independent of the counterfactual concentration zcf and only depends on the magnitude of the difference in concentrations zC − zF, but not the magnitude of each concentration.

The estimate of β is usually obtained from the analysis of primary health data with a normal uncertainty distribution, \(N\left(\hat{\beta},{\hat{\sigma}}_{\hat{\beta}}\right)\), where \({\hat{\sigma}}_{\hat{\beta}}\) is the standard error of \(\hat{\beta}\). In the case of the Log − Linear model, we have

$$\beta \left({z}_C-{z}_F\right)\sim N\left(\hat{\beta}\left({z}_C-{z}_F\right),{\hat{\sigma}}_{\hat{\beta}}\left({z}_C-{z}_F\right)\right)$$

since

$$\sqrt{\operatorname{var}\left(\hat{\beta}{z}_C-\hat{\beta}{z}_F\right)}=\sqrt{\operatorname{var}\left(\hat{\beta}{z}_C\right)+\operatorname{var}\left(\hat{\beta}{z}_F\right)-2\operatorname{cov}\left(\hat{\beta}{z}_C,\hat{\beta}{z}_F\right)}=$$
$$\sqrt{{\hat{\sigma}}_{\hat{\beta}}^2\ {z}_C^2+{\hat{\sigma}}_{\hat{\beta}}^2\ {z}_F^2-2{\hat{\sigma}}_{\hat{\beta}}^2\ {z}_C{z}_F}={\hat{\sigma}}_{\hat{\beta}}\left({z}_C-{z}_F\right)$$

The uncertainty distribution in Rβ(zC, zF) is completely characterized by \(N\left(\hat{\beta},{\hat{\sigma}}_{\hat{\beta}}\right)\) and the concentrations zC and zF.

In recent years, interest has focused on the shape of the association between air pollution exposure and health, specifically assessing evidence of departures from linearity. An early attempt (Cohen et al. 2004) was based on the Log − Log model of the form:

$$\ln {R}_{\beta}\left(z,{z}_{cf}\right)=\beta \left(\ln \left(z+1\right)-\ln \left({z}_{cf}+1\right)\right)=\beta \ln \left(\frac{z+1}{z_{cf}+1}\right)$$

where the logarithm of the relative risk varied with the logarithm of concentration. Note that the relative risk function is dependent on the counterfactual concentration, a departure from the Log − Linear model.

These models were extended to a broader family of shapes by the Shape Constrained Health Impact function (SCHIF) (Nasari et al. 2016) which has the form:

$$\ln {R}_{\beta}\left(z,{z}_{cf}\right)=\theta \frac{f\left(z-{z}_{cf}\right)}{1+\exp \left(-\left(z-{z}_{cf}-\mu \right)/\tau \right)}$$

where f is either a linear or logarithmic function of concentration. The SCHIF can take near-linear, sub, and supra-linear forms in addition to sigmodal shapes. Ensemble modelling methods were used to estimate the unknown parameters (θ, μ, τ) and form of f (Nasari et al. 2016). Like the Log − Log model, the SCHIF is also dependent on the counterfactual concentration.

The Log − Linear, Log − Log, and SCHIF were intended to be used on primary health data. However, interest has also focused on combining information from selected studies to form a common relative risk model within a meta-regression framework. Log − Linear models have been used for this purpose (Chen and Hoek 2020) in addition to non-linear models, including the integrated exposure-response (IER) (Burnett et al. 2014) with the form

$$\ln {R}_{\beta}\left(z,{z}_{cf}\right)=\ln \left(1+\alpha \left(1-\exp \Big(-\beta {\tilde{z}}^{\rho}\right)\right)$$

the Global Exposure Mortality Model (GEMM) (Burnett et al. 2018), that generalized the SCHIF, with the form

$$\ln {R}_{\beta}\left(z,{z}_{cf}\right)=\theta \frac{\ln \left(\overset{\sim }{z}/\alpha +1\right)}{1+\exp \left(-\left(\overset{\sim }{z}-\mu \right)/\tau \right)}$$

and the Fusion model (Burnett et al. 2022)

$$\ln {R}_{\beta}\left(z,{z}_{cf}\right)=\gamma \times \left(\min \left(\overset{\sim }{z},\mu \right)+\mathcal{F}\left(\overset{\sim }{z}|\rho, \mu, \theta \right)+\rho \theta \ln \left(\max \left(\overset{\sim }{z},\theta \right)/\theta \right)\right)$$

where

$$\mathcal{F}\left(\overset{\sim }{z}|\rho, \mu, \theta \right)=\int_{\mu}^{min(\theta,\tilde{z})}{\left(1+\frac{1-\rho }{\rho }{\left(\frac{x-\mu }{\theta -\mu}\right)}^{\frac{\theta -\mu }{\theta \left(1-\rho \right)}}\right)}^{-1} dx$$

with \(\overset{\sim }{z}=z-{z}_{cf}\). These models are based on a small number of unknown parameters (three for the IER, and four for both the GEMM and Fusion). All three models report 1000 sets of parameter estimates, in addition to 1000 sets of relative risk predictions over the concentration range, thus enabling calculation of 1000 PAFs based on any two concentrations (Burnett et al. 2014:2018:2022). This PAF uncertainty distribution can be used in benefits assessment.

Most researchers now use non-parametric spline methods to both characterize the shape and magnitude of the concentration-response function, either based on primary data (Brunekreef et al. 2021; Brauer et al. 2022; Dominici et al. 2022), or meta-regression combining information from multiple primary data studies (GBD 2019), in order to gain additional flexibility compared to parametric models with a few parameters, such as the IER, GEMM, and Fusion models. Splines have the general form

$${s}_{\beta }(z)=\sum_{j=1}^J{\beta}_j{b}_j\left(z|\kappa \right)$$

where the basis functions bj(z| k1, …, kK) are transformations of concentration specified by K knot locations: κ = (k1, …, kK). The most common forms of bj(z| κ) are cubic polynomials, such as those used to define natural (Bartels et al. 1998) or restricted (Harrel 2015) cubic splines such that the second derivative of the spline is continuous at the knot values. They typically involve a limited number of terms, J, with the number of terms selected based on fitting criteria. These regression splines have been extended to include a penalty term based on the integral of the second derivative of the spline that smooths the estimates of the βj. Smoothing splines (Eilers and Marx 1996) typically have many basis functions and estimates of the βj are a function of the penalty parameter, with larger values inducing more smoothing on these estimates.

For spline representations, the quantity of interest in benefits assessment is as follows:

$$\ln {R}_{\hat{\beta}}\left({z}_C,{z}_F\right)=\left({\hat{s}}_{\hat{\beta}}\left({z}_C\right)-{\hat{s}}_{\hat{\beta}}\left({z}_{cf}\right)\right)-\left({\hat{s}}_{\hat{\beta}}\left({z}_F\right)-{\hat{s}}_{\hat{\beta}}\left({z}_{cf}\right)\right)={\hat{s}}_{\hat{\beta}}\left({z}_C\right)-{\hat{s}}_{\hat{\beta}}\left({z}_F\right)=\sum_{j=1}^J{\hat{\beta}}_j\left({b}_j\left({z}_C|\kappa \right)-{b}_j\left({z}_F|\kappa \right)\right)$$

independent of the counterfactual concentration zcf. Researchers typically report the results of fitting splines with the exponential of the mean spline prediction over the concentration range and the 95% confidence intervals of those exponentiated predictions (Brunekreef et al. 2021; Brauer et al. 2022; Dominici et al. 2022). The exponential of the mean spline prediction is shifted such that the prediction at the minimum concentration is unity.

The uncertainty in \({\hat{s}}_{\hat{\beta}}\left({z}_C\right)-{\hat{s}}_{\hat{\beta}}\left({z}_F\right)\) is given by the following:

$$\operatorname{var}\left\{{\hat{s}}_{\hat{\beta}}\left({z}_C\right)-{\hat{s}}_{\hat{\beta}}\left({z}_F\right)\right\}=\left(b\Big({z}_C|\kappa \right)-b\left({z}_F|\kappa \right)\Big){}^{\prime}\hat{V}\left(b\left({z}_C|\kappa \right)-b\left({z}_F|\kappa \right)\right)$$

where b(z| κ) = (b1(z| κ), …, bJ(z| κ)) and \(\hat{V}\) is the estimated covariance matrix of \(\hat{\beta}={\left({\hat{\beta}}_1,\dots, {\hat{\beta}}_J\right)}^{\prime }\), the estimates of the spline parameters.

We note that

$$\operatorname{var}\left\{{\hat{s}}_{\hat{\beta}}\left({z}_C\right)-{\hat{s}}_{\hat{\beta}}\left({z}_F\right)\right\}=\operatorname{var}\left({\hat{s}}_{\hat{\beta}}\left({z}_C\right)\right)+\operatorname{var}\left({\hat{s}}_{\hat{\beta}}\left({z}_F\right)\right)-2\operatorname{cov}\left({\hat{s}}_{\hat{\beta}}\left({z}_C\right),{\hat{s}}_{\hat{\beta}}\left({z}_F\right)\right)$$

cannot be determined from the mean spline predictions over the concentration range and their associated variance since \(\operatorname{cov}\left({\hat{s}}_{\hat{\beta}}\left({z}_C\right),{\hat{s}}_{\hat{\beta}}\left({z}_F\right)\right)\) cannot be calculated from these summary results. This is true even for the case where burden assessments are of interest. Here, all future concentrations are set to a counterfactual. However, the uncertainty in the spline prediction at the counterfactual is often positive.

One can form relative risk predictions such that \(\operatorname{var}\left({\hat{s}}_{\hat{\beta}}\left({z}_{cf}\right)\right)=0\) and thus \(\operatorname{cov}\left({\hat{s}}_{\hat{\beta}}\left({z}_C\right),{\hat{s}}_{\hat{\beta}}\left({z}_{cf}\right)\right)\) =0. We then have: \(\operatorname{var}\left\{{\hat{s}}_{\hat{\beta}}\left({z}_C\right)-{\hat{s}}_{\hat{\beta}}\left({z}_{cf}\right)\right\}=\) \(\operatorname{var}\left({\hat{s}}_{\hat{\beta}}\left({z}_C\right)\right)\), that can be determined from the spline prediction confidence intervals given knowledge of the distribution of \({\hat{s}}_{\hat{\beta}}\left({z}_C\right)\). Unfortunately, uncertainty in the difference between spline predictions at any two concentrations greater than the counterfactual cannot be determined alone from the uncertainty in spline predictions at the two concentrations even if \(\operatorname{var}\left({\hat{s}}_{\hat{\beta}}\left({z}_{cf}\right)\right)=0\).

We conclude from these observations that the spline parameter estimates and their covariance matrix should be reported in addition to knot locations and type of spline used. Alternatively, many spline curves could be reported based on random draws of the spline parameters.

Spline representations often describe the association between concentration and response in a complex manner, including steep changes or waviness in the association over narrow concentration intervals. These types of associations may be of questionable biological plausibility and thus are not desirable features in a benefits assessment. An extension of the SCHIF (eSCHIF) was proposed to transform spline predictions into a form thought to be more suitable for benefits assessment (Brauer et al. 2022). Here

$$\textrm{lneSCHIF}\left(\overset{\sim }{z}\right)={\theta}_0\ln \left(\frac{\tilde{z}}{\alpha_0}+1\right)+{\theta}_1\ \frac{\ln \left(\frac{\tilde{z}}{\alpha_1}+1\right)}{1+\exp \left(-\frac{\overset{\sim }{z}-\mu }{\tau}\right)}$$

This algebraic function was fit to each of the 1000 series of spline predictions over the concentration range 0 to 20 μg/m3 (Brauer et al. 2022). Restrictions on the parameters (α0, α1, τ) were imposed in order to smooth each series of spline predictions. However, no restrictions were made on (θ0, θ1), resulting in fitted curves that were not necessarily monotonically increasing.

In this paper, we propose a new approach to using spline predictions of concentration-response associations that are monotonically increasing in the mean predictions over concentration, which we suggest is an additional desirable feature for benefits assessment. We allow the uncertainty in these predictions to possibly include the null or negative associations at any concentration, a property like the case if the original study data were in fact fit with a monotonically increasing spline (Pya and Wood 2015).

Monotonically increasing shape constrained health impact functions (mSCHIF)

Our approach is to generalize the eSCHIF to be able to characterize a wider variety of shapes and constrain the fit to be monotonically increasing. Under such constraints, the logarithmic function may not be flexible enough to model all the possible desirable shapes predicted by splines. In addition, using two terms may also not be flexible enough to model all possible spline predictions and their associated uncertainty under the additional constraint of monotonicity. We thus propose a new spline formulation defined by the following:

$$\textrm{mSCHIF}\left(\overset{\sim }{z}\right)=\exp \left\{\ \sum_{l=1}^L{\theta}_l\mathcal{M}\left(\overset{\sim }{z}|{\alpha}_l,{\lambda}_l,{\mu}_l,{\tau}_l\right)\right\}$$

with terms: \(\mathcal{M}\left(\overset{\sim }{z}|{\alpha}_l,{\lambda}_l,{\mu}_l,{\tau}_l\right)=\mathcal{F}\left(\overset{\sim }{z}|{\alpha}_l,{\lambda}_l\right)\mathcal{L}\left(\overset{\sim }{z}|{\mu}_l,{\tau}_l\right),l=1,\dots, L\), composed of the product of two functions

$$\mathcal{F}\left(\overset{\sim }{z}|\alpha, \lambda \right)={\int}_0^{\tilde{z}}{\left(1+{\left(\frac{x}{\alpha}\right)}^{\lambda}\right)}^{-1}\partial x$$

and

$$\mathcal{L}\left(\overset{\sim }{z}|\mu, \tau \right)={\left(1+\mathit{\exp}\left(-\frac{\overset{\sim }{z}-\mu }{\tau}\right)\right)}^{-1}$$

with θl ≥ 0, 0 < αl, μl, τl ≤ r, and 0 ≤ λl ≤ r, (l = 1, …, L), where r is the range in concentration.

\(\mathcal{F}\) is a monotonically increasing supra-linear function while \(\mathcal{L}\) is a two-parameter logistic function taking either sub-linear or sigmodal shapes over the concentration range depending on the values of μ and τ. When λ = 0, \(\mathcal{F}\) is linear in concentration, when λ = 1, \(\mathcal{F}\) is log-linear in concentration, and when λ = 2, \(\mathcal{F}\) is an arctangent function. For all other values of λ, an explicit form of \(\mathcal{F}\) is not available; thus, numerical integration is required.

\(\mathcal{F}\) has the property that it is nearly linear when \(\overset{\sim }{z}<\alpha\), with this approximation improving as λ increases. The decline in the derivative of \(\mathcal{F}\) increases as λ increases for \(\overset{\sim }{z}>\alpha\), resulting in a relatively flat functional form for \(\mathcal{F}\) over this concentration range. The smaller value of τ, the less increase in \(\mathcal{L}\) for \(\overset{\sim }{z}<\mu\). These properties of \(\mathcal{F}\) and \(\mathcal{L}\) facilitate adding mSCHIF terms together under the constraint (θl ≥ 0, l = 0, …, L). We view the mSCHIF as a spline-type formulation with \(\mathcal{F}\left(\overset{\sim }{z}|{\alpha}_l,{\lambda}_l\right)\mathcal{L}\left(\overset{\sim }{z}|{\mu}_l,{\tau}_l\right)\) as their basis functions. They can take on a wide variety of shapes depending on the values of their associated parameters, a property not shared by traditional spline models.

mSCHIF parameter estimation and inference

We estimate the mSCHIF parameters by non-linear regression (nlxb routine in R package: nlsr 2020) where the response is the mean spline predictions, \({\hat{s}}_{\hat{\beta}}\left({z}_i\right)-{\hat{s}}_{\hat{\beta}}\left({z}_{cf}\right)\), i = 1, …, I, over the concentration range, with zcf representing the minimum concentration of interest. We estimate the L × L covariance matrix, Σ, of \(\left({\hat{\theta}}_l,l=1,\dots, L\right)\) by equating the covariance of the spline predictions across concentrations to the covariance of the mSCHIF predictions. That is:

$${X}^{\prime}\hat{V}X={T}^{\prime}\Sigma T,$$

where X is a J × I matrix consisting of the J spline bases functions evaluated at the I , and T is a L × I matrix consisting of the L mSCHIF bases functions evaluated at the I concentrations. We then have the following:

$$\hat{\Sigma}={\left(T{T}^{\prime}\right)}^{-1}{TX}^{\prime}\hat{V}X{T}^{\prime }{\left(T{T}^{\prime}\right)}^{-1}$$

The mSCHIF predictions will in most cases not fit the spline predictions perfectly. We incorporate this uncertainty by adding the squared difference between the two model predictions at each concentration to the covariance of the spline predictions. We then have the following:

$$\hat{\Sigma}={\left(T{T}^{\prime}\right)}^{-1}T\ \left({X}^{\prime}\hat{V}X+\hat{D}\right){T}^{\prime }{\left(T{T}^{\prime}\right)}^{-1}$$

where \(\hat{D}\) is a I × I diagonal matrix of the squared differences in predictions among the two models.

We assign all the uncertainty in the mSCHIF predictions to L parameters, \(\hat{\theta}=\left({\hat{\theta}}_l,l=1,\dots, L\right)\), assuming that \(\left({\hat{\alpha}}_l,{\hat{\lambda}}_{l1},{\hat{\mu}}_l,{\hat{\tau}}_l,l=1,\dots .,L\right)\) are known without error. However, we are capturing the total amount of uncertainty characterized by the spline by equating the spline prediction uncertainty to that of the mSCHIF. We assume the difference between lnmSCHIF predictions at zC and zF is normally distributed with mean

$$\hat{m}\left({\tilde{z}}_C,{\tilde{z}}_F\right)={\left(T\left({\tilde{z}}_C\right)-T\left({\tilde{z}}_F\right)\right)}^{\prime}\hat{\theta}$$

and standard deviation

$$\hat{\sigma}\left({\tilde{z}}_C,{\tilde{z}}_F\right)=\sqrt{{\left(T\left({\tilde{z}}_C\right)-T\left({\tilde{z}}_F\right)\right)}^{\prime}\hat{\Sigma}\left(T\left({\tilde{z}}_C\right)-T\left({\tilde{z}}_F\right)\right)},$$

with \({\tilde{z}}_C={z}_C-{z}_{cf}\) and \({\tilde{z}}_F={z}_F-{z}_{cf}\).

In some cases, reporting a large number, N, of spline predictions over the concentration range is more convenient than reporting a potentially very large number of bases functions and parameter estimates, as would be the case for some smoothing splines. We first subtract the spline prediction at the counterfactual for each of the N series of predictions, resulting in a prediction of zero for each of the N series. We do this since the lnmSCHIF prediction at the counterfactual is zero with zero uncertainty. We then replace \({X}^{\prime}\hat{V}X\) by the empirical covariance among the N predictions for all pairs of concentrations.

Illustrative examples

We provide two examples to illustrate some of the properties of our method. Both examples examine the association between mortality and exposure to outdoor concentrations of fine particulate matter (PM2.5) based on primary data from the Medicare cohort (Dominici et al. 2022) and the 2006 Canadian Census Health and Environment Cohort (CanCHEC) (Chen et al. 2022). In both examples, annual PM2.5 estimates were based on the same geostatistical model that incorporated satellite retrievals, chemical transport model estimates, and ground data (Meng et al. 2019).

Medicare cohort

The Medicare cohort consists of subjects enrolled in the Medicare program in the USA over the age of 65 years (Dominici et al. 2022). For this example, the cohort includes over 74 million enrollees entered from 2000 to 2016. Subject-level information included age, sex, race/ethnicity, and Medicaid eligibility (measure of lower income). Zip code area (ZIP) measures include median household income, median house value, proportion of residents in poverty, proportion of residents that own their own house, and proportion of residents with a high school diploma. The databases include the fact and date of death from any cause and the time-varying ZIP code of the enrollees mailing address. Subjects were followed until their date of death or until December 31, 2016, the end of the study period. A Poisson regression model was used with the response defined by the count of all-cause deaths during each follow-up year, calendar year, ZIP code, age (5-year groupings), sex, and race; offset by the corresponding total person-time under observation. Annual PM2.5 concentrations were assigned to the same year of follow-up. A smoothing spline was used to characterize the magnitude and shape of the PM2.5-mortality association with the smoothing parameter estimated from the data controlling for the five ZIP code area variables as linear terms in addition to the four category Census Region variable.

A smoothing spline was fit within the Generalized Additive Model framework using the R package “gam” and the s(PM2.5, bs = tp) specification with the smoothing parameter estimated from the data by cross-validation. We extracted 1000 draws of the smoothing spline parameters and generated 1000 sets of spline predictions over the concentration range. We then calculated the empirical covariance matrix based on the spline predictions at all possible pairs of concentrations among the 1000 sets of predictions and used this matrix to estimate the covariance matrix Σ.

The mean spline predictions over the 1000 sets of predictions from the minimum concentration (0 μg/m3) to the 99th percentile (17.4 μg/m3) (solid red line, Fig. 1, panel A) suggest a complex association between PM2.5 and non-accidental mortality. A near linear increase over the 0 −3 μg/m3 range, flattening of the association over 3 −6 μg/m3 range, sharply increasing over the 6 −9 μg/m3 range, and finally a less steep increase above 9 μg/m3. The smoothing spline mean predictions display several waves over the concentration range that are not present in the mSCHIF mean predictions (solid blue line – panel A). The lack of monotonically increasing spline predictions and the presence of waves makes it less suitable for benefits analysis. The mSCHIF 95% confidence intervals (gray shaded area) are of similar magnitude to the spline confidence intervals (dashed red lines).

Fig. 1
figure 1

Smoothing spline mean PM2.5 predictions over concentration range 0 to 17.4 μg/m3 (solid red line; panel A) for the Medicare cohort with corresponding 95% confidence intervals (dashed red lines) in addition to mSCHIF mean predictions (solid blue line) and 95% confidence intervals (gray shaded area). mSCHIF terms (\(\exp \left({\hat{\theta}}_1\mathcal{M}\left(\overset{\sim }{z}|{\hat{\alpha}}_1,{\hat{\lambda}}_1,{\hat{\mu}}_1,{\hat{\tau}}_1\right)\right)\)—solid red line; \(\exp \left({\hat{\theta}}_2\mathcal{M}\left(\overset{\sim }{z}|{\hat{\alpha}}_2,{\hat{\lambda}}_2,{\hat{\mu}}_2,{\hat{\tau}}_2\right)\right)\)—solid black line; \(\exp \left({\hat{\theta}}_1\mathcal{M}\left(\overset{\sim }{z}|{\hat{\alpha}}_1,{\hat{\lambda}}_1,{\hat{\mu}}_1,{\hat{\tau}}_1\right)+{\hat{\theta}}_2\mathcal{M}\left(\overset{\sim }{z}|{\hat{\alpha}}_2,{\hat{\lambda}}_2,{\hat{\mu}}_2,{\hat{\tau}}_2\right)\right)\)—dashed blue line) are presented in panel B

The two mSCHIF terms, \(\exp \left({\hat{\theta}}_0\mathcal{F}\left(\overset{\sim }{z}|{\hat{\alpha}}_1,{\hat{\lambda}}_1\right)\mathcal{L}\left(\overset{\sim }{z}|{\hat{\mu}}_1,{\hat{\tau}}_1\right)\right)\) (solid red line) and \(\exp \left(\ {\hat{\theta}}_1\mathcal{F}\left(\overset{\sim }{z}|{\hat{\alpha}}_2,{\hat{\lambda}}_2\right)\mathcal{L}\left(\overset{\sim }{z}|{\hat{\mu}}_2,{\hat{\tau}}_2\right)\right)\) (solid black line) are plotted against PM2.5 in Fig. 1 (panel B). A linear increase is observed for \(\exp \left({\hat{\theta}}_1\mathcal{F}\left(\overset{\sim }{z}|{\hat{\alpha}}_1,{\hat{\lambda}}_1\right)\mathcal{L}\left(\overset{\sim }{z}|{\hat{\mu}}_1,{\hat{\tau}}_1\right)\right)\) from 0 to 2 μg/m3 with another linear increase with smaller slope for concentrations greater than 2 μg/m3. The second mSCHIF term, \(\exp \left({\hat{\theta}}_2\mathcal{F}\left(\overset{\sim }{z}|{\hat{\alpha}}_2,{\hat{\lambda}}_2\right)\mathcal{L}\left(\overset{\sim }{z}|{\hat{\mu}}_2,{\hat{\tau}}_2\right)\right)\), displays a sigmodal shape with inflection point \({\hat{\mu}}_2=7.76\ \upmu \textrm{g}/{\textrm{m}}^3\).

We compare the mSCHIF (solid blue line) and eSCHIF (solid brown line) predictions for this example as both models consist of two terms (Figure S1). The eSCHIF predictions are not monotonically increasing with concentration. The mSCHIF predictions more accurately reflected the mean spline predictions than the eSCHIF predictions in this example. This is due to the added flexibility of the \(\mathcal{F}\) function incorporated into the mSCHIF compared to the logarithmic function used by the eSCHIF.

2006 Canadian Census Environment and Health Cohort (CanCHEC)

The 2006 CanCHEC (Chen et al. 2022) consists of 2,663,645 non-institutionalized subjects aged 30–79 years who completed the 2006 Canadian census long-form. Subject-level information included age, sex, education, income, occupation, marital status, and visible minority status. Area level contextual information included multiple measures of social and economic status, in addition to community size and region of Canada. Home addresses were annually identified by linkage to income tax records. Each subject was followed over time from time from January 1, 2007 to December 31, 2016 to determine their vital status by linkage to mortality records, resulting in 25,730,790 person-years of observation. Cox proportional hazards models were used to relate time-varying air pollution exposures, based on a 3-year moving average lagged one prior to follow-up year, to survival from all non-accidental causes of death (213,882), adjusting for subject and area level mortality risk factors. A restricted cubic spline (Harrel 2015) (RCS) was used to characterize the magnitude and shape of the PM2.5-mortality association based on minimizing the Akaike information criterion (Akaike 1974), resulting in the selection of 10 knots.

We illustrate a modelling feature of the mSCHIF by selectively characterizing specific patterns in the mean spline predictions and associated uncertainty. The RCS mean predictions from the minimum concentration (2.5 μg/m3) to the 99th percentile (11.9 μg/m3) (solid red line, Fig. 2, panel A1) suggest a complex association between PM2.5 and non-accidental mortality with a sub-linear increase over the 2.5 − 5 μg/m3 range, a wavey pattern in the association with a slight increase in magnitude over the 5 − 10 μg/m3 range, followed by a much more rapid increase above 10 μg/m3. We would like to capture the sub-linear increase between 2.5 and 5 μg/m3, replace the wavey pattern between 5 and 10 μg/m3 with a linear increase, and finally closely model the rapid increase in risk above 10 μg/m3. We can model these features by limiting the allowable range in the μl mSCHIF parameters.

Fig. 2
figure 2

Mean restricted cubic spline PM2.5 predictions over concentration range 2.5 to 11.9 μg/m3 (solid red line) for the 2006 CanCHEC cohort with corresponding 95% confidence intervals (dashed red lines) in addition to mSCHIF mean predictions (solid blue line) and 95% confidence intervals (gray shaded area) for 2-, 3-, and 4-term mSCHIF in panels A1, B1, and C1 respectively. Spline knot locations represented by green tick marks on x-axis. Corresponding mSCHIF terms presented in panels A2, B2, and C2

We start with two mSCHIF terms with the following parameter limitations: 0 < μ1 < 2.5 and 2.5 < μ2 < r. Note that the μl values are defined with respect to PM2.5 − 2.5 μg/m3. We select 2.5 μg/m3 for the upper bound on μ1 since the logit function \(\mathcal{L}\left(\overset{\sim }{z}|{\mu}_1,{\tau}_1\right)\) is sub-linear for concentrations below μ1 and supra-linear above. The resulting mean mSCHIF predictions are displayed in Fig. 2 (panel A1 – solid blue line), yielding a concentration-mortality pattern near the one we are designing for. However, the two-term mSCHIF uncertainty (gray shaded area) is clearly underestimated compared to that of the spline (dashed red lines) when PM2.5 < 5 μg/m3, but gives a reasonable uncertainty estimate when PM2.5 > 5 μg/m3.

To improve our estimate of uncertainty when PM2.5 < 5 μg/m3, we add more mSCHIF terms such that the μ upper bounds are less than 2.5 μg/m3. The three-term mSCHIF fit is displayed in Fig. 2, panel B1, with an improvement in uncertainty estimation. The four-term mSCHIF (Fig. 2, panel C1) gives an even better approximation to the spline uncertainty. The corresponding 2-,3-, and 4-term mSCHIF terms are displayed in Fig. 2, panels A2, B2, and C2 respectively.

Comparison with the Log-Linear model

The Log − Linear is the most frequently used concentration-response model to estimate public health benefits due to exposure to outdoor air pollution. We estimated the slope parameter of the log-linear model, β, for each cohort using the exact same dataset as was used to fit the splines. Here we compare estimates of attributable deaths due to selected percentage reductions in outdoor concentrations to PM2.5 based on the mSCHIF and Log − Linear model representations of the relative risk function (Fig. 3).

Fig. 3
figure 3

CanCHEC mSCHIF mean predictions (solid blue line) and 95% confidence limits (gray shaded area) over Canadian population based PM2.5 distribution (orange vertical bars) overlaid with Log − Linear model mean predictions (solid black line) and 95% confidence intervals (dashed black lines) in panel A1. Panel A2 presents mean marginal changes in relative risk per 0.1ug/m3 (mSHCIF—solid blue line; Log − Linear—solid black line) and 95% confidence intervals (mSHCIF—gray shaded area; Log − Linear—dashed black lines) (panel A2) over Canadian population based PM2.5 distribution. Attributable deaths due to selected percentage reductions in PM2.5 presented in panel A3. Corresponding representation for Medicare cohort presented in panels B1–B3

We extracted PM2.5 estimated outdoor concentrations for the 3-year average (2015–2017) from the Air Quality Benefits Assessment Tool (AQBAT) (Judek et al. 2023) used by Health Canada for each of the 293 census divisions in Canada in addition to the corresponding number of non-accidental deaths in Canadians over the age of 25 years. We also extracted PM2.5 estimated outdoor concentrations for 2018 at 45,011 1km grid cells for the continental USA from Environmental Benefits Mapping and Analysis Program – Community Edition (BenMAP – CE) (U.S. EPA 2023) with corresponding number of all causes of death over the age of 65 years for 2020. These death categories correspond to those used to estimate the relative risk models of the two cohorts.

In CanCHEC, any PM2.5 person-year exposure value less than 2.5 μg/m3 was set to 2.5 μg/m3, representing 0.5% of all person-years of follow-up (Brauer et al. 2022). We made the same assignment for the census division concentrations, affecting 1% of the total deaths. To represent the population exposure to PM2.5, we constructed a death-weighted PM2.5 distribution by either census division or grid cell (orange vertical bars, Fig. 3). The Canadian mean death-weighted exposure was 6.1 μg/m3 (sd=1.60) while the USA mean was 8.1 μg/m3 (sd=1.69).

The CanCHEC mSCHIF and \(\textrm{Log}-\textrm{Linear}:\hat{\upbeta}=0.01056\ \left({\hat{\sigma}}_{\hat{\beta}}=0.00166\right)\) relative risk functions are plotted against PM2.5 concentrations in Fig. 3 (panel A1). The mSCHIF mean predictions (solid blue line) are much larger than the corresponding Log − Linear throughout the concentration range due to the steep increase in risk prediction over the 2.5 to 5 μg/m3 range.

This example illustrates that large differences in magnitude between non-parametric spline and parametric model predictions can occur due to how these models use data. The parametric model uses all the data together to estimate the unknown parameters while the spline model primarily uses different segments of the data to estimate parameters associated with basis functions whose support corresponds to those segments. The spline model predicts a relative risk of 1.10 between 5 and 2.5 μg/m3 while the Log − Linear predicts a 1.026 relative risk. This difference in relative risk is maintained throughout the concentration range. The 2.5 to 5 μg/m3 range represents 20% of the person-years of follow-up in the cohort; however, this rapid increase in relative risks appears to have minimal influence on the estimate of the Log − Linear parameter β.

To further examine differences between the two model predictions, we plot the change in the logarithm of the relative risk by 0.1 μg/m3 increments. These marginal changes in risk indicate a complex pattern between the models over concentration with the mSCHIF displaying a larger change between 2.5 and 5 μg/m3 compared to the constant change predicted by the Log − Linear, but a similar change for concentrations greater than 5 μg/m3. These differences between the models are reflected in estimates of attributable deaths (panel A3) for selected percent decreases in concentration (10% to 100% by 10% increments) with larger attributable death estimates for the mSCHIF model compared to the Log − Linear model, and differences increasing with increasing percentage reductions in concentration. Note that the estimates of attributable deaths for 80%, 90%, and 100% are the same since these reductions change the highest concentration of 8.8 μg/m3 to values less than the counterfactual of 2.5 μg/m3.

We conducted a similar analysis for the Medicare cohort (panels B1–B3 of Fig. 3). In this cohort, the increase in relative risk over the 0 to 2 μg/m3 range was similar for both models (panel B1). The \(\textrm{Log}-\textrm{Linear}:\hat{\beta}=0.00494\ \left({\hat{\sigma}}_{\hat{\beta}}=0.00040\right)\) model predictions were larger than the mSCHIF over the 2 to 8 μg/m3 range, but smaller when PM2.5 > 8 μg/m3 (panel B1). The marginal changes in risk were also similar for the two models in the 0 to 2 μg/m3 range, but the Log − Linear model predicted larger marginal changes in the 2 to 6 μg/m3 range, smaller changes in the 6 to 10 μg/m3 range, and similar changes when PM2.5> 10 μg/m3 compared to the mSCHIF predictions (panel B2). Attributable death estimates were lower for the Log − Linear model for the 10–70% reductions compared to the mSCHIF estimate, but larger for the higher percent reductions.

Discussion

In order to estimate the public health burden of exposure to outdoor concentrations of air pollution, the magnitude and shape of the association between outdoor concentrations and a health outcome must be characterized in addition to its uncertainty. Several attempts have been made using parametric functions indexed by a few parameters (Cohen et al. 2004; Chen and Hoek 2020; Burnett et al. 2014, 2018, 2022). However, recent interest has focused on non-parametric approaches using either regression (Brauer et al. 2022; Brunekreef et al. 2021) or smoothing (GBD 2019; Dominici et al. 2022) splines. Typically, no restrictions on the shape of the splines are made, resulting in non-monotonic predictions with multiple changes in direction that make these functions less desirable for public health benefits analysis. Limiting the number of terms in a regression spline can smooth the curve (Brunekreef et al. 2021), thus making it more useful for benefits analysis. However, it is not clear how good this approach is at characterizing the concentration-response shape. Even smoothing splines, if not restricted, can result in non-monotonicity with waviness as evidenced by our Medicare example.

In these cohort studies, an estimate of the outdoor concentration near a subject’s home is related to their survival. There exists a complex association, that varies by subject, between their estimate of outdoor concentration and their personal exposure (Hammond et al. 2014), with an equally likely complex association between their personal exposure and probability of surviving any fixed time period. In addition, it is likely that components/sources of particulate matter have a different, and potentially non-linear association, with survival (Henneman et al. 2023), resulting in a potentially complex concentration-response pattern based on total mass. Even if every component/source of particulate matter had a linear concentration-response, total mass may have a non-linear association due to variations in the atmospheric mixture at different total mass concentrations.

Although simple approximations have been suggested, such as linear or supra-linear, we suggest that we in fact do not know what the true concentration-response pattern should look like and can only rely on data to guide us. In this paper, we suggest a method that first identifies the best fitting unrestricted spline to the data, then transform the spline fit and its uncertainty to a monotonic function suitable for benefits assessment. We make few restrictions on the algebraic form of the fit, other than monotonicity.

We also suggest that it is important to conduct cohort-specific concentration-response analysis to examine the consistency, or lack of, in the shape among cohorts. Pooling these shapes provides a means to constructing a common concentration-response model as was done for the Global Exposure Mortality Model (GEMM) (Burnett et al. 2018).

However, there is interest to examine the concentration-response shape of cohorts representing the population of specific judications; as demographics, disease distribution, and how total mass represents the toxicity of the atmospheric mixture can vary by region. This raises concerns that a common concentration-response based on global studies may be a poor approximation to that of any specific region or country.

The Global Burden of Disease program (GBD 2019) fit a smoothing spline with their Integrated Exposure-Response (IER) meta-regression framework to predict risk from PM2.5 exposure to both outdoor and indoor pollution covering a very large range (0 to 1000 μg/m3) by combining information from multiple studies. They restricted the spline to be monotonically increasing and concave resulting in a supra-linear curve. However, such restrictions for individual studies may be too limiting as evidenced by our two examples. Restricting spline fits to be monotonically increasing may still not lead to a curve useful for benefits analysis as it may contain several concentration-response patterns like step-functions over narrow concentration intervals.

We address these limitations by proposing a new spline formulation, denoted as “monotonically increasing Shape Constrained Health Impact Function” or mSCHIF. The mSCHIF consists of a series of basis functions, not defined by knot values, but by four parameters, that can model a wide variety of shapes including near-linear, sub-linear, supra-linear, and sigmodal. A linear combination of these basis functions, multiplied by a parameter restricted to be positive, creates a monotonically increasing wide variety of shapes that we suggest can model virtually any monotonically increasing concentration-response pattern of interest. Since the basis functions can take a variety of shapes, far fewer are needed to make acceptable predictions compared traditional spline formulations. We also capture the totality of uncertainty in the spline predictions by equating the covariance matrix of the spline predictions over the concentration range to that of the mSCHIF predictions.

We give guidance to investigators in how to present results from spline model predictions. The typical method of reporting a spline fit is to plot mean predictions and their 95% confidence intervals, transformed in a manner such that the spline prediction at a pre-specified counterfactual concentration, often the minimum, is set to zero. However, if the uncertainty at the counterfactual is also not set to zero, the curve cannot be used for burden (all concentrations reduced to the counterfactual) analysis. Even when the uncertainty at the counterfactual is set to zero, the spline fit cannot be used for benefits analysis (difference in risk between any two specific concentrations) since the uncertainty in risk associated with such changes in concentration cannot be determined from this summary information. The spline parameter estimates, their covariance matrix, knot locations, and type of spline need to be reported. Alternatively, reporting 1000 sets of spline predictions over the concentration range is sufficient for benefits analysis using the spline fit directly. This information can also be used to estimate the mSCHIF. Benefits estimates and its uncertainty can be calculated from the mSCHIF parameter estimates and covariance matrix \(\hat{\Sigma}\).

Our comparison between the mSCHIF and Log − Linear model relative risk, marginal change in risk, and attributable deaths estimates, suggest there can be considerable departures in the concentration-response from linearity. These differences could potentially lead to differences in policy development.

Our two examples involved fitting either a regression or smoothing spline to primary data from cohorts. However, there is interest in characterizing concentration-response by combining information from multiple cohorts (GBD 2019; Burnett et al. 2014, 2018, 2022). Our mSCHIF is another option to create a common curve if an unrestricted spline fit is available to the meta-data.

Although we accessed mortality counts and PM2.5 concentration data from AQBAT and BenMAP-CE, we preformed the benefits calculations externally to these computer programs. We did this due to how both programs accept input on the relationship between concentration and mortality. They were constructed to use the information provided by the Log − Linear model, namely a single parameter estimate and its standard error, assuming a linear association between the logarithm of the hazard function and concentration. Since the mSCHIF is constructed as the sum of parameter estimates multiplied by transformations in concentration, with uncertainty described in terms of a multivariate normal distribution, it cannot be directly used by these computer programs.

We specifically selected spline model fits that met several criteria in order to illustrate properties of our new model. The spline fits had to display complex associations between PM2.5 concentrations and mortality in such a manner was it would likely not be suitable for benefits analysis; such as having several changes in direction and thus not being monotonically increasing with concentration. The mSCHIF can accept two formats of information from the spline fit: spline parameter estimates, their covariance matrix, knot locations, and type of spline or multiple sets of spline predictions over the concentration range. We selected a regression spline for the CanCHEC example where we accessed the first type of information and a smoothing spline for the Medicare example where we accessed the second type. We therefore did not select these fits as the most appropriate representation of the PM2.5-mortality association in either Canada or the USA. We did, however, further conduct benefits analysis to highlight potential differences in attributable death estimates between the mSCHIF and the most used model for such purposes. We thus highlighted the fact that characterizing the exposure-response relationship in a potentially non-linear manner may influence policy decisions compared to using the traditional benefits analysis Log − Linear model.