Pricing and risk of swing contracts in natural gas markets

Kohrs, Hendrik; Mühlichen, Hermann; Auer, Benjamin R.; Schuhmacher, Frank

doi:10.1007/s11147-018-9146-x

Pricing and risk of swing contracts in natural gas markets

Published: 22 June 2018

Volume 22, pages 77–167, (2019)
Cite this article

Review of Derivatives Research Aims and scope Submit manuscript

Hendrik Kohrs¹,
Hermann Mühlichen²,
Benjamin R. Auer^1,3 &
…
Frank Schuhmacher¹

642 Accesses
3 Citations
Explore all metrics

Abstract

Motivated by the growing importance of swing contracts in natural gas markets, this article extends the literature on commodity price modelling as well as valuation methods and sensitivity analysis for swing options. While most previous studies focused on simple price models, we face the challenge of deriving option properties under more realistic commodity price dynamics. We begin by formulating a multi-factor price forward curve model with parametric volatility functions, which can capture uncertainty in both yearly seasonality and time-to-maturity effects, and propose a two-step calibration procedure to fit such models to empirical data. We then show how results from the literature can be combined to obtain swing option values and sensitivities in such a general framework. In this context, we also provide new theoretical results and a first numerical approach to efficiently estimate swing options’ gammas. For options’ deltas, we expand upon existing studies by including a larger variety of contract specifications and by focusing on a multidimensional variant of the Longstaff–Schwartz algorithm as an alternative option valuation method. With these contributions, we supply important tools for swing option sellers and buyers relying on accurate option value and risk estimates to maintain their business models, hedge option-related risks and adequately represent swing options in financial reporting.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

The affine styled-facts price dynamics for the natural gas: evidence from daily returns and option prices

Article 27 April 2016

Option Pricing and Hedging Performance Under Stochastic Volatility and Stochastic Interest Rates

Commodity Spread Option with Cointegration

Article 07 December 2015

Notes

Joskow (1985, 1987) identifies common properties in coal contracts such as take-or-pay provisions and delivery schedules with minimum and maximum production and purchase obligations. Barbieri and Garman (1996) and Garman and Barbieri (1997) provide a systematization of different swing option variants.
Bardou et al. (2010) discuss the digital purchase property of swing options for absolute global constraints. Boogert and De Jong (2011) apply a multi-factor LSM to gas storage valuation.
For detailed overviews of other extensions, see Eydeland and Wolyniec (2003) and Geman (2005).
A recent application can be found in Kiesel et al. (2009). Clewlow and Strickland (2000, chpt. 8) and Eydeland and Wolyniec (2003, chpt. 5) provide general introductions to forward curve modelling.
For a brief discussion of such modifications, see Sect. 2.2.3.3.
The documented stylized facts are similar to characteristics which we can observe in electricity markets (see Hambly et al. 2009; Wahab et al. 2010).
More details on the VNG Group are available in its annual report of 2016 (see http://vng-gruppe.de/sites/default/files/vng_annual_report_2016_en_web.pdf).
Note that our focus is not a full empirical analysis of the European natural gas market. This sample serves the sole purpose of illustrating our methodologies. Thus, its small size is sufficient for our application.
$\Gamma $ is equal to the square-root of the corresponding correlation matrix $\Gamma ^2 = \Gamma \Gamma '$ (see Benmenzer et al. 2007).
We assume the existence of a unique equivalent martingale measure. The existence and uniqueness of such measures in different markets is a formidable topic in itself, so a detailed discussion is outside the scope of our study.
See Musiela and Rutkowski (2004) for a continuous-time proof or van der Hoek and Elliott (2006) for the discrete-time version of this result. Lioui and Poncet (2005, chpt. 2) summarize the general asset pricing implications of constant interest rates.
An alternative estimation approach is to use option-implied volatilities of forwards. Kiesel et al. (2009) consider a two-factor variant of the price model below for modelling electricity futures. They calibrate the model to options on electricity futures and discuss its performance in practical applications.
We do not intend to estimate the market price of risk. We refer to it simply to justify our approach.
Discretizing (2.1) directly can lead to negative forward prices in simulations because $\mathbb {P}(\Delta W(t) < c) >0$ for every $c\in \mathbb {R}$ if $\Delta W(t)$ is normally distributed. We avoid this problem by focusing on the log-variant (2.3).
On average, our sample data covers about 241 trading days per year. Thus, in what follows, we always use the factor $\sqrt{241}$ to annualize volatilities.
More precisely, $K = 48$ independent factors capture the entire return covariance structure.
Detailed results for the monthly and seasonal PCA are available upon request.
Each column $x_k(t_n)$ of $X_n$ contains a discretized normalized volatility function. In the case of (2.6), we have $x_1(t_n) = (1,...,1)'$, $x_2(t_n) = (\exp (-\textstyle {\frac{\vartheta }{12}}(\tau _j)))_{j=1,...,m}$ and $x_3(t_n) = (\cos (\textstyle {\frac{2\pi }{12}} (\tau _j + (t_n\bmod 12))))_{j=1,...,m}$. In this example, it is clear that $X_n$ has linearly independent columns.
We leave the detailed statistical properties of this estimator for future research.
We plot the resulting average $R^2$ dependent on the choice of $\vartheta $ in Fig. 12 of the appendix.
For fixed optimal $\vartheta $, we use the standard $95\%$ OLS confidence intervals.
If $\sigma _3(t,T)$ captures fewer season spreads at the front end of the forward curve (e.g. if we choose $(c_{\min }^{(3)}, c_{\max }^{(3)}) = (0,6)$), the TSLS procedure yields high negative correlations ($\rho _{12}$ and $\rho _{23}$) of short-term variation with both long-term and seasonal variation. This finding is not in line with our previous results which suggest positively correlated long- and short-term variation as well as very low correlation between seasonal variation and other factors. Thus, we have to be careful when choosing piecewise volatility functions.
The normalized seasonal volatility functions (in the case of $\tau _1$ being a January) are plotted in Fig. 14 of the appendix.
Additional sample regression plots are illustrated in Fig. 15 of the appendix.
We emphasize that we do not find monotonically increasing spread volatilities to be a general feature of the natural gas market. By applying another PFC methodology, which also fulfills the properties described in Sect. 2.1, on more recent market data, we also obtain low seasonal volatilities at the very front end of the curve, but find that spread variation peaks near 36 front months and subsequently decreases. To be precise, we estimate $\hat{\sigma }_{\text {sw}} = (1.88\%, 2.36\%, 2.67\%, 3.32\%, 2.85\%, 2.45\%)^{\prime }$ based on PFC data for the year 2015.
This can be deduced from our log return plots (see, for example, Figs. 4, 6).
Similar models have been used by Maciejowska and Weron (2013) for electricity price forecasting. They propose a factor model to reduce the dimension of the original multivariate time-series and suggest to model the factors based on a vector autoregressive process.
We restrict our attention to this model because our alternative model with piecewise seasonal volatility functions (2.18) allows similar conclusions.
The derivation of the drift term (2.26) can be found in Appendix C.
We drop the contract $j = 48$ because our P-TSLS procedure requires a total number of months which can be divided by six.
This is why we keep the spot price extension simple but point out that, of course, it could be improved.
We truncate outliers by considering data points inside the three sigma band only. As the outliers cluster together, a regime-switching model extension might be considered. However, this is beyond the scope of this article because we focus on deriving option pricing and sensitivity results based on simpler processes.
We can easily translate the results of the following sections to the price model introduced in Sect. 2.
We address the use of information in decision making more explicitly in Sect. 3.4.
Broadie and Glasserman (1996) have developed a pathwise derivative method to determine sensitivities of option prices. This pathwise approach is justified if we can interchange differentiation and taking expectations. While this is not allowed in our context, we are still able to express first-order sensitivities as the expectation of a pathwise calculable term.
We used the fact that, as shown in (3.15), the partial derivative of $S_T$ with respect to F(0, T) is a pathwise constant term such that the second-order partial derivative of $S_T$ vanishes.
In general, simulation methods are very popular in finance because they are easy to implement and, in most cases, independent of the price model (see Broadie and Glasserman 1997; Boyle et al. 1997; McLeish 2011).
In Appendix C, we illustrate how such errors can affect fair value and option sensitivity estimates. However, note that, to understand the presented numbers, the discussions of Sect. 4 should be read first.
A sequence $(x_n)$ of vectors in a Hilbert space H is said to be total, if the linear subspace spanned by $(x_n)$ is dense in H.
The choice of $R_{cond}$ is crucial for the performance of the valuation algorithm. We find unstable regression matrices X, which are very sensitive to input parameters, for barriers with low denominators around $\epsilon \cdot 10^3$. In contrast, a too high denominator around $\epsilon \cdot 10^9$ results in too small regression matrices, which contain only a few basis functions on average. Therefore, we suggest using a denominator around $\epsilon \cdot 10^6$ to ensure stability.
Besides the power class, other polynomial families like Chebyshev, Hermite, Legendre or Laguerre could be used in modelling. However, because most of them underperform and/or require more computation time, Boogert and De Jong (2011) conclude that powers are the preferred choice of basis functions.
The existence of the functional relationship $\mathcal {C}_t^r(F_t)$ is guaranteed by the Markov property and the factorization lemma of measure theory (see Klenke 2007, Corollary 1.97).
Details of this problem are illustrated in the simulation study of Sect. 4.4.
Because seasonal variation is not correlated with the other factors at economically significant levels, we set the corresponding correlation estimates equal to zero.
For simplicity and because it does not qualitatively change our results, we disregard discounting by setting the interest rate equal to zero.
In Sect. 4.2, we discuss the implications of larger values for N. A fixed random seed is of special relevance for the considerations of Sect. 4.4.
Figure 16 of the appendix illustrates some sample PFCs resulting in our simulations.
Our performance measure is not designed to compare estimated values and true values because the latter are unknown. We ensure accuracy by basing our estimation approach on solid theory and previous results.
Appendix C illustrates the Monte Carlo error for option value estimates. Because it also covers the errors for option sensitivities, we advise the reader to finish Sect. 4 before studying the appendix.
In Sect. 4.3.2, we find support for the hypothesis that this phenomenon might be related to the suboptimality of the forwards strategy.
This can be shown using random state variables as basis functions which are not correlated with the expected future payoff. The detailed results of such a simulation are available upon request.
Some instructions on this topic are given in Asmussen and Glynn (2007, sec. 2a).
For the commonly traded energy band products (“swing” contracts with $Q_{\min } = Q_{\max } = 365$), we do not need the swing option valuation methodology.
The detailed results of the analysis with sample size $N = 5000$ are reported in Table 11 of the appendix.
Recall that T has to fit calendar month numbering, i.e., $T \bmod 12 = c$ with $c \in \{0,...,11\} \, \hat{=} \{\text {Jan},..., \text {Dec}\}$.
We report the results for the corresponding PFD and TFD approaches in Table 12 of the appendix.
This proposition is similar to Bardou et al. (2010, Theorem 2). However, we avoid the triangular set notation introduced by the authors and provide a shorter proof by inducting backwards with Bellman’s equation.
Operations preserving convexity are discussed in, for example, Boyd and Vandenberghe (2009).
We obtain similar results for our forwards approach.
A theoretical discussion of an existing bias-variance tradeoff can be found in the last section of this appendix, where we focus on bandwidth choice for the gamma estimator.
The results for the other contracts and for the forwards approach are similar.
This would be the case if the integrand itself was differentiable almost everywhere (as stated by L’Ecuyer 2007). In fact, our analysis confirms exploding variance for small $\varepsilon _N$.

References

Alexandrov, A. (1939). Almost everywhere existence of the second differential of a convex function and some properties of convex surfaces connected with it. Leningrad State University Annals, Mathematics Series, 37, 3–35.
Google Scholar
Anupindi, R., & Bassok, Y. (1999). Supply contracts with quantity commitments and stochastic demands. In S. Tayur, G. Ram, & M. Magazine (Eds.), Quantitative models for supply change management (pp. 197–232). Boston: Kluwer Academic Publishers.
Chapter Google Scholar
Asmussen, S., & Glynn, P. (2007). Stochastic simulation: Algorithms and analysis (57th ed.). New York: Springer.
Book Google Scholar
Balvers, R., Wu, Y., & Gilliland, E. (2000). Mean reversion across national stock markets and parametric contrarian investment strategies. Journal of Finance, 55(2), 745–772.
Article Google Scholar
Barbieri, A., & Garman, M. (1996). Putting a price on swings. Energy Power Risk Management, 1(6), 17–19.
Google Scholar
Bardou, O., Bouthemy, S., & Pagés, G. (2010). When are swing options bang-bang and how to use it. International Journal of Theoretical and Applied Finance, 13(6), 867–899.
Article Google Scholar
Barrera-Esteve, C., Bergeret, F., Dossal, C., Gobet, E., Meziou, A., Munos, R., et al. (2006). Numerical methods for the pricing of swing options: A stochastic control approach. Methodology and Computing in Applied Probability, 8(4), 517–540.
Article Google Scholar
Bellman, R. (1952). On the theory of dynamic programming. Proceedings of the National Academy of Sciences, 38(8), 716–719.
Article Google Scholar
Bellman, R. (1957). Dynamic programming. Princeton, NJ: Princeton University Press.
Google Scholar
Benmenzer, G., Gobet, E., & Jérusalem, T. (2007). Arbitrage free cointegrated models in gas and oil future markets. Working Paper, Cornell University.
Black, F. (1976). The pricing of commodity contracts. Journal of Financial Economics, 3(1–2), 167–179.
Article Google Scholar
Black, F., & Scholes, M. (1973). The pricing of options and corporate liabilities. Journal of Political Economy, 81(3), 637–654.
Article Google Scholar
Bonnans, J., Cen, Z., & Christel, T. (2012). Sensitivity analysis of energy contracts by stochastic programming techniques. In R. Carmona, P. Del Moral, P. Hu, & N. Oudjane (Eds.), Numerical methods in finance (pp. 447–471). Berlin, Heidelberg: Springer.
Chapter Google Scholar
Bonnans, J., & Shapiro, A. (2000). Perturbation analysis of optimization problems. New York: Springer.
Book Google Scholar
Boogert, A., & De Jong, C. (2011). Gas storage valuation using a multi-factor price process. Journal of Energy Markets, 4(4), 29–52.
Article Google Scholar
Boyd, S., & Vandenberghe, L. (2009). Convex optimization (7th ed.). Cambridge, MA: Cambridge University Press.
Google Scholar
Boyle, R., Broadie, M., & Glasserman, P. (1997). Monte Carlo methods for security pricing. Journal of Economic Dynamics and Control, 21(8–9), 1267–1321.
Article Google Scholar
Breslin, J., Clewlow, L., Kwok, C., & Strickland, C. (2008). Gaining from complexity: MFMC models. Energy Risk, April, 60–64.
Brezis, H. (2010). Functional analysis, Sobolev spaces and partial differential equations. New York: Springer.
Book Google Scholar
Broadie, M., & Glasserman, P. (1996). Estimating security price derivatives using simulation. Management Science, 42(2), 269–285.
Article Google Scholar
Broadie, M., & Glasserman, P. (1997). Pricing American-style securities using simulation. Journal of Economic Dynamics and Control, 21(8–9), 1323–1352.
Article Google Scholar
Brown, S., & Yücel, M. (2008). What drives natural gas prices? Energy Journal, 29(2), 45–60.
Article Google Scholar
Carmona, R., & Touzi, N. (2008). Optimal multiple stopping and valuation of swing options. Mathematical Finance, 18(2), 239–268.
Article Google Scholar
Chen, N., & Liu, Y. (2014). American option sensitivities estimation via a generalized infinitesimal perturbation analysis approach. Operations Research, 62(3), 616–632.
Article Google Scholar
Chiarella, C., Clewlow, L., & Kang, B. (2009). Modelling and estimating the forward price curve in the energy market. Working Paper 260, Quantitative Finance Research Centre, University of Technology Sydney.
Clément, E., Lamberton, D., & Protter, P. (2002). An analysis of a least squares regression method for American option pricing. Finance and Stochastics, 6(4), 449–471.
Article Google Scholar
Clewlow, L., & Strickland, C. (1999a). A multi-factor model for energy derivatives. QFRC Research Paper Series 28, University of Technology Sydney.
Clewlow, L., & Strickland, C., (1999b). Valuing energy options in a one factor model fitted to forward prices. QFRC Research Paper Series 10, University of Technology Sydney.
Clewlow, L., & Strickland, C. (2000). Energy derivatives: Pricing and risk management. London: Lacima Publications.
Google Scholar
Cortazar, G., & Schwartz, E. (1994). The valuation of commodity contingent claims. Journal of Derivatives, 1(4), 27–39.
Article Google Scholar
Dahlgren, M. (2005). A continuous time model to price commodity-based swing options. Review of Derivatives Research, 8(1), 27–47.
Article Google Scholar
De Jong, C., (2006). The nature of power spikes: A regime-switch approach. Studies in Nonlinear Dynamics and Econometrics, 10(3), Article 3.
De Maeseneire, J. (2010). Estimating forward price curve in energy markets with MCMC method. Research Paper, University of Waterloo.
Deaton, A., & Laroque, G. (1992). On the behaviour of commodity prices. Review of Economic Studies, 59(1), 1–23.
Article Google Scholar
Efron, B., & Tibshirani, R. (1993). An introduction to the bootstrap. Boca Raton: Chapman & Hall.
Book Google Scholar
Eydeland, A., & Wolyniec, K. (2003). Energy and power risk management: New developments in modeling, pricing and hedging (Vol. 206). Hoboken, NJ: Wiley.
Google Scholar
Garman, M., & Barbieri, A. (1997). Ups and downs of swing. Energy Power Risk Management, 2(1).
Geman, H. (2005). Commodities and commodity derivatives: Modelling and pricing for agricultures, metals and energy. Chichester: Wiley.
Google Scholar
Gibson, R., & Schwartz, E. (1990). Stochastic convenience yield and the pricing of oil contingent claims. Journal of Finance, 45(3), 959–976.
Article Google Scholar
Glasserman, P. (2003). Monte Carlo methods in financial engineering. New York: Springer.
Book Google Scholar
Glasserman, P., & Yu, B. (2004). Number of paths versus number of basis functions in American option pricing. Annals of Applied Probability, 14(4), 2090–2119.
Google Scholar
Hambly, B., Howison, S., & Kluge, T. (2009). Modelling spikes and pricing swing options in electricity markets. Quantitative Finance, 9(8), 937–949.
Article Google Scholar
Haugh, M., & Kogan, L. (2004). Pricing American options: A duality approach. Operations Research, 52(2), 258–270.
Article Google Scholar
Hayashi, T., & Mykland, P. (2005). Evaluating hedging error: An asymptotic approach. Mathematical Finance, 15(1), 309–343.
Article Google Scholar
Hedestig, J. (2014). Pricing and hedging of swing options in the European electricity and gas markets. Student Paper, Lund University.
Hubbard, R., & Weiner, R. (1986). Regulation and long-term contracting US natural gas markets. Journal of Industrial Economics, 35(1), 71–79.
Article Google Scholar
Hull, J., & White, A. (1994). Numerical procedures for implementing term structure models i: Single factor models. Journal of Derivatives, 2(1), 7–16.
Article Google Scholar
Ibáñez, A., & Zapatero, F. (2004). Monte Carlo valuation of American options through computation of the optimal exercise frontier. Journal of Financial and Quantitative Analysis, 39(2), 253–275.
Article Google Scholar
Jaillet, J., Ronn, E., & Tompaidis, S. (2004). Valuation of commodity-based swing options. Management Science, 50(7), 909–921.
Article Google Scholar
Jamishidian, F. (1991). Commodity option valuation in the Gaussian futures term structure model. Review of Futures Markets, 10(2), 324–346.
Google Scholar
Jin, Y., & Jorion, P. (2006). Firm value and hedging: Evidence from US oil and gas producers. Journal of Finance, 61(2), 893–919.
Article Google Scholar
Johnson, R., & Wichern, D. (2007). Applied multivariate statistical analysis (6th ed.). Upper Saddle River, NJ: Prentice Hall.
Google Scholar
Joskow, P. (1985). Vertical integration and long-term contracts: The case of coal-burning electric generating plants. Journal of Law, Economics, & Organization, 1(1), 33–80.
Google Scholar
Joskow, P. (1987). Contract duration and relationship-specific investments: Empirical evidence from coal markets. American Economic Review, 77(1), 168–185.
Google Scholar
Kiesel, R., Gernhard, J., & Stoll, S. (2010). Valuation of commodity-based swing options. Journal of Energy Markets, 3(3), 91–112.
Article Google Scholar
Kiesel, R., Schindlmayr, G., & Börger, R. (2009). A two-factor model for the electricity forward market. Quantitative Finance, 9(3), 279–287.
Article Google Scholar
Klenke, A. (2007). Probability theory: A comprehensive course. London: Springer.
Google Scholar
Kohler, M. (2011). A review of regression-based Monte Carlo methods for pricing American options. In L. Devroye, B. Karasözen, M. Kohler, & R. Korn (Eds.), Recent developments in applied probability and statistics (pp. 37–58). Heidelberg: Physica.
Google Scholar
Kolb, R., & Overdahl, J. (Eds.). (2010). Financial derivatives: Pricing and risk management. Hoboken, NJ: Wiley.
Google Scholar
Kovacevic, R., Pflug, G., & Vespucci, M. (2013). Handbook of risk management in energy production and trading. New York: Springer.
Book Google Scholar
L’Ecuyer, P., (2007). Variance reduction’s greatest hits. In Proceedings of the 2007 European simulation and modeling conference (pp. 5–12). European Multidisciplinary Society for Modelling and Simulation Technology, Ostend.
L’Ecuyer, P., & Perron, G. (1994). On the convergence rates of IPA and FDC derivative estimators. Operations Research, 42(4), 643–656.
Article Google Scholar
Lioui, A., & Poncet, P. (2005). Dynamic asset allocation with forwards and futures. New York: Springer.
Google Scholar
Løland, A., & Lindqvist, O., (2008). Valuation of commodity-based swing options: A survey. Note SAMBA/38/80, Norwegian Computing Center.
Longstaff, F., & Schwartz, E. (2001). Valuing American options by simulation: A simple least-squares approach. Review of Financial Studies, 14(1), 113–147.
Article Google Scholar
MacAvoy, P. (2000). The natural gas market: Sixty years of regulation and deregulation. New Haven, London: Yale University Press.
Google Scholar
Maciejowska, K., & Weron, R., (2013). Forecasting of daily electricity spot prices by incorporating intra-day relationships: Evidence from the UK power market. In IEEE conference proceedings, 10th international conference in the European energy market (EEM).
Manoliu, M., & Tompaidis, S. (2002). Energy futures prices: Term structure models with Kalman filter estimation. Applied Mathematical Finance, 9(1), 21–43.
Article Google Scholar
McLeish, D. (2011). Monte Carlo simulation and finance. Hoboken, NJ: Wiley.
Google Scholar
Merton, R. (1973). Theory of rational option pricing. Bell Journal of Economics and Management Science, 4(1), 141–183.
Article Google Scholar
Musiela, M., & Rutkowski, M. (2004). Martingale methods in financial modelling (2nd ed.). Berlin: Springer.
Google Scholar
Pascucci, A. (2011). PDE and martingale methods in option pricing. Milano: Springer, Bocconi University Press.
Book Google Scholar
Pflug, G., & Broussev, N. (2009). Electricity swing options: Behavioral models and pricing. European Journal of Operational Research, 197(3), 1041–1050.
Article Google Scholar
Pilipovic, D. (2007). Energy risk: Valuing and managing energy derivatives (2nd ed.). New York: McGraw-Hill.
Google Scholar
Pilipovic, D., & Wengler, J. (1998). Getting into the swing. Energy Power Risk Management, 2(10).
Pinar, M. (2007). Robust scenario optimization based on downside-risk measure for mulit-period portfolio selection. OR Spectrum, 29(2), 295–309.
Article Google Scholar
Poterba, J. (1988). Mean reversion in stock prices: Evidence and implications. Journal of Financial Economics, 22(1), 27–59.
Article Google Scholar
Rockafellar, R., & Wets, R. (2009). Variational analysis (Vol. 317). Berlin: Springer.
Google Scholar
Rodríguez, R. (2008). Real option valuation of free destination in long-term liquefied natural gas supplies. Energy Economics, 30(4), 1909–1932.
Article Google Scholar
Rudin, W. (1964). Principles of mathematical analysis (3rd ed.). New York: McGraw-Hill.
Google Scholar
Schwartz, E. (1997). The stochastic behaviour of commodity prices: Implications for valuation and hedging. Journal of Finance, 52(3), 923–973.
Article Google Scholar
Schwartz, E., & Smith, J. (2000). Short-term variations and long-term dynamics in commodity prices. Management Science, 46(7), 893–911.
Article Google Scholar
Smith, C., & Zimmerman, J. (1976). Valuing employee stock option plans using option pricing models. Journal of Accounting Research, 14(2), 357–364.
Article Google Scholar
Thompson, A. (1995). Valuation of path-dependent contingent claims with multiple exercise decisions over time: The case of take-or-pay. Journal of Financial and Quantitative Analysis, 30(2), 271–293.
Article Google Scholar
van der Hoek, J., & Elliott, R. (2006). Binomial models in finance. New York: Springer.
Book Google Scholar
Vinga, S., (2004). Convolution integrals of normal distribution functions. Supplementary material to Vinga and Almeida (2004) “Rényi continuous entropy of DNA sequences”.
Wahab, M., & Lee, C. (2011). Pricing swing options with regime switching. Annals of Operations Research, 185(1), 139–160.
Article Google Scholar
Wahab, M., Yin, Z., & Edirisinghe, N. (2010). Pricing swing options in the electricity markets under regime-switching uncertainty. Quantitative Finance, 10(9), 975–994.
Article Google Scholar
Warin, X. (2012). Hedging swing contract on gas markets. Working Paper, Cornell University.

Download references

Acknowledgements

We thank the Verbundnetz Gas AG for supporting our research by supplying forward price data and by contributing important ideas for the design of the estimation methods developed in our article. We are also indebted to Max von Renesse, Ralf Wunderlich, Xaver Muschik, the editor, and an anonymous reviewer for valuable comments and suggestions. Generous financial support was provided by the Wissenschaftsförderung der Sparkassen-Finanzgruppe e.V.

Author information

Authors and Affiliations

Department of Finance, University of Leipzig, Grimmaische Str. 12, 04109, Leipzig, Germany
Hendrik Kohrs, Benjamin R. Auer & Frank Schuhmacher
Risk Management/Quantitative Analysis, VNG Handel & Vertrieb GmbH, Braunstr. 7, 04347, Leipzig, Germany
Hermann Mühlichen
Research Network Area Macro, Money and International Finance, CESifo Munich, Schackstr. 4, 80539, Munich, Germany
Benjamin R. Auer

Authors

Hendrik Kohrs
View author publications
You can also search for this author in PubMed Google Scholar
Hermann Mühlichen
View author publications
You can also search for this author in PubMed Google Scholar
Benjamin R. Auer
View author publications
You can also search for this author in PubMed Google Scholar
Frank Schuhmacher
View author publications
You can also search for this author in PubMed Google Scholar

Corresponding author

Correspondence to Benjamin R. Auer.

Appendices

Appendix

Proofs

Proof of Lemma 2.4

Let $f_{\mu ,\Sigma }$ denote the probability density function of a multivariate normal distribution with mean $\mu $ and covariance $\Sigma $. The unconditional probability density function $\varphi _Y$ of Y is

$$\begin{aligned} \varphi _Y(y)&= \int \varphi _{X,Y}(x,y) dx = \int \varphi _{Y}(y|X=x) \varphi _X(x) dx \\&= \int f_{\mu _y + x, \Sigma _y}(y) f_{\mu _x, \Sigma _x}(x) dx = \int f_{\mu _y, \Sigma _y}(y-x) f_{\mu _x, \Sigma _x}(x) dx \\&= (f_{\mu _x, \Sigma _x} *f_{\mu _y, \Sigma _y})(y) = f_{\mu _x + \mu _y, \Sigma _x + \Sigma _y}(y), \end{aligned}$$

where $*$ denotes the convolution operator (see Klenke 2007, p. 277). For a detailed verification of the last equation, see Vinga (2004). $\square $

Proof of Proposition 2.5

With $\varepsilon _n \sim \mathcal {N}(0, \sigma _\varepsilon ^2 I)$, we have $\hat{a}_n | a_n \sim \mathcal {N}(a_n, \sigma _\varepsilon ^2 (X' X)^{-1})$ (see Johnson and Wichern 2007, p. 370). Under $\mathbb {P}_\Sigma $, we have $a_n \sim \mathcal {N}(\eta ,\Sigma )$ by definition. Thus, Lemma 2.4 implies

$$\begin{aligned} \hat{a}_n \sim \mathcal {N}(\eta , \Sigma + \sigma _\varepsilon ^2 (X' X)^{-1}). \end{aligned}$$

Considering the independence assumptions, the proposition follows by the definition of the Wishart distribution (see Johnson and Wichern 2007, p. 174). $\square $

Proof of Proposition 3.1

$\mathcal {Q}$, taking values in $[0,1]^T $ and being constrained by (3.3), is a bounded and closed subset of $\mathcal {L}^2 \cap \mathcal {L}^\infty $. Using the Banach-Alaoglu theorem (see, Brezis 2010, Theorem 3.16), we can deduce that it is a weak compact subset of $\mathcal {L}^2$ (see Bonnans et al. 2012). The objective function $\mathcal {J}: \mathcal {Q}\rightarrow \mathbb {R}$, defined by

$$\begin{aligned} \mathcal {J}(q) = \mathbb {E}\left[ \sum _{t=0}^{T -1} f_t(S_t) \cdot q_t \right] , \end{aligned}$$

is weakly continuous, such that the image of the compact space $\mathcal {Q}$ under the continuous mapping $\mathcal {J}$ is also compact. Because $\mathcal {J}(\mathcal {Q})$ is a closed and bounded subset of $\mathbb {R}$, the function $\mathcal {J}$ reaches its supremum. $\square $

Proof of Lemma 3.3

(based on, Bardou et al. 2010, Property P3) We only prove the concavity property because piecewise affinity can be verified by backward induction similar to the proof of Proposition 3.4, which handles the case $Q_{\min }, Q_{\max }\in \mathbb {N}_0$.

For fixed $t \in \{0,...,T -1\}$, let $Q_t, Q'_t \in {[}\underline{Q}_t, \overline{Q}_t]$, $\lambda \in [0,1]$ and $F_t$ be given. Additionally, let $(q_t) \in \mathcal {Q}$ be a feasible policy reached with $\sum _{s=0}^{t-1} q_t = Q_t$ (almost surely) and

$$\begin{aligned} \mathbb {E}\left[ \sum _{s=t}^{T -1} f_s(S_s) q_s \right] = \mathfrak {V}_{t}(Q_{t}, F_{t}), \end{aligned}$$

and let $q' \in \mathcal {Q}$ be a feasible policy with $\sum _{s=0}^{t-1} q'_t = Q'_t$ and

$$\begin{aligned} \mathbb {E}\left[ \sum _{s=t}^{T -1} f_s(S_s) q'_s \right] = \mathfrak {V}_{t}(Q'_{t}, F_{t}). \end{aligned}$$

Note that, since q and $q'$ are both [0, 1]-valued, $\lambda q + (1-\lambda ) q' := (\lambda q_t + (1-\lambda ) q'_t)_{0\le t \le T -1}$ is also [0, 1]-valued. Furthermore, $\lambda q + (1-\lambda ) q'$ reaches $\lambda Q_t + (1-\lambda ) Q'_t$ on day t and fulfills the global constraints, such that it is a feasible policy. We have

$$\begin{aligned}&\lambda \mathfrak {V}_{t}(Q_{t}, F_{t}) + (1-\lambda ) \mathfrak {V}_{t}(Q'_t, F_{t}) \\&\quad = \lambda \mathbb {E}\left[ \sum _{s=t}^{T -1} f_s(S_s) q_s\right] + (1-\lambda ) \mathbb {E}\left[ \sum _{s=t}^{T -1} f_s(S_s) q'_s\right] \\&\quad = \mathbb {E}\left[ \sum _{s=t}^{T -1} f_s(S_s) (\lambda q_s + (1-\lambda ) q'_s)\right] \\&\quad \le \mathfrak {V}_{t}(\lambda Q_{t} + (1-\lambda ) Q'_t, F_{t}), \end{aligned}$$

i.e., $\mathfrak {V}_{t}(Q_{t}, F_{t})$ is concave in $Q_t$. $\square $

Proof of Proposition 3.4

^{Footnote 57} For $Q_t \in [\underline{Q}_t, \overline{Q}_t] \cap \mathbb {N}_0$, we have $\underline{q}_t, \overline{q}_t \in \mathbb {N}_0$. We show that, in this case, the mapping

$$\begin{aligned} q_t \mapsto f_t(S_t) \cdot q_t + \mathbb {E}[\mathfrak {V}_{t+1}(Q_{t+1}, F_{t+1})|\mathcal {F}_t] \end{aligned}$$

(A.1)

is affine on $[\underline{q}_t, \overline{q}_t]$ for all $t\in \{0,...,T -1\}$ and $F_t\in \mathbb {R}^{T -t}_+$. Without loss of generality, we assume $Q_{t} \ge Q_{\min } - (T-t-1)$ and $Q_{t} \le Q_{\max }-1$, such that $[\underline{q}_t, \overline{q}_t]=[0,1]$. We proceed by backward induction on t.

Basis For $t = T - 1$, we have to consider $Q_{T -1} \in [\underline{Q}_{T -1}, \overline{Q}_{T -1}]$ and $q_{T -1} \in [\underline{q}_{T -1}, \overline{q}_{T -1}]$. Under these constraints, we have $Q_{\min } \le Q_T \le Q_{\max }$ such that mapping (A.1) becomes

$$\begin{aligned} q_{T -1} \mapsto f_{T -1}(S_{T -1}) \cdot q_{T -1}, \end{aligned}$$

which is linear.

Induction hypothesis Assume that, for some $t \in \{1,...,T -1\}$, we have: For all $Q_t \in [\underline{Q}_t, \overline{Q}_t]$ with $Q_t\in \mathbb {N}_0$ and all $F_t\in \mathbb {R}^{T -t}_+$, the mapping

$$\begin{aligned} q_t \mapsto f_t(S_t) \cdot q_t + \mathbb {E}[\mathfrak {V}_{t+1}(Q_t + q_t, F_{t+1})|\mathcal {F}_t] \end{aligned}$$

is affine on $[\underline{q}_t, \overline{q}_t]$.

Inductive step Now we move from t to $t-1$. Let $Q_{t-1} \in [0 \vee (Q_{\min } - (T-t)), (Q_{\max }-1) \wedge (t-1)] \cap \mathbb {N}_0$. Thus, we have to show the affinity of

$$\begin{aligned} q_{t-1} \mapsto&\ f_{t-1}(S_{t-1}) \cdot q_{t-1} + \mathbb {E}[\mathfrak {V}_{t}(Q_{t-1}+q_{t-1}, F_{t})|\mathcal {F}_{t-1}] \\&\quad = \ f_{t-1}(S_{t-1}) \cdot q_{t-1} + \mathbb {E}[\max _{q_t \in [0,1]} f_t(S_t) \cdot q_t\\&\qquad + \mathbb {E}[\mathfrak {V}_{t+1}(Q_{t-1}+q_{t-1}+q_t, F_{t+1})|\mathcal {F}_t]|\mathcal {F}_{t-1}] \end{aligned}$$

on [0, 1]. For $q_{t-1}$ and $F_t$ both fixed, we look closer at

$$\begin{aligned} q_t \mapsto f_t(S_t) \cdot q_t + \mathbb {E}[\mathfrak {V}_{t+1}(Q_{t-1}+q_{t-1}+q_t, F_{t+1})|\mathcal {F}_t] \end{aligned}$$

(A.2)

on $[\underline{q}_{t},\overline{q}_{t}]$. Note that $1-q_{t-1} \in [\underline{q}_{t},\overline{q}_{t}]$ because $Q_{\min } - (T-t) \le Q_{t-1} \le Q_{\max }-1$. We use the induction hypothesis to conclude that (A.2) is piecewise affine; more precisely, affine on $[\underline{q}_{t}, 1-q_{t-1}]$ and $[1-q_{t-1}, \overline{q}_{t}]$.

If $q_t \in [\underline{q}_{t}, 1-q_{t-1}]$, we have $q_{t-1}+q_t \le 1$. We define $\tilde{Q}_t := Q_{t-1}$. Since $\tilde{Q}_t \in [\underline{Q}_t, \overline{Q}_t] \cap \mathbb {N}_0$, the mapping

$$\begin{aligned} \tilde{q}_t \mapsto f_t(S_t) \cdot \tilde{q}_t + \mathbb {E}[\mathfrak {V}_{t+1}(\tilde{Q}_t+\tilde{q}_t, F_{t+1})|\mathcal {F}_t] \end{aligned}$$

(A.3)

is affine on $[0 \vee (Q_{\min }-Q_{t-1}-(T-t-1)), 1]$ by induction hypothesis. Setting $\tilde{q}_t = q_{t-1}+q_t$ then implies that mapping (A.2) is affine on $[\underline{q}_{t}, 1-q_{t-1}]$.

At the same time, we have $q_{t-1}+q_t \ge 1$ if $q_t \in [1-q_{t-1}, \overline{q}_{t}]$. We now set $\tilde{Q}_t := Q_{t-1} + 1 \in [\underline{Q}_t, \overline{Q}_t] \cap \mathbb {N}_0$. Again, the induction hypothesis states that (A.3) is affine on $[0, 1 \wedge (Q_{\max }-Q_{t-1}-1)]$, which implies that mapping (A.2) is affine on $[1-q_{t-1}, \overline{q}_{t}]$ by setting $\tilde{q}_t = q_{t-1}+q_t-1$.

It follows that (A.2) is piecewise affine with monotonicity breakpoint at $1-q_{t-1}$ such that the mapping reaches its maximum at the interval endpoints $\underline{q}_{t}, \overline{q}_{t}$ or at its monotonicity breakpoint. We have

$$\begin{aligned} \mathfrak {V}_{t}(Q_{t-1} + q_{t-1}, F_{t}) =&\max _{q_t\in \{\underline{q}_{t}, 1-q_{t-1}, \overline{q}_{t}\}} f_t(S_t) \cdot q_t \\&+ \mathbb {E}[\mathfrak {V}_{t+1}(Q_{t-1}+q_{t-1}+q_t, F_{t+1})|\mathcal {F}_t]. \end{aligned}$$

For fixed $F_{t}$, we consider the mappings

$$\begin{aligned} q_{t-1} \mapsto f_t(S_t) \cdot q_t + \mathbb {E}[\mathfrak {V}_{t+1}(Q_{t-1}+q_{t-1}+q_t, F_{t+1})|\mathcal {F}_t] \end{aligned}$$

(A.4)

on [0, 1] for $q_t \in \{\underline{q}_{t}, 1-q_{t-1}, \overline{q}_{t}\}$.

For $q_t = 1-q_{t-1}$, (A.4) becomes

$$\begin{aligned} q_{t-1} \mapsto f_t(S_t) \cdot (1-q_{t-1}) + \mathbb {E}[\mathfrak {V}_{t+1}(Q_{t-1}+1, F_{t+1})|\mathcal {F}_t], \end{aligned}$$

which is affine.

For $q_t = \underline{q}_{t} = 0 \vee (Q_{\min }-Q_{t-1}-q_{t-1}-(T-t-1))$, note that $Q_{\min }-Q_{t-1}-(T-t-1) \in \mathbb {Z}$, such that we have either $\underline{q}_{t} = 0$ or $\underline{q}_{t} = Q_{\min }-Q_{t-1}-q_{t-1}-(T-t-1) \ge 0$ on the entire interval $q_{t-1}\in [0,1]$. If $\underline{q}_{t} = 0$, (A.4) becomes

$$\begin{aligned} q_{t-1} \mapsto \mathbb {E}[\mathfrak {V}_{t+1}(Q_{t-1}+q_{t-1}, F_{t+1})|\mathcal {F}_t], \end{aligned}$$

which is affine by induction hypothesis (setting $\tilde{Q}_t := Q_{t-1} \in \mathbb {N}_0$). Otherwise, we have

$$\begin{aligned} q_{t-1} \mapsto&f_t(S_t) \cdot (Q_{\min }-Q_{t-1}-q_{t-1}-(T-t-1)) \\&+ \mathbb {E}[\mathfrak {V}_{t+1}(Q_{\min }-(T-t-1), F_{t+1})|\mathcal {F}_t], \end{aligned}$$

which is also affine. For $q_t = \underline{q}_{t} = 1 \wedge (Q_{\max } - Q_{t-1} - q_{t-1})$, we can similarly verify the affinity of (A.4).

It follows that $q_{t-1} \mapsto \mathfrak {V}_{t}(Q_{t-1} + q_{t-1}, F_{t})$ is convex as a pointwise maximum of affine mappings on [0, 1]. It is affine because we know from Lemma 3.3 that it is also concave. Taking conditional expectations with respect to $\mathcal {F}_{t-1}$ and adding $f_{t-1}(S_{t-1}) \cdot q_{t-1}$ completes the inductive step.

Starting at $t=0$ with $Q_0 = 0 \in \mathbb {N}_0$, we conclude by forward induction that taking $q_t$ from $\{\underline{q}_t, \overline{q}_t\} \subseteq \{0,1\}$ is always one optimal choice because (A.1) is affine in the case of $Q_t \in \mathbb {N}_0$. Hence, we can optimally exercise in digital fashion. $\square $

Proof of Corollary 3.6

(based on, Bonnans et al. 2012, Corollary 4.4) Because $\phi (x)$ is locally Lipschitz continuous by Danskin’s theorem, Rademacher’s theorem (see, Rockafellar and Wets 2009, Theorem 9.60) states that it is totally differentiable almost everywhere.

Now, let $x \in \mathbb {R}^n$ and $\phi (x)$ be totally differentiable at x. Assume Eq. (3.11) does not hold, i.e., there exist $v_1^*, v_2^* \in V^*(x)$ such that ${{\mathrm{d}}}_x \psi (x, v_1^*)\not ={{\mathrm{d}}}_x \psi (x, v_2^*)$. That is, the corresponding gradients differ in at least one entry with index i. Consider the direction $h = (0,...,0,1,0,...0)$ where 1 is placed at position i. With Eq. (3.9), we then have

$$\begin{aligned} \nabla _{h} \phi (x) \not = -(\nabla _{-h} \phi (x)). \end{aligned}$$

Thus, $\phi (x)$ is not totally differentiable at x and we have a contradiction. $\square $

Proof of Lemma 3.7

(based on, Bonnans et al. 2012, Lemma 4.1 and Proposition 4.2) In the first step, we show that, for every t, the partial derivative of $\mathcal {J}$ with respect to F(0, t) exists. We define $g(F(0,t), \omega ) := f_t(S_t(F(0,t), \omega ))\cdot q_t$. Let $x\in \mathbb {R}_+$ and $(x_n)$ be a sequence in $\mathbb {R}_+$ with $x_n \not = x$ for every $n\in \mathbb {N}$ and $\lim _{n\rightarrow \infty } x_n = x$. We now show that the corresponding sequence of difference quotients converges. To this end, set

$$\begin{aligned} \Delta g_n(\omega ) := \frac{g(x_n,\omega )-g(x,\omega )}{x_n - x} \end{aligned}$$

for every $\omega \in \Omega $. In view of the Lipschitz continuity of $f_t$, Rademacher’s theorem (see, Rockafellar and Wets 2009, Theorem 9.60) states that it is differentiable almost everywhere and that its derivative is bounded. Combined with the partial differentiability of $S_t$ in F(0, t), we know that $g(\cdot ,\omega )$ is differentiable for almost every $\omega $ such that

$$\begin{aligned} \lim _{n\rightarrow \infty }\Delta g_n = g'(x,\cdot ) \end{aligned}$$

almost surely. According to the mean value theorem (see, Rudin 1964, Theorem 5.10), for every $n \in \mathbb {N}$ and almost every $\omega \in \Omega $, there exists $y_n(\omega )\in \mathbb {R}_+$ such that $\Delta g_n(\omega )= g'(y_n(\omega ),\omega )$. In particular, $g_n$ is bounded almost everywhere for every n. Since $g(x,\cdot )$ is integrable for every x, we can apply the dominated convergence theorem (see, Klenke 2007, Corollary 6.26) to see that $g'(x, \cdot )$ is integrable and

$$\begin{aligned} \lim _{n\rightarrow \infty } \frac{\mathbb {E}[g(x_n)]-\mathbb {E}[g(x)]}{x_n-x} = \lim _{n\rightarrow \infty } \mathbb {E}[\Delta g_n] = \mathbb {E}[g'(x)]. \end{aligned}$$

Rewritten in our original terms, we have

$$\begin{aligned} \frac{\partial \mathcal {J}(F_0, q)}{\partial F(0,t)} = \frac{\partial \mathbb {E}[f_t(S_t(F_0)) \cdot q_t]}{\partial F(0,t)} = \mathbb {E}\left[ \frac{\partial f_t(S_t)}{\partial F(0,t)}(F_0) \cdot q_t\right] . \end{aligned}$$

In the second step, we verify the continuity of the partial derivatives. Again, let $x^*\in \mathbb {R}_+$ and $(x_n)$ be a sequence in $\mathbb {R}_+$ with $x_n \not = x^*$ for every $n\in \mathbb {N}$ and $\lim _{n\rightarrow \infty } x_n = x^*$. Furthermore, denote $\tilde{g}(F(0,t)) := S_t(F(0,t))$ and $S_t^n := \tilde{g}(x_n)$, $S_t^* := \tilde{g}(x^*)$. We have to show that

$$\begin{aligned} \lim _{n\rightarrow \infty }\frac{\partial f_t(S^n_t)}{\partial F(0,t)} = \frac{\partial f_t(S^*_t)}{\partial F(0,t)} \end{aligned}$$

in $\mathcal {L}^1$. By the chain rule of differentiation, at the points where $f_t$ is differentiable, we have

$$\begin{aligned} \frac{\partial f_t(S_t)}{\partial F(0,t)} = \frac{\partial f_t(S_t)}{\partial S_t} \cdot \frac{\partial S_t}{\partial F(0,t)}. \end{aligned}$$

Denote $f_{\partial x} := \frac{\partial f}{\partial x}$. By assumption, the partial derivatives of $S_t^*$ and $S_t^n$ both have bounded density functions $\varphi _t^*$ and $\varphi _t^n$ with respect to the Lebesgue measure. Since $f_t$ is Lipschitz continuous, $\partial _{S_t} f_t \in \mathcal {L}^\infty (\Omega ,\mathcal {F},\mathbb {P}) \subseteq \mathcal {L}^q(\Omega ,\mathcal {F},\mathbb {P})$, $1\le q < \infty $. Because the space of continuous functions with compact support $C_c(\Omega )$ is dense in $\mathcal {L}^q(\Omega ,\mathcal {F},\mathbb {P})$, we have

$$\begin{aligned} \forall \varepsilon >0 \ \exists h: ||h-\partial _{S_t} f_t||_q \le \varepsilon . \end{aligned}$$

It follows that

$$\begin{aligned} ||h(S_t^n)-\partial _{S_t} f_t(S_t^n)||_q&= \left( \int _s |h(s)-\partial _s f_t(s)|^q \varphi _t^n(s) ds\right) ^{1/q} \\&\le ||\varphi _t^n||_\infty ^{1/q} \cdot ||h-\partial _{S_t} f_t||_q \\&\le ||\varphi _t^n||_\infty ^{1/q} \cdot \varepsilon \end{aligned}$$

and analogously

$$\begin{aligned} ||h(S_t^*)-\partial _{S_t} f_t(S_t^*)||_q \le ||\varphi _t^*||_\infty ^{1/q} \cdot \varepsilon . \end{aligned}$$

Moreover, $\lim _{n\rightarrow \infty } S_t^n = S_t^*$ almost surely and $\lim _{n\rightarrow \infty } \partial _{F(0,t)} S_t^n = \partial _{F(0,t)} S_t^*$ in $\mathcal {L}^p$, $1\le p<\infty $. By continuity of h and the dominated convergence theorem (see, Klenke 2007, Corollary 6.26), there exists a $N_h$ such that

$$\begin{aligned} ||h(S_t^n) - h(S_t^*)||_q \le \varepsilon \end{aligned}$$

for all $n\ge N_h$. Thus, we have

$$\begin{aligned}&||\partial _{S_t} f_t(S_t^n) - \partial _{S_t} f_t(S_t^*)||_q \\ \le \,&||\partial _{S_t} f_t(S_t^n) - h(S_t^n)||_q + ||h(S_t^n) - h(S_t^*)||_q + ||h(S_t^*) - \partial _{S_t} f_t(S_t^*)||_q \\ \le \,&(||\varphi _t^n||_\infty ^{1/q} + ||\varphi _t^*||_\infty ^{1/q}+ 1) \varepsilon \end{aligned}$$

for all $n\ge N_h$. Let $\varepsilon \rightarrow 0$. Then, we obtain $\lim _{n\rightarrow \infty } \partial _{S_t} f_t(S_t^n) = \partial _{S_t} f_t(S_t^*)$ in $\mathcal {L}^q$. Finally, taking $p^{-1}+q^{-1}=1$, $\lim _{n\rightarrow \infty } \partial _{F(0,t)} f_t(S_t^n) = \partial _{F(0,t)} f_t(S_t^*)$ in $\mathcal {L}^1$ follows by combining the convergence of $\partial _{S_t} f_t(S_t)$ in $\mathcal {L}^p$ and the convergence of $\partial _{F(0,t)} S_t$ in $\mathcal {L}^q$.

J having continuous partial derivatives is totally differentiable and its derivative is given by (3.12). In this perspective, $J'$ is weakly continuous with respect to q. $\square $

Proof of Proposition 3.8

To apply Danskin’s theorem (Theorem 3.5), we have to verify its assumptions. In view of Lemma 3.7, we only have to recall that the feasible solution space $\mathcal {Q}$ is weakly compact as argued in the proof of Proposition 3.1. Hence, Theorem 3.5 or rather Corollary 3.6 provides the result. $\square $

Proof of Proposition 3.9

We show by backward induction that $F_t \mapsto \mathfrak {V}_t(Q_t, F_t)$ is convex on $\mathbb {R}^{T -t}_+$ for every $t \in \{0,...,T -1\}$ and every valid quantity $Q_t$. Note that, for every t, we have $F(t,t) = S_t$ such that $F_t \mapsto f_t(S_t(F_t))$ is convex by assumption. For $t=T -1$ and $Q_{T -1} \in [\underline{Q}_{T -1}, \overline{Q}_{T -1}]$, we have

$$\begin{aligned} \mathfrak {V}_{T -1}(Q_{T -1}, F_{T -1})&= \max _{q_{T -1} \in [0,1]} f_{T -1}(S_{T -1}) \cdot q_{T -1} + \mathbb {E}[\mathfrak {V}_T (Q_T , F_ T )|\mathcal {F}_{T -1}] \\&= \max _{q_{T -1} \in [\underline{q}_{T -1},\overline{q}_{T -1}]} f_{T -1}(S_{T -1}) \cdot q_{T -1} , \end{aligned}$$

which, as a pointwise maximum over a set of convex functions, is convex in $F_{T -1}$.^{Footnote 58}

We now move from t to $t-1$. Let $Q_{t-1} \in [\underline{Q}_{t-1}, \overline{Q}_{t-1}]$. By induction assumption, $F_{t} \mapsto \mathfrak {V}_{t}(Q_{t-1} + q_{t-1}, F_{t})$ is convex for every $q_{t-1} \in [\underline{q}_{t-1}, \overline{q}_{t-1}]$. For fixed $\Delta W(t-1)$, the function $g_{t-1}(F_{t-1}, \Delta W(t-1))$ is affine in $F_{t-1}$ by assumption, such that the mapping $F_{t-1} \mapsto \mathfrak {V}_{t}(Q_{t-1} + q_{t-1}, g_{t-1}(F_{t-1}, \Delta W(t-1)))$ is convex. Denote by $\varphi _{t-1}$ the density function of $\Delta W_{t-1}$. Then, it follows that

$$\begin{aligned}&F_{t-1} \mapsto \mathbb {E}[\mathfrak {V}_{t}(Q_{t-1} + q_{t-1}, F_{t})|\mathcal {F}_{t-1}] \\&\quad = \int _w \mathfrak {V}_{t}(Q_{t-1} + q_{t-1}, g_{t-1}(F_{t-1}, w)) \varphi _{t-1}(w) dw \end{aligned}$$

is convex. Since $F_{t-1} \mapsto f_{t-1}(S_{t-1})$ is convex by assumption and taking the pointwise maximum or adding functions of this kind are operations preserving convexity,

$$\begin{aligned} \mathfrak {V}_{t-1}(Q_{t-1}, F_{t-1})&= \max _{q_{t-1} \in [\underline{q}_{t-1},\overline{q}_{t-1}]} f_{t-1}(S_{t-1}) \cdot q_{t-1} \\&\quad + \mathbb {E}[\mathfrak {V}_{t}(Q_{t-1}+q_{t-1}, F_{t})|\mathcal {F}_{t-1}] \end{aligned}$$

is convex, which concludes the induction step.

Because $v: \mathbb {R}^T \rightarrow \mathbb {R}$ is convex, the theorem of Alexandrov (1939) ensures the twice totally differentiability almost everywhere. $\square $

Additional results

The following tables and figures present results extending our main analysis. They are ordered according to their line of occurrence/reference in the main text (Tables 9, 10, 11, 12 and Figs. 12, 13, 14, 15, 16).

Table 9 Single-factor model

Full size table

Table 10 Principle component analysis

Full size table

Table 11 Long-term gamma estimates for the benchmark approach with sample size $N = 5000$

Full size table

Table 12 Benchmark approach with additive price shifts

Full size table

Additional derivations and discussions

Derivation of drift term (2.26) Based on the solution (2.21) of the spot price process in the multi-factor framework, we obtain

$$\begin{aligned} \log S_t = \log F(0,t) + \sum _{k=1}^K \left( -\frac{1}{2} \int _0^t \sigma _k^2(s,t) ds + \int _0^t \sigma _k(s,t) dW_k(s) \right) , \end{aligned}$$

which implies

$$\begin{aligned} \int _0^t \sigma _2(s,t) dW_2(s)=&\log S_t - \log F(0,t) + \frac{1}{2} \sum _{k=1}^K \int _0^t \sigma _k^2(s,t) ds \\&- \sum _{k\not = 2} \int _0^t \sigma _k(s,t) dW_k(s). \end{aligned}$$

It follows that

$$\begin{aligned}&\sum _{k=1}^K \left( - \int _0^t \sigma _k(s,t) \frac{\partial \sigma _k(s,t)}{\partial t} ds + \int _0^t \frac{\partial \sigma _k(s,t)}{\partial t} dW_k(s) \right) \\&\quad = \sum _{k=2}^K \left( -\! \int _0^t \sigma _k(s,t) \frac{\partial \sigma _k(s,t)}{\partial t} ds\right) \! -\! \vartheta \int _0^t \sigma _2(s,t) dW_2(s) +\! \int _0^t \frac{\partial \sigma _3(s,t)}{\partial t} dW_3(s) \\&\quad = \sum _{k=2}^K \left( - \int _0^t \sigma _k(s,t) \frac{\partial \sigma _k(s,t)}{\partial t} ds\right) + \int _0^t \frac{\partial \sigma _3(s,t)}{\partial t} dW_3(s) \\&\qquad - \vartheta \left( \log S_t - \log F(0,t) + \frac{1}{2} \sum _{k=1}^K \int _0^t \sigma _k^2(s,t) ds - \sum _{k\not = 2} \int _0^t \sigma _k(s,t) dW_k(s)\right) \\&\quad = \frac{1}{2} \vartheta \sigma _2^2 \int _0^t \exp (-2\vartheta (t-s)) ds + a \sigma _3^2 \cos (a(t + c)) \sin (a(t + c)) t \\&\qquad - a \sigma _3 \sin (a(t + c)) W_3(t) - \vartheta \\&\qquad \left( \log S_t - \log F(0,t) + \frac{1}{2} \left( \sigma _1^2 t + \sigma _3^2 \cos ^2(a(t+c)) t \right) \right. \\&\qquad \left. - \sigma _1 W_1(t) - \sigma _3 \cos (a(t + c)) W_3(t)\right) . \end{aligned}$$

With $\int _0^t \exp (-2\vartheta (t-s)) ds = \frac{1}{2\vartheta }(1-\exp (-2\vartheta t))$, we get

$$\begin{aligned} \mu (t) = \frac{\partial F(0,t)}{\partial t} + \mu _{\text {seas}}(t) + \vartheta \left( \log \mu _{\text {lt}}(t) + \frac{\sigma _2^2}{4 \vartheta } (1-e^{-2\vartheta t}) - \log (S_t)\right) , \end{aligned}$$

where

$$\begin{aligned} \mu _{\text {seas}}(t)&= a \sigma _3^2 \cos (a(t + c)) \sin (a(t + c)) t - a \sigma _3 \sin (a(t + c)) W_3(t), \\ \mu _{\text {lt}}(t)&= F(0,t) \exp \left( - \frac{1}{2} \sigma _1^2 t + \sigma _1 W_1(t) - \frac{1}{2} \sigma _3^2 \cos ^2(a(t+c)) \right. \\&\quad \left. +\, \sigma _3 \cos (a(t+c)) W_3(t)\right) . \end{aligned}$$

Thus we have obtained the drift term (2.26). $\square $

Error propagation Uncertainty in continuation values (emerging from, for example, approximation error in the LSM) directly influences the exercise strategy and thus fair value and sensitivity estimates. To analyse the impact of such uncertainty in general, we perform a simple simulation study. That is, in our benchmark setting with three state variables and $N = 1000$, we set $^\epsilon \mathcal {C}_t^{r} = \mathcal {C}_t^{r} + \epsilon _t^r$, where $\epsilon _t^r$ are i.i.d. standard normal random variables.^{Footnote 59} Table 13 reports the consequences of this procedure for our estimates, e.g., the change in the option value $\Delta v := \,^\varepsilon v-v$. Besides changes in fair values, pathwise deltas and TFD long-term gammas, we present the changes in the expected consumption quantity $\mathbb {E}[q^*] := \mathbb {E}[\sum _{t=0}^{T -1}q^*_t]$. As expected, the value estimate is reduced in all cases. In most cases, expected consumption and deltas, which are closely related, also decrease. Gamma changes occur in both directions.

Different random seeds In our analysis, we fix the random seed for two reasons. First, we ensure that the value, delta and gamma estimates correspond to each other because they are calculated based on the same price paths. Second, we opt for common random numbers because they are a variance reduction technique typically used when estimates are drawn from more than one simulated scenario. While finite differences derivative estimates are naturally biased (see, Glasserman 2003, chpt. 7.1), this dependent sampling method does not introduce additional bias.^{Footnote 60}

To illustrate the variance reduction, we consider the forward difference of $\delta $, i.e.,

$$\begin{aligned} \text {Var}(v(F_0 + \Delta F_0) - v(F_0)) = {\left\{ \begin{array}{ll}\mathcal {O}(1), &{} \text {in case (i),} \\ \mathcal {O}(\varepsilon ), &{} \text {in case (ii),} \end{array}\right. } \end{aligned}$$

where $\Delta F_0 = \varepsilon \cdot dF_0$. Case (i) reflects the independent (with different random seeds) simulation of $v(F_0)$ and $v(F_0 + \Delta F_0)$. We then have

$$\begin{aligned} \text {Var}(v(F_0 + \Delta F_0) - v(F_0)) \rightarrow 2 \text {Var}(v(F_0)) \end{aligned}$$

under the assumption that $\text {Var}(v(F_0))$ is continuous in $\varepsilon $. Consequently, the variance of the estimator $(v(F_0 + \Delta F_0) - v(F_0))/\varepsilon $ explodes (converges to infinity at rate $1/\varepsilon ^2$) for $\varepsilon \rightarrow 0$ and there is no hope of obtaining reasonable sensitivity estimates. Case (ii) occurs (under some mild conditions) when common random numbers are used for the simulations at $F_0$ and $F_0 + \Delta F_0$ (see L’Ecuyer and Perron 1994). Here, the overall variance of the derivative estimator stays bounded for $\varepsilon \rightarrow 0$. These considerations similarly hold for the regression delta estimates and for our second-order sensitivity estimates.

Despite this advantage, using (fixed or variable) random numbers always introduces a Monte Carlo error. To quantify this error, we analyse the distributions of our estimators for different random seeds. Specifically, we obtain the sampling distributions of the fair value, pathwise delta and TFD long-term gamma estimators (of our benchmark approach) using $S = 1000$ distinct seeds. All other settings remain unchanged. That is, the Monte Carlo error corresponds to $N = 1000$ simulated price paths. Figure 17 presents our results for the contracts with $Q_{\min } = 60\cdot i$ and $Q_{\max } = 180$, $i = 0,1,2,3$.^{Footnote 61}

Table 13 Error propagation

Full size table

Because of our low number of $N = 1000$ simulated price paths, we can observe non-negligible Monte Carlo error. The standard deviation of our estimators is naturally related to the magnitude of the estimated sensitivity. For example, the pathwise delta estimator of the inflexible contract with $Q_{\min } = 180$ has a very low standard error because the option’s gamma is nearly zero. To obtain more precise estimates and more narrow distributions, we could increase N, which significantly increases computation time. However, as pointed out in Sect. 4.2, we can illustrate all features and the proper functioning of our approach based on this sample size.

Optimal bandwidth choice We assume that v is three times continuous (directional) differentiable in a neighborhood of $F_0$. We elaborate the estimation procedure for the overall gamma in the forwards approach, i.e., for the second-order derivative of v in the direction $d F_0 = (1,...,1)'$. We denote by $v', v''$ etc. the derivatives in this direction.

By Taylor expansion, we have $\delta (F_0 + \Delta F_0) = \delta (F_0) + \varepsilon _N \delta '(F_0) + \frac{1}{2} \varepsilon _N^2 \delta ''(F_0 + \xi dF_0)$, where $\xi \in [0, \varepsilon _N]$. Thus, for the estimator’s bias, we get

$$\begin{aligned} \mathbb {E}[\hat{\gamma }] - \gamma&= \mathbb {E}\left[ \sum _{t=0}^{T -1} \frac{\partial S_t}{\partial F(0, t)} \cdot \frac{\Delta q^*_t(F_0)}{\varepsilon _N}\right] - \gamma \\&= \frac{\delta (F_0 + \Delta F_0) - \delta (F_0)}{\varepsilon _N} - \delta '(F_0) \\&= \frac{\varepsilon _N^2}{2} \delta ''(F_0 + \xi dF_0) \\&= \frac{\varepsilon _N^2}{2} (\delta ''(F_0) + o(1)) \end{aligned}$$

as $N\rightarrow \infty $. We write $y_n = o(x_n)$, if $\lim _{n\rightarrow \infty } |\frac{y_n}{x_n}|=0$, and $y_n = \mathcal {O}(x_n)$, if $\limsup _{n\rightarrow \infty } |\frac{y_n}{x_n}|<\infty $. For the variance of the estimator, we have

$$\begin{aligned} \text {Var}(\hat{\gamma })&= \text {Var}\left( \frac{1}{N} \sum _{n=1}^N \sum _{t=0}^{T -1} \frac{\partial \, ^nS_t}{\partial F(0, t)} \cdot \frac{\Delta ^n q^*_t(F_0)}{\varepsilon _N}\right) \\&= \frac{1}{N\varepsilon _N^2} \text {Var}\left( \sum _{t=0}^{T -1} \frac{\partial S_t}{\partial F(0, t)} \cdot \Delta q^*_t(F_0)\right) \\&=: \frac{1}{N\varepsilon _N^2} \mathcal {V}_\gamma (\varepsilon _N). \end{aligned}$$

Furthermore, we define

$$\begin{aligned} X := \sum _{t=0}^{T -1} \frac{\partial S_t}{\partial F(0, t)} \cdot \Delta q^*_t(F_0). \end{aligned}$$

Because we use common random numbers, we have $\lim _{N\rightarrow \infty } q^*_t(F_0 + \Delta F_0) = q^*_t(F_0)$ almost surely for every t, which implies $\lim _{N\rightarrow \infty } \mathcal {V}_\gamma (\varepsilon _N) = 0$. Thus, what is left is to determine the speed of the convergence. Note that simulating $F_t(F_0)$ and $F_t(F_0 + \Delta F_0)$ with independent samples would cause $\mathcal {V}_\gamma (\varepsilon _N) = \mathcal {O}(1)$ because

$$\begin{aligned} \text {Var}(X(F_0+\Delta F_0) - X(F_0))&= \text {Var}(X(F_0+\Delta F_0)) + \text {Var}(X(F_0)) \\&\rightarrow 2 \text {Var}(X(F_0)). \end{aligned}$$

Because the payoff is pathwise discontinuous, we do not expect $\mathcal {V}_\gamma (\varepsilon _N) = \mathcal {O}(\varepsilon _N^2)$ such that the overall variance of the estimator stays bounded as $\varepsilon _N \rightarrow 0$.^{Footnote 62}

We now present some heuristics to argue why we have at least $\mathcal {V}_\gamma (\varepsilon _N) = \mathcal {O}(\varepsilon _N)$. Because of

$$\begin{aligned} F(t, T) = F(0,T) \exp \left( \sum _{s=0}^{t-1}\sigma (s,T) \Delta W(s)\right) , \end{aligned}$$

we have $S_t(F_0 + \Delta F_0) - S_t(F_0) = \mathcal {O}(\varepsilon _N)$. For fixed $r_t$, it follows that $\mathcal {C}_t^{r_t}(F_t + \Delta F_t) - \mathcal {C}_t^{r_t}(F_t) = \mathcal {O}(\varepsilon _N)$ because $\mathcal {C}_t^{r_t}$ is differentiable in $F_t$ which can be seen by applying our previous results to the subproblem (3.5). Thus, toggling between $F_0$ and $F_0 + \Delta F_0$ shifts the exercise condition $S_t-\mathscr {K}+ \mathcal {C}_t^{r_t+1} - \mathcal {C}_t^{r_t}$ in $\mathcal {O}(\varepsilon _N)$. Since

$$\begin{aligned} \mathbb {P}(|S_t - \mathscr {K}+ \mathcal {C}_t^{r_t+1}-\mathcal {C}_t^{r_t}|<c\cdot \varepsilon _N) = \mathcal {O}(\varepsilon _N), \end{aligned}$$

the probability that the exercise condition crosses zero on day t for fixed $r_t$ is $\mathcal {O}(\varepsilon _N)$. Because we consider a discrete finite time horizon and a digital optimal control, there is only a finite number of exercise conditions which could lead to a discontinuous payoff. It follows that

$$\begin{aligned}&\mathbb {P}(X\text { is discontinuous in }[F_0, F_0 + \Delta F_0]) \\&\le \mathbb {P}\left( \bigcup _{t=0}^T \bigcup _{r_t=\underline{q}_t}^{\overline{q}_t} |S_t - \mathscr {K}+ \mathcal {C}_t^{r_t+1}-\mathcal {C}_t^{r_t}|<c\cdot \varepsilon _N\right) \\&\le \sum _{t=0}^T \sum _{r_t=\underline{q}_t}^{\overline{q}_t} \mathbb {P}(|S_t - \mathscr {K}+ \mathcal {C}_t^{r_t+1}-\mathcal {C}_t^{r_t}|<c\cdot \varepsilon _N) \\&= \mathcal {O}(\varepsilon _N), \end{aligned}$$

i.e., the probability that the overall payoff is discontinuous in $[F_0, F_0 + \Delta F_0]$ converges to zero as $\mathcal {O}(\varepsilon _N)$. Bridging discontinuities costs $\mathcal {O}(1)$, whereas we have $\mathcal {O}(\varepsilon _N)$ for a continuous payoff. Let $A := \{\omega \in \Omega : X(w)\text { is continuous in }[F_0, F_0 + \Delta F_0]\}$. It follows

$$\begin{aligned} \mathbb {E}[X]&= \mathbb {P}(\omega \in A) \mathbb {E}[X(\omega )|\omega \in A] + \mathbb {P}(\omega \not \in A) \mathbb {E}[X(\omega )|\omega \not \in A] \\&= \mathcal {O}(1)\mathcal {O}(\varepsilon _N) + \mathcal {O}(\varepsilon _N) \mathcal {O}(1) = \mathcal {O}(\varepsilon _N) \end{aligned}$$

and

$$\begin{aligned} \mathbb {E}[X^2]&= \mathbb {P}(\omega \in A) \mathbb {E}[X^2(\omega )|\omega \in A] + \mathbb {P}(\omega \not \in A) \mathbb {E}[X^2(\omega )|\omega \not \in A] \\&= \mathcal {O}(1) \mathcal {O}(\varepsilon _N^2) + \mathcal {O}(\varepsilon _N) \mathcal {O}(1) = \mathcal {O}(\varepsilon _N). \end{aligned}$$

Hence, we have

$$\begin{aligned} \mathcal {V}_\gamma = \text {Var}(X) = \mathbb {E}[X^2] - (\mathbb {E}[X])^2 = \mathcal {O}(\varepsilon _N). \end{aligned}$$

Table 14 Optimal bandwidth choice

Full size table

We now turn to Monte Carlo methods. For $\varepsilon _N \in \{-2, -1.5, -1, -0.5, 0, 0.5, 1, 1.5, 2\}$, we use the corresponding pathwise delta estimates in the OLS regression

$$\begin{aligned} \mathbb {E}[\delta |F_0 + \Delta F_0] = \delta (F_0) + \delta '(F_0) \cdot \varepsilon + \frac{1}{2} \delta ''(F_0) \cdot \varepsilon ^2. \end{aligned}$$

(C.1)

In addition, we set $^n\delta (\varepsilon _N) := \sum _{t=0}^{T -1} \frac{\partial \,\, ^nS_t}{\partial F(0, t)} \cdot \,^n q^*_t(F_0 + \Delta F_0)$, calculate

$$\begin{aligned} \hat{\mathcal {V}}_\gamma (\varepsilon _N) = \text {Var}(^n\delta (\varepsilon _N) - \,^n\delta (0)) \end{aligned}$$

for each $\varepsilon _N$ and then perform the regression

$$\begin{aligned} \mathbb {E}[\mathcal {V}_\gamma (\varepsilon _N)|\varepsilon _N] = \mathcal {V}'_\gamma \cdot |\varepsilon _N|. \end{aligned}$$

(C.2)

We report the estimates of ${\delta }''(F_0)$ and ${\mathcal {V}}'_\gamma $ in Table 14. These values, related to the bias and variance of our gamma estimator, are then used to derive the optimal bandwidth parameter minimizing (4.10), which is also presented in Table 14. $\square $

Pseudo code

Rights and permissions

Reprints and permissions

About this article

Cite this article

Kohrs, H., Mühlichen, H., Auer, B.R. et al. Pricing and risk of swing contracts in natural gas markets. Rev Deriv Res 22, 77–167 (2019). https://doi.org/10.1007/s11147-018-9146-x

Download citation

Published: 22 June 2018
Issue Date: 15 April 2019
DOI: https://doi.org/10.1007/s11147-018-9146-x

Keywords

JEL Classification

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Pricing and risk of swing contracts in natural gas markets

Abstract

Access this article

Similar content being viewed by others

The affine styled-facts price dynamics for the natural gas: evidence from daily returns and option prices

Option Pricing and Hedging Performance Under Stochastic Volatility and Stochastic Interest Rates

Commodity Spread Option with Cointegration

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix

Proofs

Proof of Lemma 2.4

Proof of Proposition 2.5

Proof of Proposition 3.1

Proof of Lemma 3.3

Proof of Proposition 3.4

Proof of Corollary 3.6

Proof of Lemma 3.7

Proof of Proposition 3.8

Proof of Proposition 3.9

Additional results

Additional derivations and discussions

Pseudo code

Rights and permissions

About this article

Cite this article

Keywords

JEL Classification

Navigation

Pricing and risk of swing contracts in natural gas markets

Abstract

Access this article

Similar content being viewed by others

The affine styled-facts price dynamics for the natural gas: evidence from daily returns and option prices

Option Pricing and Hedging Performance Under Stochastic Volatility and Stochastic Interest Rates

Commodity Spread Option with Cointegration

Notes

References

Acknowledgements

Author information

Authors and Affiliations

Corresponding author

Appendices

Appendix

Proofs

Proof of Lemma 2.4

Proof of Proposition 2.5

Proof of Proposition 3.1

Proof of Lemma 3.3

Proof of Proposition 3.4

Proof of Corollary 3.6

Proof of Lemma 3.7

Proof of Proposition 3.8

Proof of Proposition 3.9

Additional results

Additional derivations and discussions

Pseudo code

Rights and permissions

About this article

Cite this article

Share this article

Keywords

JEL Classification

Search

Navigation