Abstract
In this paper, we propose a novel model to analyze serially correlated two-dimensional functional data observed sparsely and irregularly on a domain which may not be a rectangle. Our approach employs a mixed effects model that specifies the principal component functions as bivariate splines on triangles and the principal component scores as random effects which follow an auto-regressive model. We apply the thin-plate penalty for regularizing the bivariate function estimation and develop an effective EM algorithm along with Kalman filter and smoother for calculating the penalized likelihood estimates of the parameters. Our approach was applied on simulated datasets and on Texas monthly average temperature data from January year 1915 to December year 2014. Supplementary materials accompanying this paper appear online.
Similar content being viewed by others
References
Absil PA, Mahony R, Sepulchre R (2009) Optimization algorithms on matrix manifolds. Princeton University Press, Princeton
Akaike H (1974) A new look at the statistical model identification. IEEE Trans Autom Control 19(6):716–723
Bosq D (2000) Linear processes in function spaces: theory and applications. Springer, New York
Cabrera BL, Schulz F (2017) Forecasting generalized quantiles of electricity demand: a functional data approach. J Am Stat Assoc 112(517):127–136
Chen L-H, Jiang C-R (2017) Multi-dimensional functional principal component analysis. Stat Comput 27(5):1181–1192
Cipra T, Romera R (1997) Kalman filter with outliers and missing observations. TEST 6:379–395
de Boor C (1978) A practical guide to splines. Springer, New York
Dempster AP, Laird NM, Rubin DB (1977) Maximum likelihood from incomplete data via the EM algorithm. J R Stat Soc: Ser B (Methodol) 39(1):1–22
Ding F, He S, Jones DE, Huang JZ (2022) Functional PCA with covariate-dependent mean and covariance structure. Technometrics 64(3):335–345
Durbin J, Koopman SJ (2012) Time series analysis by state space methods, 2nd edn. Oxford University Press, Oxford
Golub GH, Van Loan CF (2013) Matrix computations. JHU press, Baltimore
Hall P, Müller H-G, Wang J-L (2006) Properties of principal component methods for functional and longitudinal data analysis. Ann Stat 34(3):1493–1517
Hansen J, Sato M, Ruedy R, Lo K, Lea DW, Medina-Elizade M (2006) Global temperature change. Proc Natl Acad Sci 103(39):14288–14293
Huang JZ, Shen H, Buja A (2008) Functional principal components analysis via penalized rank one approximation. Electron J Stat 2:678–695
Hyndman RJ, Shang HL (2009) Forecasting functional time series. J Korean Stat Soc 38(3):199–211
Hyndman RJ, Ullah MS (2007) Robust forecasting of mortality and fertility rates: a functional data approach. Comput Stat Data Anal 51(10):4942–4956
James GM, Hastie TJ, Sugar CA (2000) Principal component models for sparse functional data. Biometrika 87(3):587–602
Jones PD, Wigley TM, Wright PB (1986) Global temperature variations between 1861 and 1984. Nature 322(6078):430–434
Karhunen K (1946) Zur spektraltheorie stochastischer prozesse. Ann Acad Sci Fenn Ser A. I, Math 34:1–7
Kokoszka P, Reimherr M (2013) Determining the order of the functional autoregressive model. J Time Ser Anal 34(1):116–129
Lai M-J, Schumaker LL (2007) Spline functions on triangulations. Cambridge University Press, Cambridge
Li Y, Guan Y (2014) Functional principal component analysis of spatiotemporal point processes with applications in disease surveillance. J Am Stat Assoc 109(507):1205–1215
Liu C, Ray S, Hooker G (2017) Functional principal component analysis of spatially correlated data. Stat Comput 27(6):1639–1654
Liu X, Guillas S, Lai M-J (2016) Efficient spatial modeling using the SPDE approach with bivariate splines. J Comput Graph Stat 25(4):1176–1194
Loève M (1946) Fonctions aléatoires à décomposition orthogonale exponentielle. La Rev Sci 84:159–162
Lorentz GG (1986) Bernstein polynomials, 2nd edn. Chelsea Publishing Co., New York
Menne MJ, Williams CN Jr, Vose RS (2009) The US historical climatology network monthly temperature data, version 2. Bull Am Meteor Soc 90(7):993–1008
Mercer J (1909) Functions of positive and negative type, and their connection the theory of integral equations. Philos Trans R Soc Lond. Ser A, Contain Pap Math Phys Character 209:415–446
Nelder JA, Mead R (1965) A simplex method for function minimization. Comput J 7(4):308–313
Ramsay JO, Silverman BW (2005) Functional data analysis, 2nd edn. Springer, New York
Rice JA, Wu CO (2001) Nonparametric mixed effects models for unequally sampled noisy curves. Biometrics 57(1):253–259
Ruppert D, Wand MP, Carroll RJ (2003) Semiparametric regression. Cambridge University Press, Cambridge
Schwarz G (1978) Estimating the dimension of a model. Ann Stat 6(2):461–464
Shang HL, Hyndman RJ (2017) Grouped functional time series forecasting: an application to age-specific mortality rates. J Comput Graph Stat 26(2):330–343
Shen H (2009) On modeling and forecasting time series of smooth curves. Technometrics 51(3):227–238
Shen H, Huang JZ (2008) Interday forecasting and intraday updating of call center arrivals. Manuf Serv Oper Manag 10(3):391–410
Shi H, Yang Y, Wang L, Ma D, Beg MF, Pei J, Cao J (2022) Two-dimensional functional principal component analysis for image feature extraction. J Comput Graph Stat 31(4):1127–1140
Staniswalis JG, Lee JJ (1998) Nonparametric regression analysis of longitudinal data. J Am Stat Assoc 93(444):1403–1418
Wang Y, Wang G, Wang L, Ogden RT (2020) Simultaneous confidence corridors for mean functions in functional data analysis of imaging data. Biometrics 76(2):427–437
Yao F, Müller H-G, Wang J-L (2005) Functional data analysis for sparse longitudinal data. J Am Stat Assoc 100:577–590
Yao F, Müller H-G, Wang J-L (2005) Functional linear regression analysis for longitudinal data. Ann Stat 33(6):2873–2903
Zhou L, Huang JZ, Carroll R (2008) Joint modelling of paired sparse functional data using principal components. Biometrika 95(3):601–619
Zhou L, Pan H (2014) Principal component analysis of two-dimensional functional data. J Comput Graph Stat 23(3):779–801
Zhou L, Pan H (2014) Smoothing noisy data for irregular regions using penalized bivariate splines on triangulations. Comput Stat 29(1):263–281
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflict of interest
The authors have no conflicts of interest to declare.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
This article was majorly completed while Shirun Shen was a Ph.D. candidate in Department of Statistics, Texas A &M University. This research was supported by Public Computing Cloud, Renmin University of China.
Supplementary Information
Below is the link to the electronic supplementary material.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Shen, S., Zhou, H., He, K. et al. Principal Component Analysis of Two-dimensional Functional Data with Serial Correlation. JABES (2023). https://doi.org/10.1007/s13253-023-00585-8
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s13253-023-00585-8