Abstract
This paper focuses on the discovery of unusual spatiotemporal associations across multiple phenomena from distinct application domains in a spatial neighborhood where each phenomenon is represented by anomalies from the domain. Such an approach can facilitate the discovery of interesting links between distinct domains, such as links between traffic accidents and environmental factors or road conditions, environmental impacts and human factors, disease spread, and hydrological trajectory, to name a few. This paper proposes techniques to discover spatiotemporal associations across distinct phenomena using a series of anomalous windows from each domain that represent a phenomenon. We propose a novel metric called influence score to quantify the associated influence between the phenomena. In addition, we also propose spatiotemporal confidence, support, and lift measures to quantify these associations. Two novel algorithms for finding multi-domain spatiotemporal associations across phenomena are proposed. We present experimental results across real-world phenomena that are linked and discuss the efficacy of our approach.
Similar content being viewed by others
Data Availability
Datasets used in this paper are derived from public sources, links to which are provided in the article. The code and sample dataset are available at the Github repository: https://github.com/MultiDataLab/Multi-Domain-Spatiotemporal-Associations
References
Tobler WR (1970) A computer movie simulating urban growth in the detroit region. Econ Geogr 46(sup1):234–240
Cools M, Moons E, Wets G (2010) Assessing the impact of weather on traffic intensity. Weather Clim Soc 2(1):60–68
Zhang B, Matchinski EJ, Chen B, Ye X, Jing L, Lee K (2019) Marine oil spills-oil pollution, sources and effects. In World seas: an environmental evaluation, p 391-406. Elsevier
Xu R, Rahmandad H, Gupta M, DiGennaro C, Ghaffarzadegan N, Amini H, Jalali MS (2021) Weather, air pollution, and SARSCoV-2 transmission: a global analysis. Lancet Planet Health 5(10):e671–e680
Mass balance of the antarctic ice sheet from 1992 to 2017 (2018) Nature, 558(7709):219-222
Bamber JL, Westaway RM, Marzeion B, Wouters B (2018) The land ice contribution to sea level during the satellite era. Environ Res Lett 13(6):063008
Rignot E, Mouginot J, Scheuchl B, Van Den Broeke M, Wessem MJV, Morlighem M (2019) Four decades of antarctic ice sheet mass balance from 1979–2017. Proc Natl Acad Sci 116(4):1095–1103
Smith B, Fricker HA, Gardner AS, Medley B, Nilsson J, Paolo FS, Holschuh N, Adusumilli S, Brunt K, Csatho B et al (2020) Pervasive ice sheet mass loss reflects competing ocean and atmosphere processes. Science 368(6496):1239–1242
Celik M, Shekhar S, Rogers JP, Shine JA (2008) Mixed-drove spatiotemporal cooccurrence pattern mining. IEEE Trans Knowl Data Eng 20(10):1322–1335
Janeja VP, Palanisamy R (2013) Multidomain anomaly detection in spatial datasets. Knowl Inf Syst 36(3):749–788
Lee J-G, Han J, Whang K-Y (2007) Trajectory clustering: a partition-and-group framework. In Proceedings of the 2007 ACM SIGMOD international conference on Management of data, p 593-604
Cressie N (2015) Statistics for spatial data. John Wiley & Sons
Schabenberger O, Gotway CA (2017) Statistical methods for spatial data analysis: Texts in statistical science. Chapman and Hall/CRC
Chuang A (1991) Time series analysis: univariate and multivariate methods. Taylor & Francis
Cao H, Cheung DW, Mamoulis N (2004) Discovering partial periodic patterns in discrete data sequences In Pacific-Asia conference on knowledge discovery and data mining, p 653-658. Springer
Huang Y, Shekhar S, Xiong H (2004) Discovering colocation patterns from spatial data sets: a general approach. IEEE Trans Knowl Data Eng 16(12):1472–1485
Lee I, Estivill-Castro V (2011) Exploration of massive crime data sets through data mining techniques. Appl Artif Intell 25(5):362–379
Estivill-Castro V, Lee I (2001) Data mining techniques for autonomous exploration of large volumes of geo-referenced crime data. In Proc. of the 6th International Conference on Geocomputation, p 24-26. Citeseer
Huang Y, Zhang P (2006) On the relationships between clustering and spatial co-location pattern mining. In 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI-06), p 513-522. IEEE
Koperski K, Han J (1995) Discovery of spatial association rules in geographic information databases. In International Symposium on Spatial Databases, p 47-66. Springer
Tao Y, Kollios G, Considine J, Li F, Papadias D (2004) Spatio-temporal aggregation using sketches. In Proceedings. 20th International Conference on Data Engineering, p 214-225. IEEE
Tsoukatos I, Gunopulos D (2001) Efficient mining of spatiotemporal patterns. In International Symposium on Spatial and Temporal Databases, p 425–442. Springer
Kulldorff M (1997) A spatial scan statistic. Commun Stat - Theory Methods 26(6):1481–1496
Neill DB, Moore AW (2006) Chapter 16 - methods for detecting spatial and spatio-temporal clusters. In: Wagner MM, Moore AW, Aryel RM (eds) Handbook of Biosurveillance. Academic Press, Burlington, pp 243–254
Xie Y, Shekhar S, Li Y (2022) Statisticallyrobust clustering techniques for mapping spatial hotspots: A survey. ACM Comput Surv (CSUR) 55(2):1–38
Fitzpatrick D, Ni Y, Neill DB (2021) Support vector subset scan for spatial pattern detection. Comput Stat Data Anal 157:107149
Kulldorff M, Mostashari F, Duczmal L, Yih WK, Kleinman K, Platt R (2007) Multivariate scan statistics for disease surveillance. Stat Med 26(8):1824–1833
Tao Y, Pi D (2008) A neighborhood-based trajectory clustering algorithm. In 2008 Workshop on Power Electronics and Intelligent Transportation System, p 272–275. IEEE
Post E, Alley RB, Christensen TR, Macias-Fauria M, Forbes BC, Gooseff MN, Iler A, Kerby JT, Laidre KL, Mann ME et al (2019) The polar regions in a 2 c warmer world. Sci Adv 5(12):eaaw9883
Shi L, Janeja VP (2009) Anomalous window discovery through scan statistics for linear intersecting paths (sslip). In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, p 767–776
Tango T, Takahashi K (2012) A flexible spatial scan statistic with a restricted likelihood ratio for detecting disease clusters. Stat Med 31(30):4207–4218
Mohammadi SH, Janeja VP, Gangopadhyay A (2009) Discretized spatio-temporal scan window. In Proceedings of the 2009 SIAM International Conference on Data Mining, p 1197–1208. SIAM
Janeja VP, Adam NR, Atluri V, Vaidya J (2010) Spatial neighborhood based anomaly detection in sensor datasets. Data Min Knowl Discov 20(2):221–258
Sugiyama M (2016) Introduction to statistical machine learning. Morgan Kaufmann
The county health rankings, a key component of the mobilizing action toward community health 1034 (match) project, 2010. http://www.countyhealthrankings.org/. Last Accessed 01-March-2011
Acknowledgements
This work is supported in part by the US Army Corps of Engineers, Engineers Research and Development Center, agreement number: W9132V-15-C-0004 and by the National Science Foundation (iHARP, Award #2118285).
Author information
Authors and Affiliations
Corresponding author
Ethics declarations
Conflicts of interest
The authors have no competing interests to declare that are relevant to the content of this article.
Additional information
Publisher's Note
Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.
Rights and permissions
Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.
About this article
Cite this article
Walkikar, P., Shi, L., Tama, B.A. et al. Discovery of multi-domain spatiotemporal associations. Geoinformatica (2023). https://doi.org/10.1007/s10707-023-00506-4
Received:
Revised:
Accepted:
Published:
DOI: https://doi.org/10.1007/s10707-023-00506-4