Skip to main content

Advertisement

Log in

Discovery of multi-domain spatiotemporal associations

  • Published:
GeoInformatica Aims and scope Submit manuscript

Abstract

This paper focuses on the discovery of unusual spatiotemporal associations across multiple phenomena from distinct application domains in a spatial neighborhood where each phenomenon is represented by anomalies from the domain. Such an approach can facilitate the discovery of interesting links between distinct domains, such as links between traffic accidents and environmental factors or road conditions, environmental impacts and human factors, disease spread, and hydrological trajectory, to name a few. This paper proposes techniques to discover spatiotemporal associations across distinct phenomena using a series of anomalous windows from each domain that represent a phenomenon. We propose a novel metric called influence score to quantify the associated influence between the phenomena. In addition, we also propose spatiotemporal confidence, support, and lift measures to quantify these associations. Two novel algorithms for finding multi-domain spatiotemporal associations across phenomena are proposed. We present experimental results across real-world phenomena that are linked and discuss the efficacy of our approach.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1
Fig. 2
Fig. 3
Fig. 4
Fig. 5
Fig. 6
Algorithm 1
Fig. 7
Algorithm 2
Algorithm 3
Fig. 8
Fig. 9
Fig. 10
Fig. 11
Fig. 12
Fig. 13

Similar content being viewed by others

Data Availability

Datasets used in this paper are derived from public sources, links to which are provided in the article. The code and sample dataset are available at the Github repository: https://github.com/MultiDataLab/Multi-Domain-Spatiotemporal-Associations

References

  1. Tobler WR (1970) A computer movie simulating urban growth in the detroit region. Econ Geogr 46(sup1):234–240

    Article  Google Scholar 

  2. Cools M, Moons E, Wets G (2010) Assessing the impact of weather on traffic intensity. Weather Clim Soc 2(1):60–68

    Article  Google Scholar 

  3. Zhang B, Matchinski EJ, Chen B, Ye X, Jing L, Lee K (2019) Marine oil spills-oil pollution, sources and effects. In World seas: an environmental evaluation, p 391-406. Elsevier

  4. Xu R, Rahmandad H, Gupta M, DiGennaro C, Ghaffarzadegan N, Amini H, Jalali MS (2021) Weather, air pollution, and SARSCoV-2 transmission: a global analysis. Lancet Planet Health 5(10):e671–e680

    Article  Google Scholar 

  5. Mass balance of the antarctic ice sheet from 1992 to 2017 (2018) Nature, 558(7709):219-222

  6. Bamber JL, Westaway RM, Marzeion B, Wouters B (2018) The land ice contribution to sea level during the satellite era. Environ Res Lett 13(6):063008

    Article  Google Scholar 

  7. Rignot E, Mouginot J, Scheuchl B, Van Den Broeke M, Wessem MJV, Morlighem M (2019) Four decades of antarctic ice sheet mass balance from 1979–2017. Proc Natl Acad Sci 116(4):1095–1103

    Article  Google Scholar 

  8. Smith B, Fricker HA, Gardner AS, Medley B, Nilsson J, Paolo FS, Holschuh N, Adusumilli S, Brunt K, Csatho B et al (2020) Pervasive ice sheet mass loss reflects competing ocean and atmosphere processes. Science 368(6496):1239–1242

    Article  Google Scholar 

  9. Celik M, Shekhar S, Rogers JP, Shine JA (2008) Mixed-drove spatiotemporal cooccurrence pattern mining. IEEE Trans Knowl Data Eng 20(10):1322–1335

    Article  Google Scholar 

  10. Janeja VP, Palanisamy R (2013) Multidomain anomaly detection in spatial datasets. Knowl Inf Syst 36(3):749–788

    Article  Google Scholar 

  11. Lee J-G, Han J, Whang K-Y (2007) Trajectory clustering: a partition-and-group framework. In Proceedings of the 2007 ACM SIGMOD international conference on Management of data, p 593-604

  12. Cressie N (2015) Statistics for spatial data. John Wiley & Sons

    MATH  Google Scholar 

  13. Schabenberger O, Gotway CA (2017) Statistical methods for spatial data analysis: Texts in statistical science. Chapman and Hall/CRC

    Book  MATH  Google Scholar 

  14. Chuang A (1991) Time series analysis: univariate and multivariate methods. Taylor & Francis

    Google Scholar 

  15. Cao H, Cheung DW, Mamoulis N (2004) Discovering partial periodic patterns in discrete data sequences In Pacific-Asia conference on knowledge discovery and data mining, p 653-658. Springer

  16. Huang Y, Shekhar S, Xiong H (2004) Discovering colocation patterns from spatial data sets: a general approach. IEEE Trans Knowl Data Eng 16(12):1472–1485

    Article  Google Scholar 

  17. Lee I, Estivill-Castro V (2011) Exploration of massive crime data sets through data mining techniques. Appl Artif Intell 25(5):362–379

    Article  Google Scholar 

  18. Estivill-Castro V, Lee I (2001) Data mining techniques for autonomous exploration of large volumes of geo-referenced crime data. In Proc. of the 6th International Conference on Geocomputation, p 24-26. Citeseer

  19. Huang Y, Zhang P (2006) On the relationships between clustering and spatial co-location pattern mining. In 2006 18th IEEE International Conference on Tools with Artificial Intelligence (ICTAI-06), p 513-522. IEEE

  20. Koperski K, Han J (1995) Discovery of spatial association rules in geographic information databases. In International Symposium on Spatial Databases, p 47-66. Springer

  21. Tao Y, Kollios G, Considine J, Li F, Papadias D (2004) Spatio-temporal aggregation using sketches. In Proceedings. 20th International Conference on Data Engineering, p 214-225. IEEE

  22. Tsoukatos I, Gunopulos D (2001) Efficient mining of spatiotemporal patterns. In International Symposium on Spatial and Temporal Databases, p 425–442. Springer

  23. Kulldorff M (1997) A spatial scan statistic. Commun Stat - Theory Methods 26(6):1481–1496

    Article  MathSciNet  MATH  Google Scholar 

  24. Neill DB, Moore AW (2006) Chapter 16 - methods for detecting spatial and spatio-temporal clusters. In: Wagner MM, Moore AW, Aryel RM (eds) Handbook of Biosurveillance. Academic Press, Burlington, pp 243–254

    Chapter  Google Scholar 

  25. Xie Y, Shekhar S, Li Y (2022) Statisticallyrobust clustering techniques for mapping spatial hotspots: A survey. ACM Comput Surv (CSUR) 55(2):1–38

    Article  Google Scholar 

  26. Fitzpatrick D, Ni Y, Neill DB (2021) Support vector subset scan for spatial pattern detection. Comput Stat Data Anal 157:107149

    Article  MathSciNet  MATH  Google Scholar 

  27. Kulldorff M, Mostashari F, Duczmal L, Yih WK, Kleinman K, Platt R (2007) Multivariate scan statistics for disease surveillance. Stat Med 26(8):1824–1833

    Article  MathSciNet  Google Scholar 

  28. Tao Y, Pi D (2008) A neighborhood-based trajectory clustering algorithm. In 2008 Workshop on Power Electronics and Intelligent Transportation System, p 272–275. IEEE

  29. Post E, Alley RB, Christensen TR, Macias-Fauria M, Forbes BC, Gooseff MN, Iler A, Kerby JT, Laidre KL, Mann ME et al (2019) The polar regions in a 2 c warmer world. Sci Adv 5(12):eaaw9883

    Article  Google Scholar 

  30. Shi L, Janeja VP (2009) Anomalous window discovery through scan statistics for linear intersecting paths (sslip). In Proceedings of the 15th ACM SIGKDD international conference on Knowledge discovery and data mining, p 767–776

  31. Tango T, Takahashi K (2012) A flexible spatial scan statistic with a restricted likelihood ratio for detecting disease clusters. Stat Med 31(30):4207–4218

    Article  MathSciNet  Google Scholar 

  32. Mohammadi SH, Janeja VP, Gangopadhyay A (2009) Discretized spatio-temporal scan window. In Proceedings of the 2009 SIAM International Conference on Data Mining, p 1197–1208. SIAM

  33. Janeja VP, Adam NR, Atluri V, Vaidya J (2010) Spatial neighborhood based anomaly detection in sensor datasets. Data Min Knowl Discov 20(2):221–258

    Article  MathSciNet  Google Scholar 

  34. Sugiyama M (2016) Introduction to statistical machine learning. Morgan Kaufmann

    MATH  Google Scholar 

  35. The county health rankings, a key component of the mobilizing action toward community health 1034 (match) project, 2010. http://www.countyhealthrankings.org/. Last Accessed 01-March-2011

Download references

Acknowledgements

This work is supported in part by the US Army Corps of Engineers, Engineers Research and Development Center, agreement number: W9132V-15-C-0004 and by the National Science Foundation (iHARP, Award #2118285).

Author information

Authors and Affiliations

Authors

Corresponding author

Correspondence to Vandana P. Janeja.

Ethics declarations

Conflicts of interest

The authors have no competing interests to declare that are relevant to the content of this article.

Additional information

Publisher's Note

Springer Nature remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Springer Nature or its licensor (e.g. a society or other partner) holds exclusive rights to this article under a publishing agreement with the author(s) or other rightsholder(s); author self-archiving of the accepted manuscript version of this article is solely governed by the terms of such publishing agreement and applicable law.

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Walkikar, P., Shi, L., Tama, B.A. et al. Discovery of multi-domain spatiotemporal associations. Geoinformatica (2023). https://doi.org/10.1007/s10707-023-00506-4

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • DOI: https://doi.org/10.1007/s10707-023-00506-4

Keywords

Navigation