skip to main content
research-article

Identifying and Explaining Safety-critical Scenarios for Autonomous Vehicles via Key Features

Published:18 April 2024Publication History
Skip Abstract Section

Abstract

Ensuring the safety of autonomous vehicles (AVs) is of utmost importance, and testing them in simulated environments is a safer option than conducting in-field operational tests. However, generating an exhaustive test suite to identify critical test scenarios is computationally expensive, as the representation of each test is complex and contains various dynamic and static features, such as the AV under test, road participants (vehicles, pedestrians, and static obstacles), environmental factors (weather and light), and the road’s structural features (lanes, turns, road speed, etc.). In this article, we present a systematic technique that uses Instance Space Analysis (ISA) to identify the significant features of test scenarios that affect their ability to reveal the unsafe behaviour of AVs. ISA identifies the features that best differentiate safety-critical scenarios from normal driving and visualises the impact of these features on test scenario outcomes (safe/unsafe) in two dimensions. This visualisation helps to identify untested regions of the instance space and provides an indicator of the quality of the test suite in terms of the percentage of feature space covered by testing. To test the predictive ability of the identified features, we train five Machine Learning classifiers to classify test scenarios as safe or unsafe. The high precision, recall, and F1 scores indicate that our proposed approach is effective in predicting the outcome of a test scenario without executing it and can be used for test generation, selection, and prioritisation.

REFERENCES

  1. [1] Baidu Apollo team (2017), Apollo: Open Source Autonomous Driving. Retrieved from https://github.com/ApolloAuto/apolloGoogle ScholarGoogle Scholar
  2. [2] BeamNG.tech. Retrieved from https://beamng.tech/Google ScholarGoogle Scholar
  3. [3] Mathworks Polyarea. Retrieved from https://www.mathworks.com/help/matlab/ref/polyarea.htmlGoogle ScholarGoogle Scholar
  4. [4] Mathworks Polygons. Retrieved from https://au.mathworks.com/help/map/create-and-display-polygons.htmlGoogle ScholarGoogle Scholar
  5. [5] Abdessalem Raja Ben, Panichella Annibale, Nejati Shiva, Briand Lionel C, and Stifter Thomas. 2018. Testing autonomous cars for feature interaction failures using many-objective search. In Proceedings of the 33rd IEEE/ACM International Conference on Automated Software Engineering (ASE’18). IEEE, 143154.Google ScholarGoogle ScholarDigital LibraryDigital Library
  6. [6] Abdi Hervé and Williams Lynne J.. 2010. Principal component analysis. Wiley Interdiscipl. Rev.: Comput. Stat. 2, 4 (2010), 433459.Google ScholarGoogle ScholarDigital LibraryDigital Library
  7. [7] Afzal Afsoon, Katz Deborah S., Goues Claire Le, and Timperley Christopher S.. 2021. Simulation for robotics test automation: Developer perspectives. In Proceedings of the 14th IEEE Conference on Software Testing, Verification and Validation (ICST’21). IEEE, 263274.Google ScholarGoogle ScholarCross RefCross Ref
  8. [8] Aghababaeyan Zohreh, Abdellatif Manel, Briand Lionel, Ramesh S., and Bagherzadeh Mojtaba. 2023. Black-box testing of deep neural networks through test case diversity. IEEE Transactions on Software Engineering 49, 5 (2023), 31823204.Google ScholarGoogle Scholar
  9. [9] Aleti Aldeida and Martinez Matias. 2021. E-APR: mapping the effectiveness of automated program repair techniques. Empirical Software Engineering 26, 5 (2021), 130.Google ScholarGoogle ScholarDigital LibraryDigital Library
  10. [10] Ali Elhashemi M., Ahmed Mohamed M., and Wulff Shaun S.. 2019. Detection of critical safety events on freeways in clear and rainy weather using SHRP2 naturalistic driving data: Parametric and non-parametric techniques. Safe. Sci. 119 (2019), 141149.Google ScholarGoogle ScholarCross RefCross Ref
  11. [11] Almanee Sumaya, Wu Xiafa, Huai Yuqi, Chen Qi Alfred, and Garcia Joshua. 2021. scenoRITA: Generating less-redundant, safety-critical and motion sickness-inducing scenarios for autonomous vehicles. arXiv:2112.09725. Retrieved from https://arxiv.org/abs/2112.09725Google ScholarGoogle Scholar
  12. [12] Aranganayagi S and Thangavel Kuttiyannan. 2007. Clustering categorical data using silhouette coefficient as a relocating measure. In Proceedings of the International Conference on Computational Intelligence and Multimedia Applications (ICCIMA’07), Vol. 2. IEEE, 1317.Google ScholarGoogle ScholarDigital LibraryDigital Library
  13. [13] Arcaini Paolo, Zhang Xiao-Yi, and Ishikawa Fuyuki. 2021. Targeting patterns of driving characteristics in testing autonomous driving systems. In Proceedings of the 14th IEEE Conference on Software Testing, Verification and Validation (ICST’21). IEEE, 295305.Google ScholarGoogle ScholarCross RefCross Ref
  14. [14] Arrieta Aitor, Wang Shuai, Arruabarrena Ainhoa, Markiegi Urtzi, Sagardui Goiuria, and Etxeberria Leire. 2018. Multi-objective black-box test case selection for cost-effectively testing simulation models. In Proceedings of the Genetic and Evolutionary Computation Conference. 14111418.Google ScholarGoogle ScholarDigital LibraryDigital Library
  15. [15] Arrieta Aitor, Wang Shuai, Sagardui Goiuria, and Etxeberria Leire. 2019. Search-based test case prioritization for simulation-based testing of cyber-physical system product lines. J. Syst. Softw. 149 (2019), 134.Google ScholarGoogle ScholarCross RefCross Ref
  16. [16] Abdessalem Raja Ben, Nejati Shiva, Briand Lionel C., and Stifter Thomas. 2016. Testing advanced driver assistance systems using multi-objective search and neural networks. In Proceedings of the 31st IEEE/ACM International Conference on Automated Software Engineering. 6374.Google ScholarGoogle ScholarDigital LibraryDigital Library
  17. [17] Birchler Christian, Ganz Nicolas, Khatiri Sajad, Gambi Alessio, and Panichella Sebastiano. 2022. Cost-effective simulation-based test selection in self-driving cars software with SDC-Scissor. In Proceedings of the 29th IEEE International Conference on Software Analysis, Evolution, and Reengineering. ZHAW Zürcher Hochschule für Angewandte Wissenschaften.Google ScholarGoogle ScholarCross RefCross Ref
  18. [18] Birchler Christian, Khatiri Sajad, Derakhshanfar Pouria, Panichella Sebastiano, and Panichella Annibale. 2021. Automated test cases prioritization for self-driving cars in virtual environments. arXiv:2107.09614. Retrieved from https://arxiv.org/abs/2107.09614Google ScholarGoogle Scholar
  19. [19] Birchler Christian, Khatiri Sajad, Derakhshanfar Pouria, Panichella Sebastiano, and Panichella Annibale. 2022. Single and multi-objective test cases prioritization for self-driving cars in virtual environments. Proc. ACM Meas. Anal. Comput. Syst. 32, 2 (2022). 1–30.Google ScholarGoogle Scholar
  20. [20] Breiman Leo. 2001. Random forests. Mach. Learn. 45, 1 (2001), 532.Google ScholarGoogle ScholarDigital LibraryDigital Library
  21. [21] Calò Alessandro, Arcaini Paolo, Ali Shaukat, Hauer Florian, and Ishikawa Fuyuki. 2020. Generating avoidable collision scenarios for testing autonomous driving systems. In Proceedings of the IEEE 13th International Conference on Software Testing, Validation and Verification (ICST’20). IEEE, 375386.Google ScholarGoogle ScholarCross RefCross Ref
  22. [22] Campos José, Ge Yan, Fraser Gordon, Eler Marcelo, and Arcuri Andrea. 2017. An empirical evaluation of evolutionary algorithms for test suite generation. In International Symposium on Search Based Software Engineering. Springer, 3348.Google ScholarGoogle ScholarCross RefCross Ref
  23. [23] Cara Irene and Gelder Erwin de. 2015. Classification for safety-critical car-cyclist scenarios using machine learning. In Proceedings of the IEEE 18th International Conference on Intelligent Transportation Systems. IEEE, 19952000.Google ScholarGoogle ScholarDigital LibraryDigital Library
  24. [24] Castellano Ezequiel, Cetinkaya Ahmet, Thanh Cédric Ho, Klikovits Stefan, Zhang Xiaoyi, and Arcaini Paolo. 2021. Frenetic at the SBST 2021 tool competition. In Proceedings of the IEEE/ACM 14th International Workshop on Search-Based Software Testing (SBST’21). 3637. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  25. [25] Chandrasekaran Jaganmohan, Lei Yu, Kacker Raghu, and Kuhn D. Richard. 2021. A combinatorial approach to testing deep neural network-based autonomous driving systems. In Proceedings of the IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW’21). IEEE, 5766.Google ScholarGoogle ScholarCross RefCross Ref
  26. [26] Chen Junjie, Yan Ming, Wang Zan, Kang Yuning, and Wu Zhuo. 2020. Deep neural network test coverage: How far are we? arXiv:2010.04946. Retrieved from https://arxiv.org/abs/2010.04946Google ScholarGoogle Scholar
  27. [27] Cohen Andrew R. and Vitányi Paul M. B.. 2014. Normalized compression distance of multisets with applications. IEEE Trans. Pattern Anal. Mach. Intell. 37, 8 (2014), 16021614.Google ScholarGoogle ScholarDigital LibraryDigital Library
  28. [28] Daszykowski Michal, Walczak Beata, and Massart D. L.. 2001. Looking for natural patterns in data: Part 1. Density-based approach. Chemometr. Intell. Lab. Syst. 56, 2 (2001), 8392.Google ScholarGoogle ScholarCross RefCross Ref
  29. [29] Winter Joost C. F. De, Gosling Samuel D., and Potter Jeff. 2016. Comparing the Pearson and Spearman correlation coefficients across distributions and sample sizes: A tutorial using simulations and empirical data. Psychol. Methods 21, 3 (2016), 273.Google ScholarGoogle ScholarCross RefCross Ref
  30. [30] Deng Yao, Zheng Xi, Zhang Mengshi, Lou Guannan, and Zhang Tianyi. 2022. Scenario-based test reduction and prioritization for multi-module autonomous driving systems. In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 8293.Google ScholarGoogle ScholarDigital LibraryDigital Library
  31. [31] Ding Wenhao, Xu Chejian, Arief Mansur, Lin Haohong, Li Bo, and Zhao Ding. 2023. A survey on safety-critical driving scenario generation—A methodological perspective. IEEE Transactions on Intelligent Transportation Systems 24 (2023), 69716988.Google ScholarGoogle ScholarDigital LibraryDigital Library
  32. [32] Ebadi Hamid, Moghadam Mahshid Helali, Borg Markus, Gay Gregory, Fontes Afonso, and Socha Kasper. 2021. Efficient and effective generation of test cases for pedestrian detection-search-based software testing of baidu apollo in SVL. In Proceedings of the IEEE International Conference on Artificial Intelligence Testing (AITest’21). IEEE, 103110.Google ScholarGoogle ScholarCross RefCross Ref
  33. [33] Edelsbrunner Herbert, Kirkpatrick David, and Seidel Raimund. 1983. On the shape of a set of points in the plane. IEEE Trans. Inf. Theory 29, 4 (1983), 551559.Google ScholarGoogle ScholarDigital LibraryDigital Library
  34. [34] Gambi Alessio, Huynh Tri, and Fraser Gordon. 2019. Generating effective test cases for self-driving cars from police reports. In Proceedings of the 27th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 257267.Google ScholarGoogle ScholarDigital LibraryDigital Library
  35. [35] Gambi Alessio, Jahangirova Gunel, Riccio Vincenzo, and Zampetti Fiorella. 2022. SBST tool competition 2022. In Proceedings of the 15th Workshop on Search-Based Software Testing. 2532.Google ScholarGoogle ScholarDigital LibraryDigital Library
  36. [36] Gambi Alessio, Mueller Marc, and Fraser Gordon. 2019. Automatically testing self-driving cars with search-based procedural content generation. In Proceedings of the 28th ACM SIGSOFT International Symposium on Software Testing and Analysis. 318328.Google ScholarGoogle ScholarDigital LibraryDigital Library
  37. [37] Géron Aurélien. 2022. Hands-on Machine Learning with Scikit-Learn, Keras, and TensorFlow. O’Reilly Media, Inc.Google ScholarGoogle Scholar
  38. [38] Ghani Kamran, Clark John A., and Zhan Yuan. 2009. Comparing algorithms for search-based test data generation of Matlab Simulink models. In Proceedings of the IEEE Congress on Evolutionary Computation. IEEE, 29402947.Google ScholarGoogle ScholarCross RefCross Ref
  39. [39] Han Jia Cheng and Zhou Zhi Quan. 2020. Metamorphic fuzz testing of autonomous vehicles. In Proceedings of the IEEE/ACM 42nd International Conference on Software Engineering Workshops. 380385.Google ScholarGoogle ScholarDigital LibraryDigital Library
  40. [40] Haq Fitash Ul, Shin Donghwan, and Briand Lionel. 2022. Efficient online testing for DNN-enabled systems using surrogate-assisted and many-objective optimization. In Proceedings of the 44th International Conference on Software Engineering. 811822.Google ScholarGoogle ScholarDigital LibraryDigital Library
  41. [41] Harman Mark and McMinn Phil. 2007. A theoretical & empirical analysis of evolutionary testing and hill climbing for structural test data generation. In Proceedings of the International Symposium on Software Testing and Analysis. 7383.Google ScholarGoogle ScholarDigital LibraryDigital Library
  42. [42] Hauer Florian, Gerostathopoulos Ilias, Schmidt Tabea, and Pretschner Alexander. 2020. Clustering traffic scenarios using mental models as little as possible. In Proceedings of the IEEE Intelligent Vehicles Symposium (IV’20). IEEE, 10071012.Google ScholarGoogle ScholarDigital LibraryDigital Library
  43. [43] Hauer Florian, Pretschner Alexander, and Holzmüller Bernd. 2019. Fitness functions for testing automated and autonomous driving systems. In International Conference on Computer Safety, Reliability, and Security. Springer, 6984.Google ScholarGoogle ScholarDigital LibraryDigital Library
  44. [44] Hauer Florian, Schmidt Tabea, Holzmüller Bernd, and Pretschner Alexander. 2019. Did we test all scenarios for automated and autonomous driving systems? In Proceedings of the IEEE Intelligent Transportation Systems Conference (ITSC’19). IEEE, 29502955.Google ScholarGoogle ScholarDigital LibraryDigital Library
  45. [45] Hinkle D. E., Wiersma W., and Jurs S. G.. 2002. Applied statistics for the behavioral sciences. CENGAGE Learning. https://books.google.com.au/books?id=74kDAAAACAAJGoogle ScholarGoogle Scholar
  46. [46] Humeniuk Dmytro, Antoniol Giuliano, and Khomh Foutse. 2022. AmbieGen tool at the SBST 2022 tool competition. In Proceedings of the 15th Workshop on Search-Based Software Testing. 4346.Google ScholarGoogle ScholarDigital LibraryDigital Library
  47. [47] Jahangirova Gunel, Stocco Andrea, and Tonella Paolo. 2021. Quality metrics and oracles for autonomous vehicles testing. In Proceedings of the 14th IEEE Conference on Software Testing, Verification and Validation (ICST’21). IEEE, 194204.Google ScholarGoogle ScholarCross RefCross Ref
  48. [48] Kaiser Bernhard. 2021. Application Story of ODD as Part of Safety Assurance. Retrieved from https://www.asam.net/index.php?eID=dumpFile&t=f&f=430&token=3135965e578e5bb92a01725cd37823c3979da158Google ScholarGoogle Scholar
  49. [49] Kalra Nidhi and Paddock Susan M.. 2016. How Many Miles of Driving Would It Take to Demonstrate Autonomous Vehicle Reliability. Technical Report. RAND Corporation, Santa Monica, CA, 11291134.Google ScholarGoogle Scholar
  50. [50] Kang Yanfei, Hyndman Rob J., and Smith-Miles Kate. 2017. Visualising forecasting algorithm performance using time series instance spaces. Int. J. Forecast. 33, 2 (2017), 345358.Google ScholarGoogle ScholarCross RefCross Ref
  51. [51] Kerber Jonas, Wagner Sebastian, Groh Korbinian, Notz Dominik, Kühbeck Thomas, Watzenig Daniel, and Knoll Alois. 2020. Clustering of the scenario space for the assessment of automated driving. In Proceedings of the IEEE Intelligent Vehicles Symposium (IV’20). IEEE, 578583.Google ScholarGoogle ScholarDigital LibraryDigital Library
  52. [52] Kim Baekgyu, Kashiba Yusuke, Dai Siyuan, and Shiraishi Shinichi. 2016. Testing autonomous vehicle software in the virtual prototyping environment. IEEE Embed. Syst. Lett. 9, 1 (2016), 58.Google ScholarGoogle ScholarDigital LibraryDigital Library
  53. [53] Klück Florian, Li Yihao, Nica Mihai, Tao Jianbo, and Wotawa Franz. 2018. Using ontologies for test suites generation for automated and autonomous driving functions. In Proceedings of the IEEE International Symposium on Software Reliability Engineering Workshops (ISSREW’18). IEEE, 118123.Google ScholarGoogle ScholarCross RefCross Ref
  54. [54] Klück Florian, Wotawa Franz, Neubauer Gerhard, Tao Jianbo, and Nica Mihai. 2021. Analysing experimental results obtained when applying search-based testing to verify automated driving functions. In Proceedings of the 8th International Conference on Dependable Systems and Their Applications (DSA’21). IEEE, 213219.Google ScholarGoogle ScholarCross RefCross Ref
  55. [55] Klück Florian, Zimmermann Martin, Wotawa Franz, and Nica Mihai. 2019. Genetic algorithm-based test parameter optimization for ADAS system testing. In Proceedings of the IEEE 19th International Conference on Software Quality, Reliability and Security (QRS’19). IEEE, 418425.Google ScholarGoogle ScholarCross RefCross Ref
  56. [56] Klück Florian, Zimmermann Martin, Wotawa Franz, and Nica Mihai. 2019. Performance comparison of two search-based testing strategies for ADAS system validation. In IFIP International Conference on Testing Software and Systems. Springer, 140156.Google ScholarGoogle ScholarDigital LibraryDigital Library
  57. [57] Kotsiantis Sotiris B.. 2013. Decision trees: A recent overview. Artif. Intell. Rev. 39, 4 (2013), 261283.Google ScholarGoogle ScholarDigital LibraryDigital Library
  58. [58] Kramer Oliver. 2016. Scikit-learn. In Machine Learning for Evolution Strategies. Springer, 4553.Google ScholarGoogle ScholarCross RefCross Ref
  59. [59] Kruber Friedrich, Wurst Jonas, and Botsch Michael. 2018. An unsupervised random forest clustering technique for automatic traffic scenario categorization. In Proceedings of the 21st International Conference on Intelligent Transportation Systems (ITSC’18). IEEE, 28112818.Google ScholarGoogle ScholarDigital LibraryDigital Library
  60. [60] Kruber Friedrich, Wurst Jonas, Morales Eduardo Sánchez, Chakraborty Samarjit, and Botsch Michael. 2019. Unsupervised and supervised learning with the random forest algorithm for traffic scenario clustering and classification. In Proceedings of the IEEE Intelligent Vehicles Symposium (IV’19). IEEE, 24632470.Google ScholarGoogle ScholarDigital LibraryDigital Library
  61. [61] Kulesza Alex, Taskar Ben, et al. 2012. Determinantal point processes for machine learning. Found. Trends Mach. Learn. 5, 2–3 (2012), 123286.Google ScholarGoogle ScholarCross RefCross Ref
  62. [62] Laurent Thomas, Klikovits Stefan, Arcaini Paolo, Ishikawa Fuyuki, and Ventresque Anthony. 2022. Parameter coverage for testing of autonomous driving systems under uncertainty. ACM Transactions on Software Engineering and Methodology 33, 3 (2022), 131.Google ScholarGoogle ScholarDigital LibraryDigital Library
  63. [63] Li Guanpeng, Li Yiran, Jha Saurabh, Tsai Timothy, Sullivan Michael, Hari Siva Kumar Sastry, Kalbarczyk Zbigniew, and Iyer Ravishankar. 2020. AV-FUZZER: Finding safety violations in autonomous driving systems. In Proceedings of the IEEE 31st International Symposium on Software Reliability Engineering (ISSRE’20). IEEE, 2536.Google ScholarGoogle ScholarCross RefCross Ref
  64. [64] Li Guanpeng, Li Yiran, Jha Saurabh, Tsai Timothy, Sullivan Michael, Hari Siva Kumar Sastry, Kalbarczyk Zbigniew, and Iyer Ravishankar. 2020. AV-FUZZER: Finding safety violations in autonomous driving systems. In Proceedings of the IEEE 31st International Symposium on Software Reliability Engineering (ISSRE’20). 2536. DOI:Google ScholarGoogle ScholarCross RefCross Ref
  65. [65] Li Yihao, Tao Jianbo, and Wotawa Franz. 2020. Ontology-based test generation for automated and autonomous driving functions. Inf. Softw. Technol. 117 (2020), 106200.Google ScholarGoogle ScholarDigital LibraryDigital Library
  66. [66] Lou Guannan, Deng Yao, Zheng Xi, Zhang Mengshi, and Zhang Tianyi. 2022. Testing of autonomous driving systems: Where are we and where should we go? In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 3143.Google ScholarGoogle ScholarDigital LibraryDigital Library
  67. [67] Lu Chengjie, Shi Yize, Zhang Huihui, Zhang Man, Wang Tiexin, Yue Tao, and Ali Shaukat. 2022. Learning configurations of operating environment of autonomous vehicles to maximize their collisions. IEEE Transactions on Software Engineering 49, 1 (2022), 384402.Google ScholarGoogle ScholarCross RefCross Ref
  68. [68] Lu Chengjie, Yue Tao, and Ali Shaukat. 2023. DeepScenario: An open driving scenario dataset for autonomous driving system testing. In IEEE/ACM 20th International Conference on Mining Software Repositories (MSR’23). 5256. Google ScholarGoogle ScholarCross RefCross Ref
  69. [69] Lu Chengjie, Zhang Huihui, Yue Tao, and Ali Shaukat. 2021. Search-based selection and prioritization of test scenarios for autonomous driving systems. In International Symposium on Search Based Software Engineering. Springer, 4155.Google ScholarGoogle ScholarDigital LibraryDigital Library
  70. [70] Majzik István, Semeráth Oszkár, Hajdu Csaba, Marussy Kristóf, Szatmári Zoltán, Micskei Zoltán, Vörös András, Babikian Aren A, and Varró Dániel. 2019. Towards system-level testing with coverage guarantees for autonomous vehicles. In Proceedings of the ACM/IEEE 22nd International Conference on Model Driven Engineering Languages and Systems (MODELS’19). IEEE, 8994.Google ScholarGoogle ScholarCross RefCross Ref
  71. [71] Math Rafael, Mahr Angela, Moniri Mohammad M., and Müller Christian. 2013. OpenDS: A new open-source driving simulator for research. In Proceedings of the GMM Symposium on Automotive meets Electronics. GMM-Symposium (AmE’13).Google ScholarGoogle Scholar
  72. [72] Minderhoud Michiel M. and Bovy Piet H. L.. 2001. Extended time-to-collision measures for road traffic safety assessment. Accident Anal. Prevent. 33, 1 (2001), 8997.Google ScholarGoogle ScholarCross RefCross Ref
  73. [73] Mouret Jean-Baptiste and Clune Jeff. 2015. Illuminating search spaces by mapping elites. arXiv:1504.04909. Retrieved from https://arxiv.org/abs/1504.04909Google ScholarGoogle Scholar
  74. [74] Muñoz Mario A.. 2021. InstanceSpace. Retrieved from https://github.com/andremun/InstanceSpaceGoogle ScholarGoogle Scholar
  75. [75] Muñoz Mario A. and Smith-Miles Kate. 2020. Generating new space-filling test instances for continuous black-box optimization. Evol. Comput. 28, 3 (2020), 379404.Google ScholarGoogle ScholarDigital LibraryDigital Library
  76. [76] Muñoz Mario A. and Smith-Miles Kate A.. 2017. Performance analysis of continuous black-box optimization algorithms via footprints in instance space. Evol. Comput. 25, 4 (2017), 529554.Google ScholarGoogle ScholarDigital LibraryDigital Library
  77. [77] Muñoz Mario A., Villanova Laura, Baatar Davaatseren, and Smith-Miles Kate. 2018. Instance spaces for machine learning classification. Mach. Learn. 107, 1 (2018), 109147.Google ScholarGoogle ScholarDigital LibraryDigital Library
  78. [78] Muñoz Mario A., Yan Tao, Leal Matheus R., Smith-Miles Kate A., Lorena Ana C., Pappa Gisele L., and M Rodrigues Romulo. 2021. An instance space analysis of regression problems. ACM Trans. Knowl. Discov. Data 15, 2 (2021).Google ScholarGoogle Scholar
  79. [79] Murtagh Fionn. 1991. Multilayer perceptrons for classification and regression. Neurocomputing 2, 5-6 (1991), 183197.Google ScholarGoogle ScholarCross RefCross Ref
  80. [80] Nguyen Vuong, Huber Stefan, and Gambi Alessio. 2021. SALVO: Automated generation of diversified tests for self-driving cars from existing maps. In Proceedings of the IEEE International Conference on Artificial Intelligence Testing (AITest’21). IEEE, 128135.Google ScholarGoogle ScholarCross RefCross Ref
  81. [81] Oliveira Carlos, Aleti Aldeida, Grunske Lars, and Smith-Miles Kate. 2018. Mapping the effectiveness of automated test suite generation techniques. IEEE Trans. Reliabil. 67, 3 (2018), 771785.Google ScholarGoogle ScholarCross RefCross Ref
  82. [82] Onieva Enrique, Hernández-Jayo Unai, Osaba Eneko, Perallos Asier, and Zhang Xiao. 2015. A multi-objective evolutionary algorithm for the tuning of fuzzy rule bases for uncoordinated intersections in autonomous driving. Inf. Sci. 321 (2015), 1430.Google ScholarGoogle ScholarDigital LibraryDigital Library
  83. [83] Panichella Annibale, Kifetew Fitsum Meshesha, and Tonella Paolo. 2017. Lips vs mosa: A replicated empirical study on automated test case generation. In International Symposium on Search Based Software Engineering. Springer, 8398.Google ScholarGoogle ScholarCross RefCross Ref
  84. [84] Panichella Annibale, Kifetew Fitsum Meshesha, and Tonella Paolo. 2018. A large scale empirical comparison of state-of-the-art search-based test case generators. Inf. Softw. Technol. 104 (2018), 236256.Google ScholarGoogle ScholarCross RefCross Ref
  85. [85] Panichella Sebastiano, Gambi Alessio, Zampetti Fiorella, and Riccio Vincenzo. 2021. Sbst tool competition 2021. In Proceedings of the IEEE/ACM 14th International Workshop on Search-Based Software Testing (SBST’21). IEEE, 2027.Google ScholarGoogle ScholarCross RefCross Ref
  86. [86] Rajabli Nijat, Flammini Francesco, Nardone Roberto, and Vittorini Valeria. 2020. Software verification and validation of safe autonomous cars: A systematic literature review. IEEE Access 9 (2020), 47974819.Google ScholarGoogle ScholarCross RefCross Ref
  87. [87] Riccio Vincenzo and Tonella Paolo. 2020. Model-based exploration of the frontier of behaviours for deep learning system testing. In Proceedings of the 28th ACM Joint Meeting on European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 876888.Google ScholarGoogle ScholarDigital LibraryDigital Library
  88. [88] Rish Irina et al. 2001. An empirical study of the naive Bayes classifier. In Proceedings of the IJCAI 2001 Workshop on Empirical Methods in Artificial Intelligence, Vol. 3. 4146.Google ScholarGoogle Scholar
  89. [89] Rong Guodong, Shin Byung Hyun, Tabatabaee Hadi, Lu Qiang, Lemke Steve, Možeiko Mārtiņš, Boise Eric, Uhm Geehoon, Gerow Mark, Mehta Shalin, et al. 2020. Lgsvl simulator: A high fidelity simulator for autonomous driving. In Proceedings of the IEEE 23rd International Conference on Intelligent Transportation Systems (ITSC’20). IEEE, 16.Google ScholarGoogle ScholarDigital LibraryDigital Library
  90. [90] Scalabrino Simone, Grano Giovanni, Nucci Dario Di, Oliveto Rocco, and Lucia Andrea De. 2016. Search-based testing of procedural programs: Iterative single-target or multi-target approach? In International Symposium on Search Based Software Engineering. Springer, 6479.Google ScholarGoogle ScholarCross RefCross Ref
  91. [91] Schubert Erich, Sander Jörg, Ester Martin, Kriegel Hans Peter, and Xu Xiaowei. 2017. DBSCAN revisited, revisited: why and how you should (still) use DBSCAN. ACM Trans. Datab. Syst. 42, 3 (2017), 121.Google ScholarGoogle ScholarDigital LibraryDigital Library
  92. [92] Shalev-Shwartz Shai, Shammah Shaked, and Shashua Amnon. 2017. On a formal model of safe and scalable self-driving cars. arXiv:1708.06374. Retrieved from https://arxiv.org/abs/1708.06374Google ScholarGoogle Scholar
  93. [93] Smith-Miles Kate, Baatar Davaatseren, Wreford Brendan, and Lewis Rhyd. 2014. Towards objective measures of algorithm performance across instance space. Comput. Operat. Res. 45 (2014), 1224.Google ScholarGoogle ScholarDigital LibraryDigital Library
  94. [94] Neelofar Neelofar, Smith-Miles Kate, Muñoz Mario Andrés, and Aleti Aldeida. 2022. Instance space analysis of search-based software testing. IEEE Transactions on Software Engineering 49, 4 (2022), 26422660.Google ScholarGoogle ScholarDigital LibraryDigital Library
  95. [95] Stocco Andrea and Tonella Paolo. 2022. Confidence-driven weighted retraining for predicting safety-critical failures in autonomous driving systems. J. Softw.: Evol. Process 34, 10 (2022), e2386.Google ScholarGoogle ScholarCross RefCross Ref
  96. [96] Stocco Andrea, Weiss Michael, Calzana Marco, and Tonella Paolo. 2020. Misbehaviour prediction for autonomous driving systems. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering. 359371.Google ScholarGoogle ScholarDigital LibraryDigital Library
  97. [97] Tang Yun, Zhou Yuan, Liu Yang, Sun Jun, and Wang Gang. 2021. Collision avoidance testing for autonomous driving systems on complete maps. In Proceedings of the IEEE Intelligent Vehicles Symposium (IV’21). IEEE, 179185.Google ScholarGoogle ScholarDigital LibraryDigital Library
  98. [98] Tang Yun, Zhou Yuan, Zhang Tianwei, Wu Fenghua, Liu Yang, and Wang Gang. 2021. Systematic testing of autonomous driving systems using map topology-based scenario classification. In Proceedings of the 36th IEEE/ACM International Conference on Automated Software Engineering (ASE’21). 13421346. DOI:Google ScholarGoogle ScholarDigital LibraryDigital Library
  99. [99] Tao Jianbo, Li Yihao, Wotawa Franz, Felbinger Hermann, and Nica Mihai. 2019. On the industrial application of combinatorial testing for autonomous driving functions. In Proceedings of the IEEE International Conference on Software Testing, Verification and Validation Workshops (ICSTW’19). IEEE, 234240.Google ScholarGoogle ScholarCross RefCross Ref
  100. [100] Taunk Kashvi, De Sanjukta, Verma Srishti, and Swetapadma Aleena. 2019. A brief review of nearest neighbor algorithm for learning and classification. In Proceedings of the International Conference on Intelligent Computing and Control Systems (ICCS’19). IEEE, 12551260.Google ScholarGoogle ScholarCross RefCross Ref
  101. [101] Tian Haoxiang, Jiang Yan, Wu Guoquan, Yan Jiren, Wei Jun, Chen Wei, Li Shuo, and Ye Dan. 2022. MOSAT: Finding safety violations of autonomous driving systems using multi-objective genetic algorithm. In Proceedings of the 30th ACM Joint European Software Engineering Conference and Symposium on the Foundations of Software Engineering. 94106.Google ScholarGoogle ScholarDigital LibraryDigital Library
  102. [102] Tian Haoxiang, Wu Guoquan, Yan Jiren, Jiang Yan, Wei Jun, Chen Wei, Li Shuo, and Ye Dan. 2022. Generating critical test scenarios for autonomous driving systems via influential behavior patterns. In Proceedings of the 37th IEEE/ACM International Conference on Automated Software Engineering. 112.Google ScholarGoogle ScholarDigital LibraryDigital Library
  103. [103] Tian Yuchi, Pei Kexin, Jana Suman, and Ray Baishakhi. 2018. Deeptest: Automated testing of deep-neural-network-driven autonomous cars. In Proceedings of the 40th International Conference on Software Engineering. 303314.Google ScholarGoogle ScholarDigital LibraryDigital Library
  104. [104] Timperley Christopher Steven, Afzal Afsoon, Katz Deborah S., Hernandez Jam Marcos, and Goues Claire Le. 2018. Crashing simulated planes is cheap: Can simulation detect robotics bugs early? In Proceedings of the IEEE 11th International Conference on Software Testing, Verification and Validation (ICST’18). IEEE, 331342.Google ScholarGoogle ScholarCross RefCross Ref
  105. [105] Vogel Katja. 2003. A comparison of headway and time to collision as safety indicators. Accident Anal. Prevent. 35, 3 (2003), 427433.Google ScholarGoogle ScholarCross RefCross Ref
  106. [106] Wu Huayao, Nie Changhai, Petke Justyna, Jia Yue, and Harman Mark. 2018. An empirical comparison of combinatorial testing, random testing and adaptive random testing. IEEE Trans. Softw. Eng. 46, 3 (2018), 302320.Google ScholarGoogle ScholarDigital LibraryDigital Library
  107. [107] Ye Xinchen and Wang Xuesong. 2022. Operational design domain of automated vehicles at freeway entrance terminals. Accident Anal. Prevent. 174 (2022), 106776.Google ScholarGoogle ScholarCross RefCross Ref
  108. [108] Zeller Andreas. 2017. Search-based testing and system testing: a marriage in heaven. In Proceedings of the IEEE/ACM 10th International Workshop on Search-Based Software Testing (SBST’17). IEEE, 4950.Google ScholarGoogle ScholarCross RefCross Ref
  109. [109] Zhang Mengshi, Zhang Yuqun, Zhang Lingming, Liu Cong, and Khurshid Sarfraz. 2018. DeepRoad: GAN-based metamorphic testing and input validation framework for autonomous driving systems. In Proceedings of the 33rd ACM/IEEE International Conference on Automated Software Engineering. 132142.Google ScholarGoogle ScholarDigital LibraryDigital Library
  110. [110] Zhong Ziyuan, Kaiser Gail, and Ray Baishakhi. 2023. Neural network guided evolutionary fuzzing for finding traffic violations of autonomous vehicles. IEEE Transactions on Software Engineering 49, 4 (2023), 18601875. Google ScholarGoogle ScholarDigital LibraryDigital Library
  111. [111] Zhou Husheng, Li Wei, Kong Zelun, Guo Junfeng, Zhang Yuqun, Yu Bei, Zhang Lingming, and Liu Cong. 2020. Deepbillboard: Systematic physical-world testing of autonomous driving systems. In Proceedings of the ACM/IEEE 42nd International Conference on Software Engineering. 347358.Google ScholarGoogle ScholarDigital LibraryDigital Library
  112. [112] Zofka Marc René, Klemm Sebastian, Kuhnt Florian, Schamm Thomas, and Zöllner J. Marius. 2016. Testing and validating high level components for automated driving: Simulation framework for traffic scenarios. In Proceedings of the IEEE Intelligent Vehicles Symposium (IV’16). IEEE, 144150.Google ScholarGoogle ScholarDigital LibraryDigital Library
  113. [113] Zohdinasab Tahereh, Riccio Vincenzo, Gambi Alessio, and Tonella Paolo. 2021. Deephyperion: Exploring the feature space of deep learning-based systems through illumination search. In Proceedings of the 30th ACM SIGSOFT International Symposium on Software Testing and Analysis. 7990.Google ScholarGoogle ScholarDigital LibraryDigital Library
  114. [114] Neelofar Neelofar and Aleti Aldeida. 2024. Towards reliable AI: Adequacy metrics for ensuring the quality of system-level testing of autonomous vehicles. In IEEE/ACM 46th International Conference on Software Engineering (ICSE’24). 805816.Google ScholarGoogle ScholarDigital LibraryDigital Library

Index Terms

  1. Identifying and Explaining Safety-critical Scenarios for Autonomous Vehicles via Key Features

    Recommendations

    Comments

    Login options

    Check if you have access through your login credentials or your institution to get full access on this article.

    Sign in

    Full Access

    • Published in

      cover image ACM Transactions on Software Engineering and Methodology
      ACM Transactions on Software Engineering and Methodology  Volume 33, Issue 4
      May 2024
      940 pages
      ISSN:1049-331X
      EISSN:1557-7392
      DOI:10.1145/3613665
      • Editor:
      • Mauro Pezzè
      Issue’s Table of Contents

      Permission to make digital or hard copies of all or part of this work for personal or classroom use is granted without fee provided that copies are not made or distributed for profit or commercial advantage and that copies bear this notice and the full citation on the first page. Copyrights for components of this work owned by others than the author(s) must be honored. Abstracting with credit is permitted. To copy otherwise, or republish, to post on servers or to redistribute to lists, requires prior specific permission and/or a fee. Request permissions from [email protected].

      Publisher

      Association for Computing Machinery

      New York, NY, United States

      Publication History

      • Published: 18 April 2024
      • Online AM: 11 January 2024
      • Accepted: 8 December 2023
      • Revised: 9 October 2023
      • Received: 10 May 2023
      Published in tosem Volume 33, Issue 4

      Check for updates

      Qualifiers

      • research-article
    • Article Metrics

      • Downloads (Last 12 months)311
      • Downloads (Last 6 weeks)87

      Other Metrics

    PDF Format

    View or Download as a PDF file.

    PDF

    eReader

    View online with eReader.

    eReader

    Full Text

    View this article in Full Text.

    View Full Text