Skip to main content
Log in

An Explained Artificial Intelligence-Based Solution to Identify Depression Severity Symptoms Using Acoustic Features

  • Published:
Doklady Mathematics Aims and scope Submit manuscript

Abstract

This paper represents our research to (i) propose an artificial intelligence, AI-based solution to identify depression and (ii) investigate our psychiatric knowledge. Concerning the first objective, we collected and annotated a new audio data set, and scrutinized the performance of eight regression approaches. Our studies showed that k-nearest neighbor and random forest form the group having the most acceptable results. Regarding our second objective, we determined the importance of the features of our best model using the SHapley Additive exPlanations approach: our findings showed that the fourth Mel-frequency cepstral coefficients, harmonic difference, and shimmer are the most important features.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Fig. 1.
Fig. 2.

Similar content being viewed by others

REFERENCES

  1. Depressive disorder. https://www.who.int/news-room/fact-sheets/detail/depression

  2. E. Strumbelj and I. Kononenko, “Explaining prediction models and individual predictions with feature contributions,” Knowl. Inf. Syst. 41 (3), 647–665 (2014)

    Article  Google Scholar 

  3. F. Eyben, M. Wöllmer, and B. Schuller, “OpenSMILE: The Munich versatile and fast open-source audio feature extractor,” in Proceedings of the 18th ACM International Conference on Multimedia (ACM, New York, 2010), pp. 1459–1462.

  4. F. Eyben, K. R. Scherer, B. W. Schuller, J. Sundberg, E. André, C. Busso, L. Y. Devillers, J. Epps, P. Laukka, S. S. Narayanan, et al., “The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing,” IEEE Trans. Affective Comput. 7 (2), 190–202 (2015).

    Article  Google Scholar 

  5. J. Mockus, V. Tiešis, and A. Žilinskas, “The application of Bayesian methods for seeking the extremum,” in Towards Global Optimization (North-Holland, Amsterdam, 1978), pp. 117–129.

    Google Scholar 

  6. J. H. Friedman, “Greedy function approximation: A gradient boosting machine,” Ann. Stat. 29 (5), 1189–1232 2001).

    Article  MathSciNet  Google Scholar 

  7. J. L. Bentley, “Multidimensional binary search trees used for associative searching,” Commun. ACM 18 (9), 509–517 (1975).

    Article  Google Scholar 

  8. L. Breiman, “Random forests,” Mach. Learn. 45 (1), 5–32 (2001).

    Article  Google Scholar 

  9. M. Khudyakova, N. Antonova, M. Nelubina, A. Surova, A. Vorobyova, A. Minnigulova, N. Gronskaya, K. Yashin, I. Medyanik, T. Shishkovskaya, et al., “Discourse diversity database (3D) for clinical linguistics research: Design, development, and analysis,” Bakhtiniana Revista de Estudos do Discurso 18 (1), 32–57 (2023).

    Google Scholar 

  10. S. M. Lundberg and S. Lee, “A unified approach to interpreting model predictions,” in Advances in Neural Information Processing Systems, Ed. by I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Curran Associates, 2017), pp. 4765–4774.

    Google Scholar 

  11. K. P. Murphy, Probabilistic Machine Learning: An Introduction (MIT, Cambridge, Mass., 2022).

    Google Scholar 

  12. P. Wu, R. Wang, H. Lin, F. Zhang, J. Tu, and M. Sun, “Automatic depression recognition by intelligent speech signal processing: A systematic survey,” CAAI Transactions on Intelligence Technology (2022).

    Google Scholar 

  13. T. Hastie, S. Rosset, J. Zhu, and H. Zou, “Multi-class AdaBoost,” Stat. Interface 2 (3), 349–360 (2009).

    Article  MathSciNet  Google Scholar 

Download references

Funding

This study was supported by the grant for research centers in the field of AI provided by the Analytical Center for the Government of the Russian Federation (ACRF) in accordance with the agreement on the provision of subsidies (identifier of the agreement 000000D730321P5Q0002) and the agreement with HSE University no. 70-2021-00139.

Author information

Authors and Affiliations

Authors

Contributions

Sh. S.: Conceptualization, Methodology, Investigation, Software, Validation, Formal analysis, Writing – original draft (WOD), Writing – review and editing. K. A.: Investigation, Software, Validation, WOD. Sh. T.: Data curation. Kh. M.: Conceptualization, Data curation, Writing – review and editing. D. O.: Conceptualization, Writing – review and editing, Resources, Project administration.

All authors read and approved the final manuscript.

Corresponding authors

Correspondence to S. Shalileh, A. O. Koptseva, T. I. Shishkovskaya, M. V. Khudyakova or O. V. Dragoy.

Ethics declarations

The authors of this work declare that they have no conflicts of interest.

Additional information

Publisher’s Note.

Pleiades Publishing remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Check for updates. Verify currency and authenticity via CrossMark

Cite this article

Shalileh, S., Koptseva, A.O., Shishkovskaya, T.I. et al. An Explained Artificial Intelligence-Based Solution to Identify Depression Severity Symptoms Using Acoustic Features. Dokl. Math. 108 (Suppl 2), S374–S381 (2023). https://doi.org/10.1134/S1064562423701090

Download citation

  • Received:

  • Revised:

  • Accepted:

  • Published:

  • Issue Date:

  • DOI: https://doi.org/10.1134/S1064562423701090

Keywords:

Navigation