An Explained Artificial Intelligence-Based Solution to Identify Depression Severity Symptoms Using Acoustic Features

Shalileh, S.; Koptseva, A. O.; Shishkovskaya, T. I.; Khudyakova, M. V.; Dragoy, O. V.

doi:10.1134/S1064562423701090

An Explained Artificial Intelligence-Based Solution to Identify Depression Severity Symptoms Using Acoustic Features

Published: 09 February 2024

Volume 108, pages S374–S381, (2023)
Cite this article

Doklady Mathematics Aims and scope Submit manuscript

S. Shalileh^1,2,
A. O. Koptseva²,
T. I. Shishkovskaya³,
M. V. Khudyakova^1,4 &
…
O. V. Dragoy^1,5

64 Accesses
Explore all metrics

Abstract

This paper represents our research to (i) propose an artificial intelligence, AI-based solution to identify depression and (ii) investigate our psychiatric knowledge. Concerning the first objective, we collected and annotated a new audio data set, and scrutinized the performance of eight regression approaches. Our studies showed that k-nearest neighbor and random forest form the group having the most acceptable results. Regarding our second objective, we determined the importance of the features of our best model using the SHapley Additive exPlanations approach: our findings showed that the fourth Mel-frequency cepstral coefficients, harmonic difference, and shimmer are the most important features.

This is a preview of subscription content, log in via an institution to check access.

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions

Sentiment Analysis in Social Media Data for Depression Detection Using Artificial Intelligence: A Review

Article 19 November 2021

Detection of Autism Spectrum Disorder in Children Using Machine Learning Techniques

Article 22 July 2021

Automatic speech recognition: a survey

Article 10 November 2020

REFERENCES

Depressive disorder. https://www.who.int/news-room/fact-sheets/detail/depression
E. Strumbelj and I. Kononenko, “Explaining prediction models and individual predictions with feature contributions,” Knowl. Inf. Syst. 41 (3), 647–665 (2014)
Article Google Scholar
F. Eyben, M. Wöllmer, and B. Schuller, “OpenSMILE: The Munich versatile and fast open-source audio feature extractor,” in Proceedings of the 18th ACM International Conference on Multimedia (ACM, New York, 2010), pp. 1459–1462.
F. Eyben, K. R. Scherer, B. W. Schuller, J. Sundberg, E. André, C. Busso, L. Y. Devillers, J. Epps, P. Laukka, S. S. Narayanan, et al., “The Geneva minimalistic acoustic parameter set (GeMAPS) for voice research and affective computing,” IEEE Trans. Affective Comput. 7 (2), 190–202 (2015).
Article Google Scholar
J. Mockus, V. Tiešis, and A. Žilinskas, “The application of Bayesian methods for seeking the extremum,” in Towards Global Optimization (North-Holland, Amsterdam, 1978), pp. 117–129.
Google Scholar
J. H. Friedman, “Greedy function approximation: A gradient boosting machine,” Ann. Stat. 29 (5), 1189–1232 2001).
Article MathSciNet Google Scholar
J. L. Bentley, “Multidimensional binary search trees used for associative searching,” Commun. ACM 18 (9), 509–517 (1975).
Article Google Scholar
L. Breiman, “Random forests,” Mach. Learn. 45 (1), 5–32 (2001).
Article Google Scholar
M. Khudyakova, N. Antonova, M. Nelubina, A. Surova, A. Vorobyova, A. Minnigulova, N. Gronskaya, K. Yashin, I. Medyanik, T. Shishkovskaya, et al., “Discourse diversity database (3D) for clinical linguistics research: Design, development, and analysis,” Bakhtiniana Revista de Estudos do Discurso 18 (1), 32–57 (2023).
Google Scholar
S. M. Lundberg and S. Lee, “A unified approach to interpreting model predictions,” in Advances in Neural Information Processing Systems, Ed. by I. Guyon, U. V. Luxburg, S. Bengio, H. Wallach, R. Fergus, S. Vishwanathan, and R. Garnett (Curran Associates, 2017), pp. 4765–4774.
Google Scholar
K. P. Murphy, Probabilistic Machine Learning: An Introduction (MIT, Cambridge, Mass., 2022).
Google Scholar
P. Wu, R. Wang, H. Lin, F. Zhang, J. Tu, and M. Sun, “Automatic depression recognition by intelligent speech signal processing: A systematic survey,” CAAI Transactions on Intelligence Technology (2022).
Google Scholar
T. Hastie, S. Rosset, J. Zhu, and H. Zou, “Multi-class AdaBoost,” Stat. Interface 2 (3), 349–360 (2009).
Article MathSciNet Google Scholar

Download references

Funding

This study was supported by the grant for research centers in the field of AI provided by the Analytical Center for the Government of the Russian Federation (ACRF) in accordance with the agreement on the provision of subsidies (identifier of the agreement 000000D730321P5Q0002) and the agreement with HSE University no. 70-2021-00139.

Author information

Authors and Affiliations

Center for Language and Brain, HSE University, Moscow, Russia
S. Shalileh, M. V. Khudyakova & O. V. Dragoy
Vision Modeling Laboratory, HSE University, Moscow, Russia
S. Shalileh & A. O. Koptseva
Department of Endogenous Mental Disorders and Affective States, Federal State Budgetary Scientific Institution Mental Health Research Center, Moscow, Russia
T. I. Shishkovskaya
Center for Language and Brain, HSE University, Nizhny Novgorod, Russia
M. V. Khudyakova
Institute of Linguistics, Russian Academy of Sciences, Moscow, Russia
O. V. Dragoy

Authors

S. Shalileh
View author publications
You can also search for this author in PubMed Google Scholar
A. O. Koptseva
View author publications
You can also search for this author in PubMed Google Scholar
T. I. Shishkovskaya
View author publications
You can also search for this author in PubMed Google Scholar
M. V. Khudyakova
View author publications
You can also search for this author in PubMed Google Scholar
O. V. Dragoy
View author publications
You can also search for this author in PubMed Google Scholar

Contributions

Sh. S.: Conceptualization, Methodology, Investigation, Software, Validation, Formal analysis, Writing – original draft (WOD), Writing – review and editing. K. A.: Investigation, Software, Validation, WOD. Sh. T.: Data curation. Kh. M.: Conceptualization, Data curation, Writing – review and editing. D. O.: Conceptualization, Writing – review and editing, Resources, Project administration.

All authors read and approved the final manuscript.

Corresponding authors

Correspondence to S. Shalileh, A. O. Koptseva, T. I. Shishkovskaya, M. V. Khudyakova or O. V. Dragoy.

Ethics declarations

The authors of this work declare that they have no conflicts of interest.

Additional information

Publisher’s Note.

Pleiades Publishing remains neutral with regard to jurisdictional claims in published maps and institutional affiliations.

Rights and permissions

Reprints and permissions

About this article

Cite this article

Shalileh, S., Koptseva, A.O., Shishkovskaya, T.I. et al. An Explained Artificial Intelligence-Based Solution to Identify Depression Severity Symptoms Using Acoustic Features. Dokl. Math. 108 (Suppl 2), S374–S381 (2023). https://doi.org/10.1134/S1064562423701090

Download citation

Received: 01 August 2023
Revised: 18 August 2023
Accepted: 15 October 2023
Published: 09 February 2024
Issue Date: December 2023
DOI: https://doi.org/10.1134/S1064562423701090

Keywords:

Access this article

Log in via an institution

Price excludes VAT (USA)
Tax calculation will be finalised during checkout.

Instant access to the full article PDF.

Institutional subscriptions