Electronic Science and Technology ›› 2025, Vol. 38 ›› Issue (1): 88-94.doi: 10.16180/j.cnki.issn1007-7820.2025.01.012

Previous Articles    

Depression Detection Based on Cross User Audio Domain Adaptation Network

WU Wei1, MA Longhua1,2, ZHAO Xianghong2()   

  1. 1. School of Information Science and Engineering,Zhejiang Sci-Tech University,Hangzhou 310018,China
    2. School of Information Science and Engineering,Ningbo Tech University,Ningbo 315100,China
  • Received:2023-06-25 Revised:2023-07-16 Online:2025-01-15 Published:2025-01-06
  • Supported by:
    National Natural Science Foundation of China(61972350);National Natural Science Foundation of China(32073028);Ningbo Natural Science Foundation(2022J165)

Abstract:

Because of the subjective detection of depression, the use of user voice diagnosis of depression has become a more potential auxiliary way. However, the speech signals of different users are different. In this study, a CUADAN(Cross User Audio Domain Adaptation Network) is proposed to detect depression. Visual Mel spectrograms are extracted from the audio, and the feature extractor of the CUADAN model is used to extract deeper depression features from the Mel spectrograms. Since the source domain and target domain contain the voice features of different healthy users and depressed users, the domain classifier of CUADAN model is used to perform domain adaptation between different user data, so that unknown users can be detected by existing classifiers. The experimental results show that the CUADAN model has a higher depression detection accuracy, with an average accuracy of 81.0±2.4%. Therefore, the CUADAN model can effectively weaken the differences between different users' voices and improve the accuracy of cross-user depression detection.

Key words: domain adaptation, depression detection, CUADAN, audio, cross-user, Mel spectrogram, feature extraction, weakening differences

CLC Number: 

  • TP181