青少年认知功能亚型与生物声学语音特征、情绪状态间的相关分析

Correlation analysis of neurocognitive function subtypes, bio-acoustic speech features and emotional states in adolescents

ES评分 0

DOI
刊名
Journal of International Psychiatry
年,卷(期) 2025, 52(1)
作者
作者单位

武汉大学人民医院 孝感市康复医院

摘要
【摘要】目的 分析青少年各认知功能亚型间生物声学语音特征、情绪状态间的关联。 方法 本研究入组442名12至17岁青少年,男生208名,女生234名。采用中文版MATRICS成套认知测试评估受试者信息处理速度、语言学习和记忆、工作记忆、推理和问题解决能力、视觉学习和记忆、注意/警觉性等6个认知领域的成绩,以及1个社会认知(情绪管理)能力。精神科医生采用他评量表汉密尔顿焦虑量表、抑郁量表评定受试者焦虑、抑郁情绪状态;受试者使用自评量表9项患者健康问卷(PHQ-9)、7项广泛性焦虑障碍量表(GAD-7)测评其抑郁、焦虑情绪状态。受试者阅读正性、中性、负性情绪刺激相关的三段文字并用语音采集装置采集其语音;使用OpenSmile软件提取生物声学特征:均方根能量、12个维度的梅尔频谱倒谱系数、零交叉率、发声概率、基频。采用分层聚类方法,根据受试者认知功能测试成绩去检测认知亚型;分别计算2-6个认知亚型的ARI(Adjusted Rand Index)值,并采用十折交叉验证(随机抽样5000次)分别计算2-6个认知亚型的平均ARI值来评价聚类稳定性。统计各认知亚型间的生物声学语音特征、情绪状态组间差异。 结果 分层聚类分析中,认知聚类II亚型十折交叉平均ARI值(0.4838)较认知聚类III、IV、V、VI亚型的ARI值均高。认知聚类II亚型中的亚型1、2之间除外工作记忆、社会认知,在其余所有认知领域中亚型1均较亚型2的成绩好。认知亚型1的受试者阅读正性文档时的零交叉率较亚型2的低(F=4.768,P=0.03),阅读中性文档时零交叉率低(F=4.846 P=0.028),梅尔频谱倒谱系数-1的值较认知亚型2高(F=4.69 P=0.031)。亚型1与2之间的情绪状态无明显组间差异。结论 本研究首次在青少年健康群体中发现不同认知功能水平个体在阅读正性、中性情绪刺激任务相关的文本时其生物声学语音特征有显著差异;梅尔频谱倒谱系数-1、零交叉率可能有助于青少年群体的认知功能水平的区分、鉴别。
Abstract
Abstract Objective: To analyze the relationship between the bio-acoustic speech features and emotional states among the neurocognitive function subtypes of adolescents. Methods: The study included 442 adolescents aged 12 to 17, including 208 male and 234 female students. The Chinese version of the MATRICS cognitive test was used to assess the subjects' performance in 6 cognitive domains, including information processing speed, language learning and memory, working memory, reasoning and problem solving ability, visual learning and memory, attention/alertness, and 1 social cognition (emotion management) ability. Senior psychiatrists used the Hamilton Anxiety Scale and depression scale to evaluate the anxiety and depression of the subjects. The subjects used the self-rated 9-item Patient Health Questionnaire (PHQ-9) and the 7-item Generalized Anxiety Disorder Scale (GAD-7) to assess their depression and anxiety. Subjects read three paragraphs related to positive, neutral and negative emotional stimuli and collect their speech with a speech acquisition device. Bioacoustic features were extracted using OpenSmile software: Root Mean Square Energy, Meir spectrum cepstrum coefficients in 12 dimensions, Zero Crossing Rate, Voice Probability and Fundamental Frequency. A hierarchical clustering method was used to detect cognitive subtypes according to the cognitive function level of subjects. The ARI (Adjusted Rand Index) values of 2 to 6 cognitive subtypes were calculated respectively, and the average ARI values of 2 to 6 cognitive subtypes were calculated using 10-fold cross-validation (random sampling 5000 times) to evaluate the clustering stability. The differences of bioacoustic speech features and emotional states among different cognitive subtypes were measured. Results: In hierarchical cluster analysis, the average 10-fold crossover ARI value (0.4838) of cognitive cluster II subtypes was higher than that of cognitive cluster III, IV, V and VI subtypes. Subtype 1 and subtype 2 of cognitive cluster II performed better than subtype 2 in all other cognitive domains except working memory and social cognition. Subjects of cognitive subtype 1 had a lower Zero Crossing Rate when reading positive documents than subtype 2 (F=4.768, P=0.03), a lower Zero Crossing Rate when reading neutral documents (F=4.846 P=0.028), and a higher value of the Mel-Frequency Cipstal Coefficients-1 than cognitive subtype 2 (F=4.69 P=0.031). There was no significant difference in emotional state between subtypes 1 and 2. Conclusion: In this study, for the first time, it was found that the bio-acoustic speech features of individuals with different cognitive function levels were significantly different in healthy adolescents while reading texts related to positive and neutral emotional stimulation tasks. The Meir spectrum cepstrum coefficients-1 and Zero Crossing Rate may be helpful for the differentiation and identification of cognitive function levels in adolescents.
关键词
青少年;认知功能;生物声学语音特征;分层聚类
KeyWord
Adolescent; Cognition; Bio-acoustic Speech Features; Hierarchical Clustering
基金项目
页码 33-37
  • 参考文献
  • 相关文献
  • 引用本文

陈小磊, 董黎, 杜隆彬, 胡茂林, 彭欢, 孙霞, 王毅刚, 殷淑娴, 张圆圆, 宗小芬. 青少年认知功能亚型与生物声学语音特征、情绪状态间的相关分析 [J]. 国际精神病学杂志. 2025; 52; (1). 33 - 37.

  • 文献评论

相关学者

相关机构