*Result*: Analysis of language patterns in schizophrenia based on natural language processing.
Original Publication: Amsterdam, Elsevier/North-Holland Biomedical Press.
*Further Information*
*Background: Schizophrenia is a severe mental disorder, with language dysfunction being one of its prominent features. Studies have shown that language expression patterns in individuals with schizophrenia differ significantly from those of healthy controls. However, emotional states may influence language expression. This study aims to use natural language processing techniques to analyze stable language expression differences in individuals with schizophrenia under various emotional stimuli.
Methods: The study recruited 104 patients diagnosed with schizophrenia and 80 healthy controls. Participants watched videos depicting different emotional stimuli and then responded to standardized questions. A two-way multivariate analysis of variance was used to explore group differences in language expression. Multivariate linear regression was applied to investigate the relationship between text features and symptom severity. Additionally, machine learning techniques were employed to preliminarily assess the potential of language features in assisting diagnosis.
Results: Stable and significant differences were observed between the schizophrenia group and the healthy control group in conjunction usage and the standard deviation of sentence length. Specific language features, such as the frequency of the term "Jiushi" (equivalent to "just" or "exactly" in English), number of words, and mean dependency distance, showed significant and consistent associations with the severity of negative symptoms. Machine learning models achieved a diagnostic accuracy of 66.3 %.
Conclusions: Language expression in individuals with schizophrenia is influenced by emotional states, which may obscure certain disease-related features. However, stable language expression differences, such as conjunction frequency and sentence length variability, remain. Furthermore, language features can serve as stable predictors for multiple dimensions of negative symptoms.
(Copyright © 2025 Elsevier B.V. All rights reserved.)*
*Declaration of competing interest The authors declare no competing interests.*