Using multimodal machine learning to identify hidden subtypes of Major Depressive Disorder
Depression affects 280 million people worldwide, yet diagnosis remains subjective, relying on clinical interviews and questionnaires. Traditional methods can't detect hidden subtypes or objective biomarkers.
Apply unsupervised machine learning to discover hidden depression subtypes from multimodal clinical data: speech acoustics, linguistic patterns, and behavioral features.
Enable objective, data-driven depression diagnosis. Move from subjective assessment to personalized treatment based on discovered subtypes and biomarker patterns.
DAIC-WOZ Depression Database
33 interviewsTF-IDF (100) + COVAREP (296)
396 featuresPCA (95% variance)
27 componentsK-Means Algorithm
k=2Chi-square test
p=0.0112Gold standard clinical depression database from USC Institute for Creative Technologies (AVEC 2017)
Full interview transcripts with timestamps
COVAREP acoustic features (74 dimensions)
Facial Action Units (not used in this study)
PHQ-8 depression severity scores (0-24)
27 components retain 95.4% of variance (396 → 27 dimensions)
Green: Healthy (PHQ8<10), Red: Depressed (PHQ8≥10)
Unsupervised K-Means clustering (k=2)
χ² = 6.44, p = 0.0112 (significant correlation)
Elbow method and silhouette score optimization (k=2 optimal)
View complete analysis and visualizations in the Jupyter Notebook
USC Institute for Creative Technologies - DAIC-WOZ Depression Database (AVEC 2017)
COVAREP Team - Acoustic feature extraction toolkit
MIT License - For research and educational purposes
Ethical Note: This project is for research purposes only. It is not intended to replace professional medical diagnosis or treatment. If you or someone you know is experiencing depression, please seek help from qualified mental health professionals.