Dynamic Emotional Signature Graphs — a model-agnostic evaluator that decouples clinical states and scores therapeutic progress with asymmetric geometry.
Scroll
The blind spot in mental-health AI evaluation — surface empathy masking clinically directional deterioration
AI sounds warm, supportive, and understanding. Users feel heard — but clinical direction hasn't improved. The response validates without challenging.
Responses quietly reinforce catastrophic beliefs, hopeless predictions, and distorted self-labels. Polite on the surface, deteriorating underneath.
Language is warm, but clinical trajectory is worsening. Traditional evaluators only see the surface — they can't detect the direction of change.
DESG's core insight: we must track clinical trajectory direction, not single-turn surface quality.
LLMs as sensors, not judges — extract clinical states, then score with geometry.
Composite representation from dialogue: semantic (1536-D) + affective valence-arousal (2-D) + cognitive distortion distribution (10-D).
State sequences become directed emotional signature graphs — encoding temporal evolution, not independent per-turn scoring.
Distinguishes "improving" from "deteriorating" — exponential penalty for worsening, bounded reward for recovery.
From raw dialogue to clinical safety score — end-to-end offline evaluation without LLM judges.
Dialogue → 1548-D clinical state vectorh_sem ∥ h_emo ∥ h_cog
Clinical directional distance metric
Deterioration penalty, recovery reward
DESG graph + Hungarian GED
Nodes = states, edges = KL divergence
Momentum reward + distortion penalty wall
Productive / Neutral / Harmful
Each dimension corresponds to a clinically-defined cognitive distortion pattern, extracted as a probability simplex.
Amplifying small issues into catastrophic consequences
Assuming others' thoughts without verification
Predicting negative outcomes as certain
Imposing unreasonable "must/should" demands
Defining self/others with extreme labels
Focusing only on negative details, ignoring positives
Black-and-white extreme binary thinking
Inferring universal rules from single events
Attributing external events to oneself
Taking feelings as evidence of facts
Structured extraction via LLM · Probability simplex output · Embedded as cognitive track in clinical state space
Three independent clinical tracks, concatenated into a single state vector.
MiniLM-L6-v2 embeddings, zero-padded to 1536 dimensions
Valence + Arousal from circumplex model
10-class CBT distortion probability simplex
Semantic dimensions occupy 99.2% — but ablation experiments prove clinical features are the core discriminative substrate.
Recovery is rewarded slowly; deterioration is penalized exponentially.
Dialogue windows become directed graphs encoding temporal clinical state evolution.
Each dialogue turn x_t maps to a graph node carrying the 1548-D state vector.
Cognitive distribution divergence between adjacent nodes + temporal penalty γ·Δt.
Approximate graph edit distance via Hungarian algorithm O(n³) for optimal template matching.
Peer support, counseling dialogue, and crisis-oriented interaction — three distinct mental-health scenarios.
EmpatheticDialogues · Peer Support
dialogue windows
ESConv · Counseling Dialogue
dialogue windows
CRADLE-Dialogue · Crisis Intervention
dialogue windows
Split per dataset: 600 train / 200 dev / 200 held-out test · Total: 3,000 windows
3 × 200 held-out test windows · Sorted ascending
Complete ConcatANN F1 = 0.9202 · Delta F1 when removing each feature
Semantic-only → F1 0.6559 (−28.7%) · Clinical-only → F1 0.8755 (−4.9%) · Full model → F1 0.9202
From brute-force retrieval to graph matching — Mean Macro-F1 on 3×200 held-out test.
Spatial + temporal dual-stream late fusion
Brute-force spatial retrieval, kNN + 1548-D
MTP-pretrained temporal Transformer
Gated attention retrieval variant
Learned clinical manifold angular metric
DESG-Ensemble achieves 0.9353 Mean F1 with 100% coverage and 100% sycophancy specificity.
Clinical state geometry is the core discriminative substrate: removing directionality causes −33.3% F1.
Distortion reinforcement alignment (0.93 F1) is the most reliable clinical audit anchor.
Large-scale clinical expert annotation and prospective clinical validation.
Multilingual and cross-cultural stress testing.
Pre-deployment prospective human-in-the-loop audit system.
Limitations: offline evaluation benchmark, not a clinical trial · Benchmarks contain construction artifacts · EITE is a stress-test diagnostic only.
Shenzhen MSU-BIT University, Shenzhen, China