ED Triage GNNs: Optimizing Emergency Care
- ED triage GNNs are graph neural network models that capture patient encounters, clinical observations, and care pathways in emergency settings.
- They use link prediction on multi-relational knowledge graphs with domain-informed masks to improve the accuracy of care action predictions.
- Patient-similarity graphs with optimized thresholds enable effective node classification, supporting real-time triage decisions and resource management.
Emergency Department (ED) triage Graph Neural Networks (GNNs) refer to the application of GNN architectures to optimize, automate, and analyze the patient triage process within emergency medical settings. These approaches leverage the expressive capability of GNNs to capture heterogeneous, multi-relational, and high-dimensional patterns in clinical data, enabling robust modeling of patient interactions, institutional resource allocation, and care pathways. The two principal streams in the literature are graph-structured link prediction for multi-modal knowledge graphs as formulated by Valls et al. (Valls et al., 2023), and patient-similarity graph classification as exemplified by Defilippo et al. (Defilippo et al., 2024).
1. Formal Mathematical Frameworks
ED triage GNNs instantiate two principal modeling paradigms:
a) Link Prediction on Multi-relational Knowledge Graphs
Let denote a directed, multi-relational KG, where nodes represent patients, encounters, conditions, observations, and care actions; every edge with label and indicator . The embeddings matrix encodes node features, with trainable relational weights .
Crucially, two adjacency matrices are introduced:
- encodes factual KG topology ( if ),
- is the designer-specified GNN connectivity (communication mask), decoupled from to enforce clinically valid information flow.
A common message passing layer (Relational-GCN) updates node embeddings via:
Final scores for link prediction are computed as:
b) Node Classification on Patient-Similarity Graphs
Each ED visit (episode) is a node with feature vector , summarizing demographics, vital signs, and select history. Weighted edges are defined by patient-patient similarity , derived from distance/similarity metrics (cosine, Manhattan, Euclidean, Minkowski), with threshold controlling graph density.
The GNN (GCN, GATv2, GraphSAGE) propagates information through the patient graph via:
The inductive variant, GraphSAGE, employs neighborhood pooling and feature concatenation to enable inference on new nodes at triage.
2. Construction of ED Graphs: Nodes, Edges, and Features
Multi-modal Clinical KGs
- Node types: Patient, Encounter, Observation, Condition, CareAction
- Edge types: P→E, E→O, E→C, E→CA, direction encodes medical workflow
- Features: Node embeddings initialized per type, updated via message passing
Patient-similarity Graphs
- Nodes: Each represents a historical ED visit, characterized by a standardized 16-feature vector (e.g., age, blood pressure, smoking status)
- Edges: Weighted undirected; existence and strength determined by similarity (e.g., for cosine)
- Node labels: True triage code (Red, Orange, Yellow, Green)
3. Message Passing, Connectivity, and Information Flow
The definition of GNN connectivity () is a critical design variable. In the multi-relational context, Valls et al. demonstrate that domain-informed masks (e.g., only allow information flow Condition/Observation→Encounter, Patient→Encounter, Encounter→CareAction) yield higher accuracy (avg. $0.88$) versus using the raw KG ($0.43$) or bidirectional edges introducing test-time leakage ($0.19$).
For patient-similarity graphs, the choice of similarity metric and threshold modulates information propagation, with optimal values maximizing GNN performance before the graph becomes too dense (Defilippo et al., 2024).
GATv2 layers, employing attention coefficients , and GraphSAGE—using neighborhood pooling—both enable the model to capture complex patient-patient influences in the node classification paradigm.
4. Role of Negative Edges, Training Objective, and Loss Functions
In real-world KGs such as Synthea, only positive clinical events are recorded. Valls et al. propose systematic negative edge augmentation: for each encounter with observed care action , create explicit negatives for all . The logistic loss over positive and negative links:
enables robust contrastive learning; removing negative edges severely degrades accuracy from $0.88$ to $0.60$ (Valls et al., 2023).
In the node classification setting, class-balancing (with SMOTE+ENN) and categorical cross-entropy loss are applied:
ensuring stability and convergence during model optimization (Defilippo et al., 2024).
5. Model Architectures, Training Protocols, and Empirical Results
Key architectural and training choices include:
- Relational GCN (KG link prediction): 2-layer encoder, embedding dimension , binary logistic loss, Adam optimizer with learning rate schedule (), weight decay , training for $1000$ epochs, early stopping against validation metrics (Valls et al., 2023).
- GraphSAGE (patient similarity): 5 layers, dims , dropout , Adam optimizer (), up to 200 epochs, early stopping, inductive inference capability (Defilippo et al., 2024).
- GCN and GAT baselines: GCN (Kipf-Welling rule, up to 5 layers), GATv2 (Brody et al. 2021), both with architectures calibrated to input and output dimensions tailored for 4-class prediction.
Empirically, GraphSAGE on cosine-similarity graphs () achieves accuracy, outperforming SVM () and KNN () on raw features. R-GCN with domain-informed connectivity achieves accuracy $0.88$ on the Synthea-based triage link-prediction task, with performance degrading if the model is made deeper (over-smoothing) or negative edges are omitted (Valls et al., 2023, Defilippo et al., 2024).
Selected Results Table
| Model | Task | Test Accuracy | Data Source & Graph Type |
|---|---|---|---|
| R-GCN (L=2, d=5) | Link Prediction | 0.88 | Synthea KG, C4 connectivity |
| R-GCN (raw KG only) | Link Prediction | 0.43 | Synthea KG, C1 connectivity |
| GraphSAGE | Node Classification | ~0.89 | Patient-similarity (cosine, τ=0.95) |
| SVM, Tabular | Node Classification | 0.824 | 16-feature tabular |
6. Integration with Clinical Triage Workflows and Deployment
Deployment for ED triage divides into procedural steps:
- Clinical KG: Extract and encode EHR modalities for patients, encounters, observations, conditions, and care actions. Incorporate negative samples by generating all non-observed care actions per encounter as negatives. Define communication mask to enforce domain-compliant information flow (e.g., allow only O/C→E, P→E, E→CA).
- Patient-similarity graph: On live triage, collect the 16 standardized features for a new arrival, preprocess as per training, insert as node in the similarity graph (edges to historical visits exceeding similarity threshold), and forward-pass through the trained GraphSAGE model to assign a triage code from the output probability distribution.
Advantages of these GNN-based pipelines include reproducibility, independence of individual nurse subjectivity, utilization of embedded clinical knowledge, adaptability to inductive node arrival (for patient classification), and a demonstrated gain (5–10%) in accuracy over rule-based or standard ML pipelines (Valls et al., 2023, Defilippo et al., 2024).
7. Practical Guidelines and Limitations
To maximize ED triage GNN utility:
- Explicitly separate factual graph structure from GNN communication by masking , preventing test-time label leakage and focusing information flow on meaningful clinical relationships.
- Systematically generate negative samples if not recorded, pairing each encounter with all non-assigned downstream actions.
- Use moderate GNN depth () and embedding dimension (); over-deep GNNs suffer from over-smoothing.
- Tune similarity threshold to balance graph density and predictive signal in patient-similarity approaches.
- Early stopping and batch strategies: monitor for over-smoothing (notably beyond in R-GCNs) and use whole-graph or node-pair minibatches as appropriate.
These constraints and heuristics are validated in repeated-split experiments using both synthetic (Synthea) and real-world ED datasets, providing a robust foundation for scalable, interpretable, and clinically compatible ED triage automation (Valls et al., 2023, Defilippo et al., 2024).