HMACS: Hierarchical Multi-Agent Coding System

Updated 24 December 2025

HMACS is a hierarchical multi-agent system that uses fixed-length integer coding to represent structured agent roles and enable efficient task delegation.
It employs bespoke genetic operators and structured communication protocols to maintain substructure integrity and accelerate convergence in complex environments.
Empirical analyses in domains like ICD coding and smart grids show that HMACS achieves higher macro- and micro-F1 scores compared to traditional flat or LLM-based architectures.

A Hierarchical Multi-Agent Coding System (HMACS) refers to any system in which a collection of agents is organized in a hierarchical structure for the efficient assignment, negotiation, or verification of codes or tasks, leveraging the advantages of layered delegation, explicit role allocation, and structured communication. HMACS frameworks have been developed for both the general design of multi-agent organizations and specialized domains, such as automated medical coding. Core elements include fixed-length or structured representations of agent hierarchies, specialized algorithms for optimizing agent interactions, and explicit coordination mechanisms operating at each layer, often with formal task routing, contract, or negotiation protocols. HMACS enables efficient handling of combinatorial organizational spaces, modular agent role development, and improved reliability, explainability, and rare-case performance in settings such as clinical coding, smart grids, and industrial process management (Shen et al., 2014, Li et al., 2024, Moore, 18 Aug 2025).

1. Formal Structure and Encoding of Hierarchical Organizations

HMACS formalizes agent hierarchies using layered abstractions and codings that permit efficient search and manipulation. For a system with $N$ leaf agents and maximum hierarchy depth $M$ , there exists a surjective and unique mapping: $f\,:\, \{\text{all valid hierarchies of }N\,\text{leaves, depth} \leq M\} \rightarrow \mathbb{Z}^{N-1}$ such that $f(H) = a = (a_1, \ldots, a_{N-1}),\, a_i \in \{1,\ldots,M\}$ , where each $a_i$ encodes the level at which leaves $i$ and $i+1$ first separate in the hierarchy. Every valid hierarchical organization is thus represented as a fixed-length integer array, supporting both surjectivity and uniqueness: every array corresponds to at least one hierarchy, and vice versa (Shen et al., 2014).

This encoding bypasses costly tree-based data structures in evolutionary or optimization routines, allowing the search over $B = \{1,\ldots,M\}^{N-1}$ to comprehensively and efficiently sweep the organizational space. In real-world deployments (e.g., healthcare ICD coding), the hierarchy aligns with the natural sequence of domain roles—patient, physician, coder, reviewer, adjustor—each agent performing specialist functions in the procedural workflow (Li et al., 2024).

2. Genetic Operators and Search Acceleration

HMACS implements bespoke genetic operators exploiting subtree semantics:

Hierarchical crossover: Given two parent genotypes, crossover involves swapping subarray segments defined by hierarchy-level nodes. Offspring length may diverge; a repair strategy removes random genes from the longer and re-inserts them in the shorter until both return to the valid length ( $N-1$ ).
Small-perturbation mutation: Instead of random reassignment, each gene $a_i$ is nudged by $+1$ or $-1$ with some probability $\mu$ , clamped to $[1,M]$ .

This operator design maintains semantic integrity of subtrees during recombination and restricts mutations to local structure. These choices accelerate convergence toward high-utility organizations while preserving viable building blocks and avoiding disruptive reorganizations (Shen et al., 2014). Exhaustive enumeration is avoided: although the encoded space is of size $M^{N-1}$ , well-structured search via HMACS locates competitive solutions in thousands rather than millions of evaluations.

3. Taxonomy and Coordination Mechanisms

A comprehensive taxonomy of HMACS axes encompasses control hierarchy, information flow, role/task delegation, temporal layering, and communication structure. Key definitions include partitioning agent sets $A = \bigcup_{l=1}^L A^l$ across layers, $𝒯$ as the set of coded or delegated tasks, and formally specified matrices for information flow $I$ , task delegation $D$ , and communication edges $C$ (Moore, 18 Aug 2025).

Canonical coordination mechanisms include:

Contract-net protocols: Task $\tau$ is announced by a manager; interested contractors respond with bids computed from utility models (e.g., residual capacity minus cost). The manager assigns $\tau$ to the strongest bidder, with explicit escalation and timeout procedures for failures.
Hierarchical RL (feudal MARL): Each hierarchical layer $l$ defines an MDP for its abstraction level; upper layers assign sub-goals or macro-actions to lower. Rewards at $l+1$ depend on achieving goals set by $l$ , formalized in coupled Bellman equations. This recursive form supports decomposable global objectives.
Auction/market-based coordination: Social welfare $W = \sum_{i,\tau} D(i,\tau) U_i(\tau)$ is maximized via centralized or decentralized bidding, with combinatorial auction extensions.

All task and agent communications are typically encoded in structured formats, such as TLV or JSON, with role- and layer-specific headers and payloads. Reliability and accountability are handled through acknowledgement protocols, error-handling routines, and, in critical systems, explicit human-in-the-loop mechanisms (Moore, 18 Aug 2025).

4. Application Paradigms: Automated Medical Coding and Industrial Systems

HMACS has been instantiated in diverse domains. In ICD medical coding, the pipeline consists of role-specialized LLM-based agents (e.g., Patient, Physician, Coder, Reviewer, Adjustor), exchanging structured JSON messages as they progressively reason over discharge summaries, SOAP-encoded records, and ICD-9 code candidates. Distinct workflow modes (MAC-I, MAC-II) orchestrate self-correction, confrontation, and final adjudication, with explicit agent-chain sequencing (Li et al., 2024).

Industrial examples include:

Smart grid balancing: Central dispatch agents, regional controllers, and distributed energy resource (DER) agents operate in a tri-level HMACS. RL and contract-net protocols effect top-down price signals and bottom-up load aggregation, with inter-layer coupling for subgoal and information flow.
Oilfield operations: Field operations centers delegate maintenance and production optimization to well agents in a two-level HMACS, combining contract-net task allocation with local RL-based flow controllers (Moore, 18 Aug 2025).

5. Performance, Metrics, and Empirical Analysis

Empirical results in hierarchical LLM-based ICD coding demonstrate that HMACS outperforms both flat zero-shot Chain-of-Thought (CoT) prompting and LLM-designed agent systems, especially on rare code accuracy and interpretability. For instance, on the MIMIC-III “Top 50” ICD codes:

MAC-II (hierarchical, SOAP-driven): Macro-F1 $=0.748$ (best)
MAC-I: Macro-F1 $=0.693$
CoT-SC baseline: Macro-F1 $=0.726$

For rare codes, MAC-II achieves Micro-F1 $=0.376$ , exceeding all baselines. Critical ablation studies further show that removing the confrontation strategy or external knowledge degrades rare-code performance (e.g., confrontation ablation $\downarrow$ Micro-F1 from $0.376$ to $0.361$). In evolutionary search settings, HMACS attains near-optimal utility with dramatically fewer fitness calls than exhaustive methods, due to representation and operator choices (Shen et al., 2014, Li et al., 2024).

Workflow/Mode	MIMIC-III Macro-F1	MIMIC-III Rare Micro-F1
MAC-II (HMACS)	0.748	0.376
MAC-I (HMACS)	0.693	0.366
LLM-designed agents	0.743	0.333
CoT-SC (baseline)	0.726	0.331

6. Open Challenges and Future Directions

Ongoing research in HMACS identifies critical challenges:

Explainability and Trust: Generating justifications at every hierarchy layer remains nontrivial; models of human-agent trust adaptation are an area of active study.
Scalability and Reconfiguration: Methods for self-organizing, dynamically reconfiguring HMACS as agent population sizes grow (to $10^4$ – $10^6$ ) are needed. This includes fluid, context-sensitive switching between centralized and decentralized modes, and meta-coordination for hierarchy evolution (e.g., holarchy formation).
Safe Integration of Learning-Based Agents: Ensuring that high-capacity neural agents (e.g., LLMs) remain constrained to avoid semantic errors (“hallucination”), possibly via explicit safety layers and governance mechanisms.
Human-in-the-Loop and Accountability: Enabling seamless inspection, override, and logging at all layers, with support for multi-modal interfaces and robust blame assignment frameworks.

A plausible implication is that the integration of increasingly sophisticated learning agents into HMACS will require explicit codification of safety, transparency, and meta-adaptivity constraints to preserve both local autonomy and global efficiency (Moore, 18 Aug 2025).

7. Summary and Research Trajectory

Hierarchical Multi-Agent Coding Systems have established a rigorous framework combining formal representations, role-specialized agent design, and advanced coordination algorithms, validated in both combinatorial optimization and application domains. HMACS enables tractable organizational search, interpretable and reliable agent sequencing, and modular adaptation. The trajectory of research involves expanding taxonomic rigor, refining explainability and safety, accommodating larger and more dynamic agent populations, and deepening integration between symbolic and learning-based agent architectures across industrial, healthcare, and other safety-critical sectors (Shen et al., 2014, Li et al., 2024, Moore, 18 Aug 2025).