Papers
Topics
Authors
Recent
Search
2000 character limit reached

Agentic AI for Autonomic Management

Updated 23 February 2026
  • Agentic AI for autonomic management is defined by integrating LAMs, persistent memory, automated planning, tool-use interfaces, and multi-agent collaboration for self-managing systems.
  • Architectures employ reflection, planning, and tool-use patterns within frameworks like MAPE-K to enable dynamic reconfiguration and real-time anomaly detection.
  • Mathematical models and reinforcement learning underlie safe, optimal decision-making validated across mobile networks, business processes, and IT operations.

Agentic AI for Autonomic Management

Agentic AI for autonomic management represents an advanced paradigm wherein autonomous agents, powered by large AI models (LAMs), orchestrate the configuration, optimization, and adaptation of complex systems with minimal human intervention. By integrating multimodal perception, hierarchical planning, persistent memory, self-reflection, tool-use, and multi-agent collaboration, Agentic AI extends classical agents to realize the principles of autonomic computing—enabling fully self-managing infrastructure in domains such as mobile networks, business processes, IT infrastructure, anomaly management, and beyond (Pellejero et al., 4 Nov 2025).

1. Foundational Principles and Unified Definition

Agentic AI advances the capabilities of traditional AI agents by unifying several design patterns:

  • Foundation Models (LAMs): Serve as the engines for perception and reasoning, encompassing large language, vision, and retrieval models.
  • Persistent Memory (MM): Retains long-horizon context, past states, and reasoning traces.
  • Automated Planning (PP): Hierarchical goal decomposition leveraging LAM-driven rollout simulation.
  • Reflection Loops: Iterative self-evaluation and hypothesis refinement based on observed effects and predicted outcomes.
  • Tool-Use Interfaces (TT): Structured APIs allow agents to ingest data, perform analytics, and effect configuration changes.
  • Multi-Agent Collaboration (CC): Specialized agents interact and coordinate, often via lightweight protocols (e.g., JSON RPC over gRPC).

Formally, an Agentic AI system is characterized by the tuple A=G,S,M,P,T,CA = \langle G, S, M, P, T, C \rangle:

  • GG: set of high-level goals (e.g., maximize throughput)
  • SS: state space (system KPIs, configuration, faults)
  • MM: memory for context and trace retention
  • PP: planning module mapping (S,GS, G) to task hierarchies
  • TT: tool interfaces for operations
  • CC: communication for peer agent interaction

Classical agents lack the persistent memory, hierarchical planning, deep collaboration, and native LAM integration present in agentic systems (Pellejero et al., 4 Nov 2025).

2. Architectures and Design Patterns

Agentic AI architectures instantiate key patterns:

  • Reflection Pattern: Implements self-evaluation after action, retrieving memory to detect drift and applying a refinement operator refine(h,data)h\text{refine}(h, \text{data}) \to h'.
  • Planning Pattern: Decomposes goals into subtasks {τ1,,τn}\{\tau_1, \dots, \tau_n\}, scores action sequences using LAMs, and utilizes Hierarchical Task Networks with dynamic branching.
  • Tool-Use Pattern: Standardized interfaces (e.g., DataAccessTool, TimeSeriesAnalysisTool, ConfigActionTool) enable data ingestion, analytics, and actuation. LAMs generate structured tool calls.
  • Multi-Agent Collaboration Pattern: Specialized agents (e.g., AnalysisAgent, PlannerAgent, ValidationAgent) coordinate via orchestrators, leveraging message-passing protocols for robust system-wide management.

These patterns are embedded in layered or modular architectures, often grounded in the MAPE-K (Monitor-Analyze-Plan-Execute-Knowledge) framework, as seen in autonomic network, process management, microservices, and business model deployments (Dumas et al., 25 Jan 2026, Esposito et al., 27 Jun 2025, Pellejero et al., 4 Nov 2025, Bohnsack et al., 19 Jun 2025).

3. Mathematical Foundations for Autonomic Decision-Making

Agentic AI operationalizes autonomic management using mathematical constructs and formal optimization:

  • Time-Series Anomaly Detection:

μi(t)=1Wτ=tW+1txi(τ), σi(t)=1W(xi(τ)μi)2\mu_i(t) = \frac{1}{W} \sum_{\tau=t-W+1}^t x_i(\tau),\ \sigma_i(t) = \sqrt{\frac{1}{W} \sum (x_i(\tau) - \mu_i)^2}

An anomaly is flagged if xi(t)μi(t)>kσi(t)|x_i(t) - \mu_i(t)| > k \cdot \sigma_i(t).

  • Decision Policy (Reinforcement Learning):

st=[x1(t),,xN(t),configurationt]s_t = [x_1(t), \ldots, x_N(t), \text{configuration}_t]

The agent learns a policy π(as)\pi(a|s) approximated via a value function Qθ(s,a)Q_\theta(s,a) (MLP or Transformer), updated by gradient descent on the expected reward R(s,a)R(s,a).

  • Multi-Agent Optimization:

For cell ii, the network-wide utility:

U=iwif(Ti,Li),f(T,L)=αlog(1+T)βLU = \sum_{i} w_i f(T_i, L_i),\quad f(T, L) = \alpha \log(1+T) - \beta L

Agents collaboratively maximize U(param)U(\text{param}) subject to constraints (e.g., power, interference), approximating gradients online.

π=argmaxπE[t=0γtR(st,at)]\pi^* = \text{argmax}_\pi \mathbb{E} \left[ \sum_{t=0}^{\infty} \gamma^t R(s_t, a_t) \right]

subject to transition dynamics and constraint set C(st,at)C(s_t, a_t) (Dumas et al., 25 Jan 2026, Pellejero et al., 4 Nov 2025).

4. End-to-End Workflows: Mobile Network and Business Process Use Cases

Practical deployments exemplify distributed, closed-loop workflows:

Step RAN (5G/6G) Example Business Process Example
Sensing DataAccessTool fetches KPIs Data Layer ingests event logs
Analytics/Anomaly Detection TimeSeriesAnalysisTool computes stats Process intelligence mines anomalies
Planning/Orchestration LAM Reasoning, task decomposition Orchestration layer plans interventions
Action ConfigActionTool effects reconfiguration Action layer executes interventions
Reflection/Validation Outcome/KPI check, rollback by ValidationAgent Performance metrics monitored, plans iteratively refined
Learning/Memory Q-function/adapter fine-tuning, RAG memory Predictive models updated, memory embeds outcomes

In the RAN case, multi-agent collaboration is crucial: AnalysisAgent hypothesizes root cause, RetrievalAgent fetches cases, ValidationAgent verifies results, all overseen by a MasterOrchestratorAgent (Pellejero et al., 4 Nov 2025). In business processes, autonomous process mining detects deviations, digital twins simulate “what-if” impacts, and agentic automation adapts execution (Dumas et al., 25 Jan 2026).

5. Continuous Learning, Context Retention, and Safety

Agentic AI systems sustain adaptivity and safety by leveraging:

  • Persistent Memory (MM): Retention of trajectories, reasoning, and successful policies.
  • Retrieval-Augmented Generation (RAG): Recall of semantically similar episodes to inform current decision-making.
  • Continuous Learning: Online Q-function and LAM adapter updates using streaming data.
  • Safety Protocols and Constraints: Embedding operational boundaries (power, interference) and automated rollback. Verification mechanisms ensure that proposed actions cannot violate safety properties under any predicted scenario.

Challenges such as computational footprint (especially LAMs at the edge), secure communication, non-stationary learning stability, and efficient multi-agent orchestration are open research areas (Pellejero et al., 4 Nov 2025, Dumas et al., 25 Jan 2026).

6. Challenges, Limitations, and Open Research Questions

Unresolved issues and research vectors for agentic autonomic management include:

  • Scalability of Agentic Systems: Efficient coordination across thousands of heterogeneous agents while maintaining sub-second response times and system-wide safety (Singh et al., 17 Oct 2025).
  • Secure and Private Collaboration: Enforcing zero-trust, sandboxed interactions, and secure protocol extensions beyond natural language (e.g., Model Context Protocol).
  • Learning Guarantees: Ensuring stability and convergence of continuous adaptation, particularly under distributional drift.
  • Formal Verification: Embedding formal safety and liveness proofs into planner and validator agents.
  • Human-in-the-Loop Governance: Incorporating override and audit trails, and determining optimal placement of human verification in the operational loop (Dumas et al., 25 Jan 2026).
  • Impact Measurement and Multi-Dimensional Metrics: Designing benchmarks that measure capability, robustness, safety, human-centeredness, and cost efficiency (e.g., with Adaptive Multi-Dimensional Monitoring) (Shukla, 28 Aug 2025).
  • Architecture Generalization: Applying agentic autonomic management to other complex systems—intelligent buildings, supply chains, and industrial automation—by coupling physics-informed digital environments (“bodies”) with multi-agent orchestration layers (“brains”) (Jiang et al., 27 Jan 2026).

7. Impact Across Domains

Agentic AI-driven autonomic management is operationalized in:

  • Mobile Networks: RAN optimization, fault isolation, KPI-driven configuration, and lifecycle management under uncertain dynamics (Pellejero et al., 4 Nov 2025, Singh et al., 17 Oct 2025).
  • Business Process Management: Round-trip feedback between process mining, predictive/prescriptive models, and autonomous interventions, supporting the entire spectrum from human-driven to fully autonomous process execution (Dumas et al., 25 Jan 2026).
  • Microservice and IT Operations: MAPE-K+agentic enhancements for distributed anomaly detection, risk-sensitive remediation, and adaptive runbook generation (Esposito et al., 27 Jun 2025).
  • Enterprise Automation: Turnkey agentic frameworks for iterative system optimization, achieving self-configuration, self-optimization, self-monitoring, and self-healing without manual intervention (Yuksel et al., 2024, Okpala et al., 8 Feb 2025).
  • Energy Systems and Smart Infrastructure: Multi-agent orchestration for grid-interactive building management, tightly coupled with modular physics-based digital twins (Jiang et al., 27 Jan 2026).

These deployments demonstrate marked improvements in detection/repair latency, operational cost, robustness, explainability, and the ability to adapt in non-stationary environments.


Agentic AI systems, through LAM-based reasoning, planning, reflection, tool-use, persistent memory, and multi-agent collaboration, instantiate the principles of autonomic management. Their architecture and mathematical formalization, as demonstrated across real-world cases, provide a compelling substrate for the automation and self-governance of increasingly complex, dynamic infrastructures (Pellejero et al., 4 Nov 2025, Dumas et al., 25 Jan 2026, Murad et al., 20 Sep 2025, Singh et al., 17 Oct 2025, Esposito et al., 27 Jun 2025, Yuksel et al., 2024).

Topic to Video (Beta)

No one has generated a video about this topic yet.

Whiteboard

No one has generated a whiteboard explanation for this topic yet.

Follow Topic

Get notified by email when new papers are published related to Agentic AI for Autonomic Management.