Mitigating Direct Prompt Injection in Agent Skills
Develop robust defenses against direct prompt injection attacks in the Agent Skills framework, where adversarial instructions embedded in SKILL.md are interpreted with operator-level authority, and ascertain the architectural reforms required to enable effective mitigation given the absence of a formal behavioral specification.
References
Direct injection therefore remains an open problem that cannot be fully addressed within the current architectural framework.
— Towards Secure Agent Skills: Architecture, Threat Taxonomy, and Security Analysis
(2604.02837 - Li et al., 3 Apr 2026) in Section 7.1, Defense Directions (Against Prompt Injection (T3))