Can computer-use agents perform real professional work?
Determine whether current computer-use agents (graphical user interface agents that operate software via mouse and keyboard) can successfully execute real professional workflows, which are long-horizon, heterogeneous, and span diverse software configured with domain-specific data.
References
Yet whether these agents can handle real professional work remains an open question.
— Gym-Anything: Turn any Software into an Agent Environment
(2604.06126 - Aggarwal et al., 7 Apr 2026) in Introduction (Section 1)