Safety and Safety Testing for Advanced Autonomous AI Systems

Develop and validate methods to make advanced autonomous AI systems safe and to properly test their safety prior to deployment.

Background

The authors stress that if advanced autonomous AI systems were built today, the field lacks the know-how to make them safe or to properly test their safety. This underscores the need for foundational safety research and the development of robust testing methodologies.

This open problem is tied to the urgency of creating governance and technical measures that can keep pace with rapid AI progress, ensuring that safety is demonstrably addressed before deployment.

References

If advanced autonomous AI systems were developed today, we would not know how to make them safe, nor how to properly test their safety.

Managing extreme AI risks amid rapid progress  (2310.17688 - Bengio et al., 2023) in Subsection A path forward