Arguments about Highly Reliable Agent Designs as a Useful Path to AI Safety

This paper is a revised and expanded version of my blog post Plausible cases for HRAD work, and locating the crux in the “realism about rationality” debate, now with David Manheim as co-author. … Read more

Similar