Central AI alignment problem: capabilities generalization and sharp left turn

(This post was factored out of a larger post that I (Nate Soares) wrote, with help from Rob Bensinger, who also rearranged some pieces and added some text to smooth things out. I’m not terribly happy…
