
#10: Stephen Casper on Technical and Sociotechnical AI Safety Research
Failed to add items
Sorry, we are unable to add the item because your shopping cart is already at capacity.
Add to basket failed.
Please try again later
Add to wishlist failed.
Please try again later
Remove from wishlist failed.
Please try again later
Adding to library failed
Please try again
Follow podcast failed
Unfollow podcast failed
-
Narrated by:
-
By:
About this listen
Stephen Casper, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more.
Our music is by Micah Rubin (Producer) and John Lisi (Composer).
For a transcript and relevant links, visit the Center for AI Policy Podcast Substack.
No reviews yet