Friday Aug 02, 2024

#10: Stephen Casper on Technical and Sociotechnical AI Safety Research

Stephen Casper, a computer science PhD student at MIT, joined the podcast to discuss AI interpretability, red-teaming and robustness, evaluations and audits, reinforcement learning from human feedback, Goodhart’s law, and more.

Our music is by Micah Rubin (Producer) and John Lisi (Composer).

For a transcript and relevant links, visit the Center for AI Policy Podcast Substack.

Copyright 2024 All rights reserved.
