Bay Area Alignment Workshop
Santa Cruz, CA
24–25 October, 2024

Program Committee
Bay Area Alignment Workshop
The Bay Area Alignment Workshop was held 24–25 October, 2024, at Chaminade in Santa Cruz, and featured Anca Dragan speaking on Optimised Misalignment. Participants additionally explored topics such as threat models, safety cases, monitoring and assurance, interpretability, robustness, and oversight.
The Alignment Workshop series brings together top machine learning researchers and practitioners from industry, academia, and government. The workshop focuses on discussing and debating critical topics related to AI alignment, enabling participants to better understand potential risks from advanced AI, and strategies for solving them. Key issues discussed include model evaluations, interpretability, robustness, and AI governance.






























