Bay Area Alignment Workshop

Santa Cruz, CA

24–25 October, 2024

Program Committee

Anca Dragan

Director, AI Safety and Alignment, Google DeepMind; Associate Professor, UC Berkeley

UC Berkeley,Google DeepMind

Robert Trager

Co-Director

Oxford Martin AI Governance Initiative

Dawn Song

Professor

UC Berkeley

Dylan Hadfield-Menell

MIT

Adam Gleave

Co-founder & CEO

FAR.AI

Overview

The Bay Area Alignment Workshop was held 24–25 October, 2024, at Chaminade in Santa Cruz, and featured Anca Dragan speaking on Optimised Misalignment. Participants additionally explored topics such as threat models, safety cases, monitoring and assurance, interpretability, robustness, and oversight.

The Alignment Workshop series brings together top machine learning researchers and practitioners from industry, academia, and government. The workshop focuses on discussing and debating critical topics related to AI alignment, enabling participants to better understand potential risks from advanced AI, and strategies for solving them. Key issues discussed include model evaluations, interpretability, robustness, and AI governance.

Speakers

No items found.

Bay Area Alignment Workshop sessions

Stay tuned for recordings from Bay Area Alignment Workshop! Full sessions will be posted here, and on our Youtube channel, or you can follow us on X, LinkedIn, or Bluesky to hear about it when they go live.