Potential Harms, Misuse, and the Alignment Problem
Section 1 - WHY
Topic 3.1: Unintended Harms -- Second and Third-Order Effects
Presentation 5: Unintended Harms — Bias and Discrimination
Topics 3.2-3.3: Intentional Misuse and the Alignment Problem
Presentation 6: Intentional Misuse and the Alignment Problem
AI-assisted security audit using Gemini CLI on a deliberately insecure chatbot API (QuickChat).
Hendrycks - Introduction to AI Safety, Ethics, and Society
Chapter 1: Overview of Catastrophic AI Risks
Synthesized from instructor research and student contributions. Full resource cards with descriptions available on the Resources page.