Dylan Hadfield-Menell, UC Berkeley
View original resourceHadfield-Menell's Berkeley technical report formalises AI alignment as a principal-agent problem with incomplete contracts, drawing on mechanism design. Introduces inverse reward design and cooperative inverse reinforcement learning as alignment approaches.
Published
2021
Jurisdiction
United States
Category
Governance frameworks
Access
Public access
VerifyWise helps you implement AI governance frameworks, track compliance, and manage risk across your AI systems.