Publications

Value Alignment and Inverse Reinforcement Learning

Long-Term Risks From AI

Human-Robot Cooperation

Theories of (Bounded) Rationality