Publications

Value alignment and inverse reinforcement learning

Long-term risks from AI

Human-robot cooperation

Theories of (bounded) rationality