Publications

Books

Value Alignment and Inverse Reinforcement Learning

Human-Robot Cooperation

Multi-Agent Perspectives and Applications

Models of Bounded or Imperfect Rationality

Cognitive Science, uncategorized

AI Capabilities, uncategorized

Ethics for AI and AI Development

Robust Inference, Learning, and Planning

Adversarial Training and Testing

Causal Modeling and Reasoning

Long-Term & Societal-Scale AI Risks

Security Problems and Solutions

Foundations of Rational Agency

Transparency & Interpretability