Blog
For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria
Scott Emmons 05 Oct 2022
When AI systems are deployed in the real world, many cooperating AI agents will share the same source code or neural network weights. This motivates the study of symmetric team theory. In this talk, Scott shares the results of a new CHAI research paper: For Learning in Symmetric Teams, Local Optima are Global Nash Equilibria. There’s a mix of good and bad news, showing conditions when symmetric cooperation is both stable and unstable.
Designing Societally Beneficial Reinforcement Learning Systems
Tom Gilbert 10 Aug 2022
Many are concerned about the future long-term implications of reinforcement learning (RL) systems that can learn dynamically from interaction with human environments. However, RL systems are already being used today and proposed in a variety of near-term applications. For example, Deep RL is transitioning from a research field focused on game playing to a technology with real-world applications. Notable examples include DeepMind’s work on controlling a nuclear reactor or on improving Youtube video compression, or Tesla attempting to use a method inspired by MuZero for autonomous vehicle behavior planning. The exciting potential for real world applications of RL are also a harbinger for longer-term risks – for example RL policies are well known to be vulnerable to exploitation, and methods for safe and robust policy development are an active area of research.
How Platform Recommenders Work
Jonathan Stray 09 Feb 2022
A recommender system (or simply ‘recommender’) is an algorithm that takes a large set of items and determines which of those to display to a user—think the Facebook News Feed, the Twitter timeline, Google News, or the YouTube homepage. Recommenders are necessary tools to help navigate the sheer volume of content produced each day, but their scale and rapid development can cause unintended consequences. Facebook’s algorithms have been blamed for radicalizing users, TikTok’s for inundating teens with eating-disorder videos, and Twitter’s for political bias.