Parables on the Power of Planning in AI: From Poker to Diplomacy: Noam Brown (OpenAI)
- planning significantly improves machine learning system performance
- planning in machine learning systems can be implemented with search methods, e.g. Monte Carlo search
- planning is useful in domains with a large generator-verifier gap
- a generator-verifier gap exists when it’s much easier to verify a solution than it is to generate one
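
To make the generator-verifier gap concrete, here is a minimal sketch using subset sum as a toy problem. The problem choice and the names `verify` and `generate` are illustrative assumptions, not taken from the talk. Checking a proposed subset takes one pass over it, while finding one by brute force may require trying up to 2^n - 1 subsets.

```python
from itertools import combinations


def verify(numbers, subset, target):
    """Cheap check: is `subset` drawn from `numbers`, and does it sum to `target`?"""
    pool = list(numbers)
    for x in subset:
        if x not in pool:
            return False
        pool.remove(x)  # each element of `numbers` may be used only once
    return sum(subset) == target


def generate(numbers, target):
    """Expensive search: enumerate subsets by size until one passes `verify`.

    Returns (subset, checks), where `checks` counts verifier calls.
    Brute force is an assumption made for illustration; it stands in
    for whatever search or planning procedure a real system would use.
    """
    checks = 0
    for size in range(1, len(numbers) + 1):
        for subset in combinations(numbers, size):
            checks += 1
            if verify(numbers, subset, target):
                return list(subset), checks
    return None, checks


numbers = [3, 34, 4, 12, 5, 2]
solution, checks = generate(numbers, 9)
print("solution:", solution, "found after", checks, "verifier calls")
```

A single call to `verify` is cheap, but `generate` has to call it many times. That asymmetry is what makes spending extra computation on search worthwhile in this kind of domain.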