PokéAgent Logo

A NeurIPS 2025 competition advancing AI decision-making through Pokémon. Featuring competitive battling and RPG speedrunning tracks to unify research in reinforcement learning and large language models.

Competition Tracks

Click to explore detailed information, submission guidelines, starter code, and datasets for each track.

About the Competition

Scientific Relevance

The PokéAgent Challenge positions Pokémon as an ideal testbed for artificial intelligence research, offering two complementary tracks that address fundamental challenges in decision-making.

This competition addresses critical frontiers in AI research at the intersection of reinforcement learning, game theory, planning, and language models. It creates a standardized benchmark for opponent modeling under partial observability and long-horizon reasoning—two capabilities essential for advancing AI beyond controlled environments.

Key Research Areas

  • Opponent Modeling: Track 1 requires sophisticated opponent modeling under partial observability.
  • Long-Horizon Planning: Track 2 challenges agents to maintain coherent planning across thousands of timesteps.
  • Strategic Adaptation: Both tracks require agents to generalize across varied scenarios and adapt to novel situations.
  • Knowledge Integration: Opportunity to develop methods that augment decision-making with existing reference materials.

Ready to Get Started?

Join the PokéAgent Challenge Discord server to register and connect with other participants!

Join Our Discord Server

Organizing Team

Seth Karten

Seth Karten

Princeton University

Jake Grigsby

Jake Grigsby

UT Austin

Stephanie Milani

Stephanie Milani

Carnegie Mellon University

Kiran Vodrahalli

Kiran Vodrahalli

Google DeepMind

Amy Zhang

Amy Zhang

UT Austin

Fei Fang

Fei Fang

Carnegie Mellon University

Yuke Zhu

Yuke Zhu

UT Austin

Chi Jin

Chi Jin

Princeton University