PokéAgent Logo

A NeurIPS 2025 competition advancing AI decision-making through Pokémon. Featuring competitive battling and RPG speedrunning tracks to unify research in reinforcement learning and large language models.

⏰ Competition Ends In

-- Days
-- Hours
-- Minutes
-- Seconds

Final submissions close October 31st, 2025

Competition Tracks

Click to explore detailed information, submission guidelines, starter code, and datasets for each track.

About the Competition

Scientific Relevance

The PokéAgent Challenge positions Pokémon as an ideal testbed for artificial intelligence research, offering two complementary tracks that address fundamental challenges in decision-making.

This competition addresses critical frontiers in AI research at the intersection of reinforcement learning, game theory, planning, and language models. It creates a standardized benchmark for opponent modeling under partial observability and long-horizon reasoning—two capabilities essential for advancing AI beyond controlled environments.

Key Research Areas

  • Opponent Modeling: Track 1 requires sophisticated opponent modeling under partial observability.
  • Long-Horizon Planning: Track 2 challenges agents to maintain coherent planning across thousands of timesteps.
  • Strategic Adaptation: Both tracks require agents to generalize across varied scenarios and adapt to novel situations.
  • Knowledge Integration: Opportunity to develop methods that augment decision-making with existing reference materials.

Ready to Get Started?

Join the PokéAgent Challenge Discord server to register and connect with other participants!

Join Our Discord Server Apply for Compute Credits

Compute Credit Application (Recommended for Students): Apply to receive GCP credits for cloud compute and Gemini API access. Roughly $100+ per team (pending submissions). Applications will be reviewed and distributed on a rolling basis until funds are depleted.

Prizes & Recognition

Multiple prize categories ensure diverse contributions are recognized and rewarded

Highest Ranking Teams

Performance-based prizes for teams placing high on the leaderboard in each track. Top performers in Track 1 tournament brackets and Track 2 speedrun rankings will receive prizes and recognition.

Research Impact Awards

Method-specific prizes recognizing innovative approaches across all AI paradigms. Categories may include: Best LLM-based method, Best RL method, or other breakthrough techniques.

NeurIPS 2025 Presentation

Workshop invitations for winning teams to present at NeurIPS 2025. Selected teams will also be invited as co-authors on the official competition report publication.

$10,000+ Total Prize Pool

Distributed across multiple categories to reward both excellence and innovation

Our Sponsors

Organizing Team

Seth Karten

Seth Karten

Princeton University

Jake Grigsby

Jake Grigsby

UT Austin

Stephanie Milani

Stephanie Milani

Carnegie Mellon University

Kiran Vodrahalli

Kiran Vodrahalli

Google DeepMind

Amy Zhang

Amy Zhang

UT Austin

Fei Fang

Fei Fang

Carnegie Mellon University

Yuke Zhu

Yuke Zhu

UT Austin

Chi Jin

Chi Jin

Princeton University

Cite This Work

If you use this competition in your research, please cite our paper:

@inproceedings{karten2025pokeagent,
  title        = {The PokeAgent Challenge: Competitive and Long-Context Learning at Scale},
  author       = {Karten, Seth and Grigsby, Jake and Milani, Stephanie and Vodrahalli, Kiran
                  and Zhang, Amy and Fang, Fei and Zhu, Yuke and Jin, Chi},
  booktitle    = {NeurIPS Competition Track},
  year         = {2025},
  month        = apr,
}