Track 2 Logo
This track challenges agents to complete a full Pokémon role-playing game (Pokémon Emerald) as quickly and efficiently as possible, navigating a massive, partially observable world with hundreds of NPCs and thousands of possible actions.

Long-horizon planning, efficient exploration, and strategic resource management are critical to succeeding in this track. Agents must learn to balance immediate objectives with long-term strategic goals, making decisions that span thousands of timesteps while adapting to the unpredictable nature of RPG gameplay.

The speedrunning challenge pushes AI systems to their limits in sequential decision-making, requiring sophisticated planning algorithms and efficient resource management to achieve optimal completion times in complex, open-world environments.

⏰ Competition Ends In

-- Days
-- Hours
-- Minutes
-- Seconds

Track 2 Submissions end October 31st, 2025

Starter Kits

Our starter kit provides a real-time agent loop with modular components for perception (game frame recognition), planning & memory (long term vs. short term goals, knowledge storage), and control (gameboy emulator action execution).


What's Included

  • Agent Scaffolding: Modular framework for building Pokémon Emerald speedrunning agents
  • Pokémon Emerald Wrapper: Custom emulator API for real-time game interaction
  • Baseline Implementation: Reference agent with VLM setup and basic planning
  • Evaluation Tools: Automated testing and performance measurement

Compute Credits Available

Recommended for students: Apply to receive GCP credits for cloud compute and Gemini API access. Roughly $100+ per team (pending submissions).

Apply for Compute Credits

Applications will be reviewed and distributed on a rolling basis until funds are depleted.

Submission Guidelines

How to Submit for Track 2

Submissions for this track focus on achieving maximum game completion under time constraints. Your agent must interact exclusively through our custom Pokémon Emerald emulator API. Use any method, as long as the final action comes from a neural network.

Important: All submissions will undergo anti-cheat verification to ensure fair competition. This includes validation of agent behavior, action logs, and verification that submissions follow the competition rules.

Submission Requirements

  • Code Archive: Your agent implementation as a ZIP or TAR.GZ file including all dependencies and README
  • Action & State Logs: Detailed logs created by the starter kit repository during your agent's run
  • Video Evidence: YouTube link to a screen recording showing the complete speedrun

Evaluation Criteria

  • Milestone Completion: Percentage of game milestones accomplished
  • Completion Time: Time taken to complete achieved milestones
  • Reproducibility: Clear documentation and reproducible results

Timeline

June 11th, 2025

Competition Website Launch

Official competition website goes live with preliminary documentation.

June 25th, 2025

Formal Competition Announcement

Full rules and track timeline announced. Starter code with a baseline RPG agent (scaffolding and VLM setup) and emulator API available for beta testers.

July 7th, 2025

Competition Begins

Track 2 Competition Begins. Submit runs of your Pokémon Emerald agent to the leaderboard.

October 31st, 2025

Results Announcement

Final results announced, winners notified.

December 2025

NeurIPS 2025 Presentation

Winners present their solutions at NeurIPS 2025.