[SPIKE] NLE vs MiniHack benchmark lane for Timmy #35
Reference in New Issue
Block a user
Delete Branch "%!s()"
Deleting a branch is permanent. Although the deleted branch may continue to exist for a short time before it actually gets removed, it CANNOT be undone in most cases. Continue?
Source: follow-on from #33 after deciding to treat NLE/MiniHack as the benchmark lane rather than Timmy’s home.
Goal:
Run a sharper spike on NLE/MiniHack as the “already-solved” research environment for agent gameplay benchmarking.
What we know so far:
Why this matters:
Acceptance:
Decision lens:
Sharper spike result:
Maintained upstreams:
Current signals:
Important practical detail:
python -m nle.scripts.playpython -m minihack.scripts.play --env ...Interpretation:
Recommendation:
Why MiniHack first:
Suggested first benchmark classes:
Boundary clarification:
Uniwizard (#94) context: NLE/MiniHack benchmark lane — deprioritized. Evennia is the framework. If we want RL benchmarks later, file a new issue.