bridgewalk

Visual reinforcement learning benchmark for controllability

MIT License


BridgeWalk

BridgeWalk is a partially observed reinforcement learning environment whose dynamics vary in stochasticity. The player needs to walk along a bridge to reach a goal location. If the player walks off the bridge into the water, the current moves it randomly until it is washed back onto the shore. A good agent in this environment avoids this stochastic trap. The implementation of BridgeWalk is based on the Crafter environment.

Play Yourself

You can play the game yourself with an interactive window and keyboard input. The mapping from keys to actions, health level, and inventory state are printed to the terminal.

# Install with GUI
pip3 install 'bridgewalk[gui]'

# Start the game
bridgewalk

# Alternative way to start the game
python3 -m bridgewalk.run_gui

The following optional command line flags are available:

Flag	Default	Description
`--window <width> <height>`	800 800	Window size in pixels, used as width and height.
`--fps <integer>`	5	How many times to update the environment per second.
`--record <filename>.mp4`	None	Record a video of the trajectory.
`--view <width> <height>`	15 15	The layout size in cells; determines view distance.
`--length <integer>`	None	Time limit for the episode.
`--seed <integer>`	None	Determines world generation and creatures.
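
When the GUI is launched from a script rather than typed by hand, the flags in the table can be assembled into an argument list. The following is a minimal sketch: `build_gui_command` is a hypothetical helper, not part of bridgewalk, and it uses only the flags listed above.

```python
import sys


def build_gui_command(window=None, fps=None, record=None,
                      view=None, length=None, seed=None):
    """Build an argv list for `python -m bridgewalk.run_gui`.

    Hypothetical helper: any flag left as None is omitted, so the
    default from the table applies.
    """
    cmd = [sys.executable, "-m", "bridgewalk.run_gui"]
    if window is not None:
        # --window takes two integers: width and height in pixels.
        cmd += ["--window", str(window[0]), str(window[1])]
    if fps is not None:
        cmd += ["--fps", str(fps)]
    if record is not None:
        cmd += ["--record", str(record)]
    if view is not None:
        # --view takes two integers: layout width and height in cells.
        cmd += ["--view", str(view[0]), str(view[1])]
    if length is not None:
        cmd += ["--length", str(length)]
    if seed is not None:
        cmd += ["--seed", str(seed)]
    return cmd
```

The resulting list can be passed to `subprocess.run(...)`; for example, `build_gui_command(fps=10, seed=0, record="run.mp4")` corresponds to running the GUI with `--fps 10 --record run.mp4 --seed 0`.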

Training Agents

Installation: pip3 install -U bridgewalk

The environment follows the OpenAI Gym interface:

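import bridgewalk

# Create the environment with a fixed seed for reproducibility.
env = bridgewalk.Env(seed=0)
obs = env.reset()
# Observations are 64x64 RGB images.
assert obs.shape == (64, 64, 3)

# Run one episode with uniformly random actions.
done = False
while not done:
  action = env.action_space.sample()
  # step() returns the classic Gym 4-tuple.
  obs, reward, done, info = env.step(action)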

Environment Details

Reward

A reward of +1 is given the first time in each episode that the agent reaches the island at the end of the bridge.

Termination

Episodes terminate after 250 steps.
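
Because the only reward is a single +1 for reaching the island and episodes end after 250 steps, an episode return should be either 0 or 1, so the mean return over many episodes can be read as a success rate. The sketch below illustrates this under the classic Gym interface used in the example above. `run_episode` and `success_rate` are hypothetical helper names, not part of bridgewalk, and `max_steps` only mirrors the stated limit as a safety cap.

```python
def run_episode(env, policy, max_steps=250):
    """Roll out one episode with the classic Gym step signature.

    Returns (episode_return, num_steps). `max_steps` mirrors the
    250-step limit described above and only acts as a safety cap.
    """
    obs = env.reset()
    total, steps, done = 0.0, 0, False
    while not done and steps < max_steps:
        obs, reward, done, info = env.step(policy(obs))
        total += reward
        steps += 1
    return total, steps


def success_rate(env, policy, episodes=10):
    """Fraction of episodes in which the goal reward was collected."""
    returns = [run_episode(env, policy)[0] for _ in range(episodes)]
    return sum(r > 0 for r in returns) / episodes
```

With the real environment this could be called as `success_rate(bridgewalk.Env(seed=0), lambda obs: env.action_space.sample())`, where `env` is the same environment instance; a random policy gives a simple baseline to compare trained agents against.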

Observation Space

Each observation is an RGB image that shows a local view of the world around the player.

Action Space

The action space is categorical. Each action is an integer index representing one of the possible actions:

Integer	Name	Description
0	`noop`	Do nothing.
1	`move_left`	Walk left.
2	`move_right`	Walk right.
3	`move_up`	Walk up.
4	`move_down`	Walk down.
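
For readability in agent code, the integer indices can be given names. The sketch below copies the table into a local `IntEnum`. The `Action` class and `action_name` function are illustrative and defined here, not provided by bridgewalk; whether the package exposes its own list of action names is not stated in this document.

```python
from enum import IntEnum


class Action(IntEnum):
    """Action indices copied from the table above (local definition)."""
    NOOP = 0
    MOVE_LEFT = 1
    MOVE_RIGHT = 2
    MOVE_UP = 3
    MOVE_DOWN = 4


def action_name(index):
    """Return the lowercase action name for an integer index."""
    return Action(index).name.lower()
```

Passing `int(Action.MOVE_UP)` to `env.step` keeps the call explicit about sending a plain integer, and `action_name` is handy when logging sampled actions.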

Questions

Please open an issue on GitHub.
