C++-based high-performance parallel environment execution engine (vectorized env) for general RL environments.
APACHE-2.0 License
Bot releases are hidden (Show)
Published by Trinkle23897 about 1 year ago
Published by Trinkle23897 over 1 year ago
[Disclaimer: generated by ChatGPT API]
🚨 Hey contributors! 🚨
It's a great day to celebrate our open-source contributions to sail-sg/envpool! Over the past 78 days, we have merged some notable changes we'd like to share. Here's a summary of those changes:
🔧 PR #237 [MiniGrid Empty Env] @wangsiping97:
Port MiniGrid EmptyEnv to envpool.
🔧 PR #238 [lazy import] @Trinkle23897:
import envpool
will no longer raise an error due to procgen qt5 dependency, making fresh devboxes more accessible.
🔧 PR #242 [chore: update dev-cn dockerfile] @Benjamin-eecs:
The fast installation of dependencies makes development faster and more efficient.
🔧 PR #243 [Use new gym API in examples] @leninilyich:
The environment used to return five values which could not be unpacked into "done". It has been fixed and is working as intended.
🐛 PR #245 [typo fix in CarpoletSwingup
] @vwxyzjn:
A small typo in the CarpoletSwingup environment has been corrected.
🔧 PR #248 [chore: add timeout flag for pip requirements installation] @Benjamin-eecs:
An extended timeout for pip requirements installation has been added. It is especially helpful for users in CN.
🚀 PR #249 [fix(hansbug): use treevalue 1.4.7] @HansBug:
There was a bug in PR #246, which has now been fixed by upgrading the version of treevalue to 1.4.7.
🚀 PR #247 [deps: replace treevalue
with optree
] @Benjamin-eecs @XuehaiPan:
We have replaced treevalue
with optree
to speed up and support custom node type registration. It has introduced a new feature and a non-breaking change that adds core functionality.
🔧 PR #252 [Fix publish ci packaging not found
issue and change minigrid deterministic test] @Trinkle23897:
This PR fixed the packaging issue and changed the minigrid deterministic test. It will lead to more stable performance.
🐛 PR #254 [fix old gym version return type error] @Trinkle23897:
This PR fixes a bug introduced by PR #247.
🐛 PR #255 [fix atari incorrect trunc flag when episodic_life=True] @Trinkle23897:
A bug which caused incorrect trunc flag in atari with episodic_life=True has been fixed.
Each of these PRs has made significant contributions to our codebase. We encourage you to take a look and check out our latest improvements with new and fixed features. As always, we appreciate your support and hope to continue our partnership with you for a long time to come.
Thanks to everybody who contributed!
Published by Trinkle23897 almost 2 years ago
channel_first
option in procgen env, change default observation shape from (64, 64, 3)
to (3, 64, 64)
(#232, @Trinkle23897)Published by Trinkle23897 almost 2 years ago
Add 42 procgen environments (#197, @YukunJ, @LeoGuo98, @Trinkle23897)
Procgen library change in https://github.com/Trinkle23897/procgen
Changes:
qtdeclarative5-dev
, rebuild docker imagebase_path
Published by Trinkle23897 almost 2 years ago
Finally, all environments in gym are in EnvPool! You can fully deprecate Box2D / box2d-py / mujoco_py pypi package.
Published by Trinkle23897 almost 2 years ago
Now we are truly faster than dm_control!
Published by Trinkle23897 about 2 years ago
Published by Trinkle23897 about 2 years ago
Published by Trinkle23897 about 2 years ago
Published by Trinkle23897 over 2 years ago
jax.jit
on EnvPool (#149, @mavenlin)Published by Trinkle23897 over 2 years ago
Note: Compared with dm_control, EnvPool can have at about 2x free speedup with only single environment setting (#139, @Trinkle23897). For the next release, we are going to use mujoco source code (#141, @Trinkle23897) to make everything faster!
Published by Trinkle23897 over 2 years ago
Note: we are still in the progress of adding more environment from dm_control suite and box2d.
Published by Trinkle23897 over 2 years ago
This release is for a stable version for benchmarking. We will update the benchmark result for both Atari (Pong-v5) and Mujoco (Ant-v3) soon. We observe Ant-v3 can achieve 2M+ FPS with a 192-core machine.
gym_reset_return_info
option for returning (obs, info)
in gym.Env.reset
(#97)Published by Trinkle23897 over 2 years ago
ENVPOOL_TEST
(#93, #94)terminate_when_unhealthy
and exclude_current_positions_from_observation
for most of mujoco envs (#93)info["qpos0"]
and info["qvel0"]
in mujoco env when generating wheel (#93)Published by Trinkle23897 over 2 years ago
Published by Trinkle23897 over 2 years ago
We have successfully integrated Ant-v4 environment based on the newest deepmind/mujoco package! (#74)
More environments are coming soon.
Other enhancement: support element-wise bound, fix classic_control action space (#67)
Published by Trinkle23897 over 2 years ago
Published by Trinkle23897 almost 3 years ago
info["reward"]
(#50)Published by Trinkle23897 almost 3 years ago
Add toy_text (#41, #42, #43, #44, #45, #46)