warp-drive | Cuda Ecosystem Directory

warp-drive - v2.7 Release Latest Release

Published by Emerald01 8 months ago

Support continuous actions
Add Pendulum environment that can run up to 100K concurrent replicates
Add DDPG algorithms for training continuous action policies

warp-drive - v2.6 Release

Published by Emerald01 11 months ago

Extend to easily support single agent framework. We start to add gym.classic_control as examples. Cartpole, Mountain Car and Acrobot have been included and they can run up to 100K concurrent replicates.

warp-drive - v2.5 Release

Published by Emerald01 about 1 year ago

Introduce the random reset from the pre-defined reset pool. Users are able to provide reset data pool for the corresponding data array(i.e., reset_target). During reset, the target data array will randomly pick up the reset data from the reset pool for each individual environment replica independently.

warp-drive - v2.4 Release

Published by Emerald01 over 1 year ago

Introduce new device context management and autoinit_pycuda
Therefore, torch (any version) will not conflict with PyCUDA in the GPU context

warp-drive - v2.3 Release

Published by Emerald01 over 1 year ago

Release 2.3 (2022-03-22)

Add ModelFactory class to manage custom models
Add Xavier initialization for the model
Improve trainer.fetch_episode_states() so it can fetch (s, a, r) and can replay with argmax.

Release 2.2 (2022-12-20)

Factorize the data loading for placeholders and batches (obs, actions and rewards) for the trainer.

Release 2.1 (2022-10-26)

v2 trainer integration with Pytorch Lightning

warp-drive - v2.0 release

Published by Emerald01 about 2 years ago

supports the dual backends of both CUDA C and the JIT compiled Numba.
supports end-to-end simulation and training on multi-GPUs with either CUDA C or Numba.
full backward compatibility with v1.0

warp-drive - v1.6 release

Published by Emerald01 about 2 years ago

Using the extreme parallelization capability of GPUs, WarpDrive enables orders-of-magnitude faster RL compared to CPU simulation + GPU model implementations.

It is extremely efficient as it avoids back-and-forth data copying between the CPU and the GPU.
runs simulations across multiple agents and multiple environment replicas in parallel.
provides the auto scaling tools to achieve the optimal throughput per device (version 1.3).
performs the distributed asynchronous training among multiple GPU devices (version 1.4).
combine multiple GPU blocks for one environment replica (version 1.6).