world-models-ppo

World Model implementation with PPO in PyTorch. This repository builds on world-models for the VAE and MDN-RNN implementations and firedup for the PPO optimization of the Controller network. Check the firedup setup file for requirements.

First save a number of the CarRacing-v0 Gym environment rollouts used for the train and test sets in the data_dir folder:

python env/carracing.py --data_dir './env/data' ---n_fold_train 20 ---n_fold_test 1

Then train the Variational Autoencoder (VAE) using the stored rollouts:

from vae.train import run
run(data_dir='./env/data', vae_dir='./vae/model', epochs=5)

Using the pretrained VAE, we train the Recurrent Mixture Density Network (MDN-RNN) model to predict the future latent state:

from mdnrnn.train import run
run(data_dir='./env/data', vae_dir='./vae/model', mdnrnn_dir='./mdnrnn/model', epochs=5)

We can finally train the Controller network which steers the car with PPO:

from rl.algos.ppo.ppo import run
run(exp_name='carracing_ppo', epochs=100)

Related Projects

SiamMask

[CVPR2019] Fast Online Object Tracking and Segmentation: A Unifying Approach

04 Mar 2019 3,464

pytorch-ner

Pipeline for training NER models using PyTorch.

20 Nov 2020 54

attention-is-all-you-need-pytorch

A PyTorch implementation of the Transformer model in "Attention is All You Need".

14 Jun 2017 8,777

PyTorch-ML

Implement DNN or ML models and advanced policies with PyTorch.(Include experiment)

27 Mar 2018 11

pvae

code for "Continuous Hierarchical Representations with Poincaré Variational Auto-Encoders".

23 Apr 2019 123

continuousprediction

Formulating Model-based RL Dynamics as a continuous rather then one step prediction

27 Sep 2019 34

PyTorch-RL

PyTorch implementation of Deep Reinforcement Learning: Policy Gradient methods (TRPO, PPO, A2C) a...

17 Oct 2017 1,092

Amazon-Forest-Computer-Vision

Amazon Forest Computer Vision: Satellite Image tagging code using PyTorch / Keras with lots of Py...

08 Sep 2017 366

pytorch-ssd

MobileNetV1, MobileNetV2, VGG based SSD/SSD-lite implementation in Pytorch 1.0 / Pytorch 0.4. Out...

18 May 2018 1,390

Deep-reinforcement-learning-with-pytorch

PyTorch implementation of DQN, AC, ACER, A2C, A3C, PG, DDPG, TRPO, PPO, SAC, TD3 and ....

09 Jun 2018 3,876

PPO-PyTorch

Minimal implementation of clipped objective Proximal Policy Optimization (PPO) in PyTorch

27 Sep 2018 1,653

net2net

Network-to-Network Translation with Conditional Invertible Neural Networks

21 Oct 2020 221

robotics-rl-srl

S-RL Toolbox: Reinforcement Learning (RL) and State Representation Learning (SRL) for Robotics

18 Jan 2018 607

ENAS-pytorch

PyTorch implementation of "Efficient Neural Architecture Search via Parameters Sharing"

15 Feb 2018 2,695

srl-zoo

State Representation Learning (SRL) zoo with PyTorch - Part of S-RL Toolbox

30 Oct 2017 162