BananaAgent-PyTorch

Solution to Project 1 of Udacity Deep Reinforcement Learning Nanodegree

Double Dueling Deep Q-Network in PyTorch

This model was developed as a solution to Project 1 of the Udacity Deep Reinforcement Learning Nanodegree. (Image from the official repo.)

Installation

Install the package requirements for this repository:

pip install -r requirements.txt

Banana Environment

The agent was developed specifically to solve a banana-collection environment built in Unity, which can be downloaded from the following locations. The objective is for an agent to navigate the environment and collect yellow bananas (+1 reward) while avoiding blue bananas (-1 reward). Download the environment for your platform and unpack it into the ./env_unity/ folder of this repo:

  • Environment with a discrete state space (37 dimensions)
  • Environment with a pixel state space

In both versions of the environment, the agent has an action space with four discrete actions:

  • 0: forward
  • 1: backward
  • 2: left
  • 3: right

The environment is considered solved when the agent collects an average score of +13 over 100 consecutive episodes.
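For reference, a minimal interaction loop with a random agent looks roughly like the following. This is a sketch assuming the unityagents package used by the Udacity course (not the newer mlagents) and the discrete-state version of the environment:

import numpy as np
from unityagents import UnityEnvironment

# Open the environment (path assumes the discrete-state Windows build).
env = UnityEnvironment(file_name="env_unity/DiscreteBanana/Banana.exe")
brain_name = env.brain_names[0]                 # the environment has one brain

env_info = env.reset(train_mode=False)[brain_name]
state = env_info.vector_observations[0]         # 37-dimensional state vector
score, done = 0.0, False
while not done:
    action = np.random.randint(4)               # pick one of the four actions
    env_info = env.step(action)[brain_name]     # advance the simulation
    state = env_info.vector_observations[0]
    score += env_info.rewards[0]
    done = env_info.local_done[0]
print("Episode score:", score)
env.close()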

Repository Structure

  • libs/agents.py: A DQN agent, which by default is configured as a double dueling DQN.
  • libs/models.py: PyTorch models used by the DQN agent (see the sketch after this list).
  • libs/memory.py: Prioritized experience replay, using the sum-tree defined in libs/sumtree.py.
  • libs/monitor.py: Functionality for training/testing the agent and interacting with the environment.
  • main.py: Main command-line interface for training and testing the agent.
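As a rough illustration of the dueling architecture in libs/models.py: the network splits into a state-value head V(s) and an advantage head A(s, a), then recombines them into Q-values. The layer sizes below are assumptions for the discrete environment, not the exact ones used in this repo:

import torch
import torch.nn as nn

class DuelDQN(nn.Module):
    """Sketch of a dueling Q-network; layer sizes are illustrative."""
    def __init__(self, state_size=37, action_size=4, hidden=64):
        super().__init__()
        self.feature = nn.Sequential(nn.Linear(state_size, hidden), nn.ReLU())
        self.value = nn.Linear(hidden, 1)                # V(s)
        self.advantage = nn.Linear(hidden, action_size)  # A(s, a)

    def forward(self, state):
        x = self.feature(state)
        v = self.value(x)
        a = self.advantage(x)
        # Q(s, a) = V(s) + A(s, a) - mean_a A(s, a); subtracting the mean
        # advantage makes the value/advantage decomposition identifiable.
        return v + a - a.mean(dim=1, keepdim=True)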

Training the Agent

To train the agent on the discrete state space, run one of the following (only tested on Windows!):

python main.py --environment env_unity/DiscreteBanana/Banana.exe --model_name DQN
python main.py --environment env_unity/DiscreteBanana/Banana.exe --model_name DuelDQN
python main.py --environment env_unity/DiscreteBanana/Banana.exe --model_name DQN --double
python main.py --environment env_unity/DiscreteBanana/Banana.exe --model_name DuelDQN --double

To train the agent on the pixel state space, use one of the following (only tested on Windows!):

python main.py --environment env_unity/VisualBanana/Banana.exe --model_name DQN
python main.py --environment env_unity/VisualBanana/Banana.exe --model_name DuelDQN
python main.py --environment env_unity/VisualBanana/Banana.exe --model_name DQN --double
python main.py --environment env_unity/VisualBanana/Banana.exe --model_name DuelDQN --double
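The --double flag switches the learning target to Double DQN: the online network selects the greedy next action while the target network evaluates it, which reduces the overestimation bias of vanilla Q-learning. A minimal sketch of that target computation (function and variable names are assumptions, not the ones in libs/agents.py):

import torch

def double_dqn_targets(online_net, target_net, rewards, next_states, dones, gamma=0.99):
    with torch.no_grad():
        # The online network selects the argmax action for each next state...
        next_actions = online_net(next_states).argmax(dim=1, keepdim=True)
        # ...and the target network evaluates the value of that action.
        next_q = target_net(next_states).gather(1, next_actions)
    # Bootstrap only for non-terminal transitions.
    return rewards + gamma * next_q * (1 - dones)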

Testing the Agent

Once the agent has been trained, it can be run in test mode as follows:

python main.py --environment env_unity/VisualBanana/Banana.exe --model_name DQN --test --checkpoint logs/weights_env_unity_VisualBanana_DQN_single.pth
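Under the hood, testing amounts to restoring the saved weights and running the policy greedily. A minimal sketch, reusing the hypothetical DuelDQN class from the Repository Structure section purely for illustration (the constructor arguments and class choice are assumptions):

import torch

# Restore the trained weights saved during training; the checkpoint path
# matches the test command above.
model = DuelDQN(state_size=37, action_size=4)    # hypothetical constructor
state_dict = torch.load("logs/weights_env_unity_VisualBanana_DQN_single.pth",
                        map_location="cpu")
model.load_state_dict(state_dict)
model.eval()                                     # evaluation mode for testing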

Profiling

To optimize training speed, I used the following command to profile the code:

python -m cProfile -o profile.txt -s tottime main.py --environment env_unity/VisualBanana/Banana.exe --model_name DQN --double
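Note that cProfile's -s option only applies when -o is not supplied; with -o the results go to profile.txt, which can then be sorted and inspected with the standard pstats module:

import pstats

# Load the cProfile dump and print the 20 most expensive functions,
# sorted by time spent inside each function itself (tottime).
stats = pstats.Stats("profile.txt")
stats.sort_stats("tottime").print_stats(20)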