• Stars
    star
    1
  • Language
    Python
  • Created about 7 years ago
  • Updated about 7 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Solution for mountain car environment using TD Lambda eligibility trace and RBF cells

More Repositories

1

Landing-A-Rocket-With-Simple-Reinforcement-Learning

This repo gives an example of using a simple method of reinforcement learning to beat the Lunar Lander environment. The agent uses a combination of CEM and neural networks using the pytorch library.
Jupyter Notebook
18
star
2

TD3

Implementation of the TD3 algorithm written in Pytorch
Jupyter Notebook
10
star
3

MonteCarlo

Implementation of first visit Monte Carlo for prediction and control
Jupyter Notebook
8
star
4

core_rl

Repo of core reinforcement learning algorithms and explanations using pytorch lightning
Python
6
star
5

CNN-On-The-Cloud-

Code used to build an image classifier for the Fashion MNIST dataset. Built using the Keras library and trained on the FloydHub cloud platform
Jupyter Notebook
4
star
6

DDPG_Reacher

Experiment to implement the DDPG algorithm to train a mechanical arm to reach for a moving target inside the unity ML-Agents virtual environment
Jupyter Notebook
2
star
7

SAC

Pytorch implementation of the Soft Actor Critic Algorithm
Jupyter Notebook
2
star
8

Neural-Network-From-Scratch-Tumour-Diagnosis

This notebook goes through how to build a neural network using only numpy. The network classifies tumours, identifying if they are malignant or benign. This notebook uses the Breast Cancer Wisconsin dataset.
Jupyter Notebook
2
star
9

DQN_Tensorflow

A jupyter notebook implementing the DQN model in VizDoom
Jupyter Notebook
1
star
10

awesome-prompt-engineering

repo containing useful prompt engineering templates that I use for coding, research and productivity
1
star
11

MADDPG

Final project for the Udacity RL nano degree implementing Multi Agent Deep Deterministic Policy Gradients
ASP
1
star