• Stars
    star
    8
  • Rank 2,099,232 (Top 42 %)
  • Language
    Jupyter Notebook
  • Created about 6 years ago
  • Updated about 6 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Implementation of first visit Monte Carlo for prediction and control

More Repositories

1

Landing-A-Rocket-With-Simple-Reinforcement-Learning

This repo gives an example of using a simple method of reinforcement learning to beat the Lunar Lander environment. The agent uses a combination of CEM and neural networks using the pytorch library.
Jupyter Notebook
18
star
2

TD3

Implementation of the TD3 algorithm written in Pytorch
Jupyter Notebook
10
star
3

core_rl

Repo of core reinforcement learning algorithms and explanations using pytorch lightning
Python
6
star
4

CNN-On-The-Cloud-

Code used to build an image classifier for the Fashion MNIST dataset. Built using the Keras library and trained on the FloydHub cloud platform
Jupyter Notebook
4
star
5

DDPG_Reacher

Experiment to implement the DDPG algorithm to train a mechanical arm to reach for a moving target inside the unity ML-Agents virtual environment
Jupyter Notebook
2
star
6

SAC

Pytorch implementation of the Soft Actor Critic Algorithm
Jupyter Notebook
2
star
7

Neural-Network-From-Scratch-Tumour-Diagnosis

This notebook goes through how to build a neural network using only numpy. The network classifies tumours, identifying if they are malignant or benign. This notebook uses the Breast Cancer Wisconsin dataset.
Jupyter Notebook
2
star
8

DQN_Tensorflow

A jupyter notebook implementing the DQN model in VizDoom
Jupyter Notebook
1
star
9

MountainCar_TD_Lambda

Solution for mountain car environment using TD Lambda eligibility trace and RBF cells
Python
1
star
10

awesome-prompt-engineering

repo containing useful prompt engineering templates that I use for coding, research and productivity
1
star
11

MADDPG

Final project for the Udacity RL nano degree implementing Multi Agent Deep Deterministic Policy Gradients
ASP
1
star