Discover @Breakend Open Source projects

Peter Henderson (@Breakend)

Breakend

Stars
1,111
Global Rank 27,649 (Top 1.0 %)
Followers 206
Following 1
Registered over 12 years ago
Most used languages

Python
61.8 %

Jupyter Notebook
11.8 %

HTML
8.8 %

Java
5.9 %

C#
2.9 %

CSS
2.9 %

C++
2.9 %

OpenEdge ABL 2.9 %

experiment-impact-tracker

gym-extensions

This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement learning, etc.)

DeepReinforcementLearningThatMatters

Accompanying code for "Deep Reinforcement Learning that Matters"

PileOfLaw

A dataset for pretraining language models targeted for legal tasks.

Jupyter Notebook

DialogDatasets

A repository linking to publicly available dialog datasets. Feel free to send pull requests.

MotionDetection

A project on motion detection in a noisy environment (shaky or moving camera), through background subtraction with single Gaussian models.

OptionGAN

Code accompanying the OptionGAN paper.

echo

Android Mesh Networking Chat with WiFI-Direct

RLSSContinuousControlTutorial

Tutorial on continuous control at Reinforcement Learning Summer School 2017.

ReproducibilityInContinuousPolicyGradientMethods

These are experiments for examining reproducibility in Policy Gradient RL algorithms in Continuous domains. Mainly using the Rllab implementation.

EthicsInDialogue

MultiStepBootstrappingInRL

Here, we compare Q(\sigma) learning presented by Sutton and Barto in [1] to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.

SocraticSwarm

A simulator and algorithms using deccentralized receding horizon control for coordinating autonomous UAV systems in completing a search task.

SelfDestructingModels

SarsaVsExpectedSarsa

An a bias-variance tradeoff of Sarsa vs. Expected Sarsa with experiments.

Jupyter Notebook

BayesianPolicyGradients

CMACvTileCode

ValuePolicyIterationVariations

Experiments testing variants of Value and Policy iterations.

Jupyter Notebook

ExperimentsInIRL

TemporalYolo

Experiments on temporal YOLO

WhatShouldICite

This is an informal record of original citations that I'm aware of for key terms in scientific literature. It started because I didn't know what's the original work to cite for eligibility traces and it seems important to do proper credit assignment.

orion-pytorch-ppo-acktr-a2c

An adapted version of the ikostrikov RL algorithm implementation for use with the Oríon hyperparameter optimization framework.

DeepMultiObjectTracking

ClimateChangeFromMachineLearningResearch

drqawrapper

AdversarialGain

echo-laptop

This is the laptop client to to connect to echo nodes

LLM-Tuning-Safety.github.io

TARProtocols

Dataset of Discovery Validation Protocols

NeurIPS

A mirror for some of the NeurIPS website content with a new acronym.

Option-Critic-Turing-Machines

A development toybox and pitch for integrating the option-critic architecture with neural turing machines.

Jupyter Notebook

RL-Energy-Leaderboard

AquaBoxDataset

A dataset for bounding box prediction in underwater environments of the Aqua-family of hexapod robots.

Vulnerabilities-In-Discovery-Tech-Experiment-1

NLPAssignment1

Code for Comp599 Assignment 1 (TAC document classification using simple algos and uni/bigram models)

TemporalDeepQLearning

Experiments in temporal deep Q learning