experiment-impact-tracker
gym-extensions
This repo is intended as an extension for OpenAI Gym for auxiliary tasks (multitask learning, transfer learning, inverse reinforcement learning, etc.)
DeepReinforcementLearningThatMatters
Accompanying code for "Deep Reinforcement Learning that Matters"
DialogDatasets
A repository linking to publicly available dialog datasets. Feel free to send pull requests.
MotionDetection
A project on motion detection in a noisy environment (shaky or moving camera), through background subtraction with single Gaussian models.
OptionGAN
Code accompanying the OptionGAN paper.
echo
Android Mesh Networking Chat with Wi-Fi Direct
RLSSContinuousControlTutorial
Tutorial on continuous control at Reinforcement Learning Summer School 2017.
ReproducibilityInContinuousPolicyGradientMethods
These are experiments examining reproducibility in policy gradient RL algorithms in continuous domains, mainly using the rllab implementation.
EthicsInDialogue
MultiStepBootstrappingInRL
Here, we compare Q(\sigma) learning presented by Sutton and Barto in [1] to Tree-Backup, n-step Expected Sarsa, and n-step Sarsa.
SocraticSwarm
A simulator and algorithms using decentralized receding horizon control for coordinating autonomous UAV systems in completing a search task.
SelfDestructingModels
SarsaVsExpectedSarsa
An analysis of the bias-variance tradeoff of Sarsa vs. Expected Sarsa, with experiments.
BayesianPolicyGradients
CMACvTileCode
ValuePolicyIterationVariations
Experiments testing variants of Value and Policy iterations.
ExperimentsInIRL
TemporalYolo
Experiments on temporal YOLO
WhatShouldICite
This is an informal record of original citations that I'm aware of for key terms in scientific literature. It started because I didn't know what's the original work to cite for eligibility traces and it seems important to do proper credit assignment.
orion-pytorch-ppo-acktr-a2c
An adapted version of the ikostrikov RL algorithm implementation for use with the Oríon hyperparameter optimization framework.
DeepMultiObjectTracking
ClimateChangeFromMachineLearningResearch
drqawrapper
AdversarialGain
echo-laptop
This is the laptop client to connect to echo nodes.
LLM-Tuning-Safety.github.io
TARProtocols
Dataset of Discovery Validation Protocols
NeurIPS
A mirror for some of the NeurIPS website content with a new acronym.
Option-Critic-Turing-Machines
A development toybox and pitch for integrating the option-critic architecture with neural turing machines.
RL-Energy-Leaderboard
AquaBoxDataset
A dataset for bounding box prediction in underwater environments of the Aqua-family of hexapod robots.
Vulnerabilities-In-Discovery-Tech-Experiment-1
NLPAssignment1
Code for Comp599 Assignment 1 (TAC document classification using simple algos and uni/bigram models)
TemporalDeepQLearning
Experiments in temporal deep Q learning