unixpickle/anyrl-py

Stars
156
Rank 239,589 (Top 5 %)
Language
Python
Created about 7 years ago
Updated almost 2 years ago

unixpickle/anyrl-py

unixpickle

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

A reinforcement learning framework

anyrl-py

This is a Python remake (and makeover) of anyrl. It is a general-purpose library for Reinforcement Learning which aims to be as modular as possible.

Installation

You can install anyrl with pip:

pip install anyrl

APIs

There are several different sub-modules in anyrl:

models: abstractions and concrete implementations of RL models. This includes actor-critic RNNs, MLPs, CNNs, etc. Takes care of sequence padding, BPTT, etc.
envs: APIs for dealing with environments, including wrappers and asynchronous environments.
rollouts: APIs for gathering and manipulating batches of episodes or partial episodes. Many RL algorithms include a "gather trajectories" step, and this sub-module fulfills that role.
algos: well-known learning algorithms like policy gradients or PPO. Also includes mini-algorithms like Generalized Advantage Estimation.
spaces: tools for using action and observation spaces. Includes parameterized probability distributions for implementing stochastic policies.

Motivation

Currently, most RL code out there is very restricted and not properly decoupled. In contrast, anyrl aims to be extremely modular and flexible. The goal is to decouple agents, learning algorithms, trajectories, and things like GAE.

For example, anyrl decouples rollouts from the learning algorithm (when possible). This way, you can gather rollouts in several different ways and still feed the results into one learning algorithm. Further, and more obviously, you don't have to rewrite rollout code for every new RL algorithm you implement. However, algorithms like A3C and Evolution Strategies may have specific ways of performing rollouts that can't rely on the rollout API.

Use of TensorFlow

This project relies on TensorFlow for models and training algorithms. However, anyrl APIs are framework-agnostic when possible. For example, the rollout API can be used with any policy, whether it's a TensorFlow neural network or a native-Python decision forest.

Style

I use autopep8 and flake8. Here is the command you can use to run autopep8:

autopep8 --recursive --in-place --max-line-length 100 .

I recommend the following flag for flake8: --max-line-length=100

gobfuscate

Obfuscate Go binaries and packages

JamWiFi

A GUI, easy to use WiFi network jammer for Mac OS X

kahoot-hack

Reverse engineering kahoot.it

muniverse

µniverse: RL environments for HTML5 games

Giraffe

Encode animated GIF files on the iPhone

weakai

AI algorithms implemented in Go

obs-tower2

My solution to the Unity Obstacle Tower Challenge

model3d

Create & render beautiful 3D models

audioset

Fetch and use Google's AudioSet dataset

sk2torch

Convert scikit-learn models to PyTorch modules

num-analysis

Learning some Numerical Analysis

cbyge

Reverse engineering Cync (formerly "C by GE") WiFi devices

fbmsgr

Reverse engineering Facebook Messenger

ANImageBitmapRep

A set of classes for easily manipulating images with bitmap data or CoreGraphics

car-data

Scraping and predicting car info

vq-vae-2

A PyTorch implementation of the VQ-VAE-2 paper

Benchmarks

Some language performance comparisons.

SnapchatHax

Hacking away at Snapchat from iOS!

learn-nerf

Learning about Neural Radiance Fields

ImageReflection

A simple addition to UIImage allowing the reflection of images

cve-2018-4407

Crash macOS and iOS devices with one packet

vq-voice-swap

Voice swapping with VQ-VAE and diffusion models

GifPro

My new and improved Gif encoder for Mac

LibOrange

A simple AOL Instant Messenger implementation for Objective-C

vae-textures

Texture mapping with variational auto-encoders

vq-draw

A discrete sequential VAE

Jupyter Notebook

PathIntersection

A class that can be used to find line intersections of CGPaths

learn-quantum

Learning about quantum computing

anynet

Framework for artificial neural networks

MP4Audio

A partially broken Objective-C API for extracting audio from MP4 files and editing metadata.

ANColorPicker

A custom mac-like color well for iPhone

sgdstore

Augmented RNN memory via live SGD

Mac-Utils

A series of small applications to increase the Mac OS X experience

whichlang

Using ML to recognize programming languages

spherenet

Implementing Deep Hyperspherical Learning

cuda

Go bindings for CUDA, done right.

svm-playground

Play around with SVMs in the browser

hopfield

Hopfield networks in TensorFlow

char-rnn

Generate text with recurrent neural nets

ddim

Denoising Diffusion Implicit Models

Jupyter Notebook

demoverse

Record demonstrations for µniverse

alux

A lightweight C++ kernel designed to run a JavaScript or Dart VM

rwa

RWA recurrent neural networks

camera-hijack

A chrome extension to mess with the webcam

treeagent

Decision tree ensembles as RL policies

SoundArt

Draw sound waves and hear them, iOS only

learnos

Reminding myself everything I knew about OSDev (and more)

ANExpressionParser

Terrible, old, Objective-C expression parser.

ImageTransfer

Bluetooth image transferring app for the iPhone

SocketKit

A C socket wrapper (with SSL) written in Objective-C

ScreenPear

A remote displays application for OS X, still in the works.

heatgrid

Emulate heat conduction in a solid

uno-ai

AI for the game Uno

FreeRez

A GUI Mac OS X application for setting the native resolution on a Retina MBP

voronoi-interp

Create cool animations by gradually adding pixels to an interpolated image.

sentigraph

Graph sentiment throughout a piece of text

bezier-mnist

MNIST, but with Bezier curves instead of pixels

ANDownload

A small download manager with pause&resume support for iphone and mac

anyrl

[Deprecated] APIs for Reinforcement Learning

VideoExporter

A basic Objective-C wrapper for AV Foundation's AVAssetWriter

SpinWheel

A UIView that the user can spin with touch events

godsalg

Trying to find God's algorithm on a Rubik's cube

statushub

A simple log aggregation tool

cnn-toys

Playing around with CNNs

Wolfram-API

An Objective-C implementation of the Wolfram API 2.0

dist-sys

Teaching myself about distributed systems

essentials

Things I wish were Go built-ins

chatbot

Instant messaging with a neural network

neuralspell

Spell and pronounce words with a neural network

polish

Denoising networks for ray traced images

text2emoji

Neural network that produces emojis from text

ffmpego

A Go package for encoding and decoding video and audio files.

torch-bandpass

An implementation of the Prism layer (https://arxiv.org/abs/2011.04823)

Jupyter Notebook

packet-proxy

A proxy for reverse engineering a communication protocol

setres

A CLI for setting the resolution on Mac OS X on the retina MBPs

markovchain

Markov chains for text and anything else

mnistdemo

Test MNIST classifiers from your browser

cubezapp

An amazing cube timer

Expressions

An object-oriented mathematical expression parser for Objective-C

uber-ga

Implementation of Uber's genetic algorithm for RL

learning-tf

Learning TensorFlow

pca-compress

Compressing neural network initializations with PCA

tweetembed

Build word embeddings for Tweets

LassoCapture-old

Extended screenshot options for Mac OS X

SlideToUnlock

A slide-to-unlock interface for iOS

anarch

API for architecture-specific abstractions in OS kernels

tf-env

RL environments written in pure TensorFlow

agg

Command-line tool for numerical aggregates

payrange

Tracking laundry machines

wav

A WAV encoding/decoding library for Go

voronoi-glass

Create a cool glass-like pattern using Voronoi cells

ANHTML

A lightweight HTML parser for Objective-C (ARC only)

captcha-crack

Cracking a simple captcha system

ErrorScatter

A small prank application for Mac OS X

anyvec

Precision-agnostic vector abstractions

solid-trace

Visualize 3D solids implemented as JavaScript boolean functions

smallpng

Lossy compression for PNG files

speechrecog

Tools for speech recognition

wavenet

A convenient TensorFlow package for the WaveNet architecture

gospeech

An attempt at speech synthesis in Go