DanielSlater/AlphaToe

Stars
163
Rank 231,141 (Top 5 %)
Language
Python
License
MIT License
Created about 8 years ago
Updated about 7 years ago

DanielSlater/AlphaToe

DanielSlater

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Applying the deep learning techniques from Alpha Go to play tic-tac-toe

AlphaToe

Applying the deep learning techniques from Alpha Go to play tic-tac-toe

These are the code examples to with my talk, the slide for which are in AlphaToe.pdf

As well as the slides, the file script/policy_gradient.py is a good starting point for the project. All networks are built using TensorFlow.

SetUp

To get running start by creating a virtual env/conda env with tensorFlow installed. Current instructions for this are at: https://www.tensorflow.org/versions/r0.11/get_started/os_setup.html#anaconda-installation

I've also found this useful: https://anaconda.org/jjhelmus/tensorflow

Then run the file file policy_gradient.py

This has been tested with python 2.7 and 3.5

PyGamePlayer

Module to help with running learning agents against PyGame games

Net2Net

numpy implementation of net 2 net from the paper Net2Net: Accelerating Learning via Knowledge Transfer http://arxiv.org/abs/1511.05641

PyDataLondon2016

Collection of examples, links and slides for the tutorial "Building a Pong playing AI in just 1 hour(plus 4 days training...)" presented at PyDataLondon 2016

PythonDeepLearningSamples

Samples for the book Python Deep Learning

WikiDataDotNet

Dot Net API for getting data from WikiData

tensordynamic

Dynamically generated nueral nets with TensorFlow

CascadeCorrelation

Cascade correlation algo in python, still needs some work

PythonPSO

ModelLearning

Experiments around model based learning