• Stars
    337
  • Rank 124,560 (Top 3 %)
  • Language
    Python
  • License
    MIT License
  • Created almost 7 years ago
  • Updated almost 2 years ago

Repository Details

Congratulations to DeepMind! This is a re-engineered implementation (building on many other git repos in /support/) of DeepMind's Oct 19th publication: [Mastering the Game of Go without Human Knowledge]. The supervised learning approach is more practical for individuals. (This repository is for educational purposes only.)

AlphaGOZero (python tensorflow implementation)

This is a trial implementation of DeepMind's Oct 19th publication: Mastering the Game of Go without Human Knowledge.

DeepMind has released AlphaZero Teaching Go. It's a lot of fun!


From Paper

Pure RL has outperformed the supervised learning + RL agent.

SL evaluation

Download trained model

  1. https://drive.google.com/drive/folders/1Xs8Ly3wjMmXjH2agrz25Zv2e5-yqQKaP?usp=sharing

  2. Place under ./savedmodels/large20/


Set up

Install requirements

Python 3.6 and tensorflow/tensorflow-gpu (version 1.4; versions >= 1.5 can't load the trained models)
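
Because the trained models only load under the 1.4 series, it can be worth checking the installed version before running anything. The following is a minimal sketch, not part of this repository; the helper names `parse_major_minor` and `can_load_trained_models` are hypothetical.

```python
import re

# Assumption taken from the requirement above: 1.4 works, >= 1.5 does not.
REQUIRED_MAJOR_MINOR = (1, 4)


def parse_major_minor(version):
    """Return (major, minor) from a version string such as '1.4.1'."""
    match = re.match(r"(\d+)\.(\d+)", version)
    if match is None:
        raise ValueError("unrecognised version string: %r" % version)
    return int(match.group(1)), int(match.group(2))


def can_load_trained_models(version):
    """True only for the 1.4.x series named in the requirements."""
    return parse_major_minor(version) == REQUIRED_MAJOR_MINOR
```

With TensorFlow installed, the check would be `can_load_trained_models(tf.__version__)` after `import tensorflow as tf`.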

pip install -r requirement.txt

Download Dataset (kgs 4dan)

Under repo's root dir

cd data/download
chmod +x download.sh
./download.sh

Preprocess Data

This is only an example; feel free to point it at your local dataset directory.

python preprocess.py preprocess ./data/SGFs/kgs-*
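
Before preprocessing, it may help to confirm that the download step actually produced SGF files in the directories the `kgs-*` pattern matches. This is a small sketch under the assumption that game records end in `.sgf`; `count_sgf_files` is a hypothetical helper and not part of `preprocess.py`.

```python
import glob
import os


def count_sgf_files(pattern):
    """Map each directory matching `pattern` to the number of .sgf files under it."""
    counts = {}
    for directory in sorted(glob.glob(pattern)):
        if not os.path.isdir(directory):
            continue  # skip archives or stray files that match the pattern
        total = 0
        for _root, _dirs, files in os.walk(directory):
            total += sum(1 for name in files if name.lower().endswith(".sgf"))
        counts[directory] = total
    return counts


if __name__ == "__main__":
    # Same pattern as the preprocess command above.
    for directory, total in count_sgf_files("./data/SGFs/kgs-*").items():
        print(directory, total)
```

An empty result suggests the archives were not downloaded or extracted where the pattern expects them.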

Train A Model

python main.py --mode=train

Play Against An A.I.

python main.py --mode=gtp --gtp_policy=greedypolicy --model_path='./savedmodels/your_model.ckpt'
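
In gtp mode the engine speaks the Go Text Protocol over standard input and output: a controller sends commands such as `genmove b`, and each reply starts with `=` (success) or `?` (failure), optionally followed by a command id, and ends with a blank line. The sketch below shows how a controller might interpret such replies; `parse_gtp_response` is a hypothetical helper, not a function from this repository.

```python
import re


def parse_gtp_response(raw):
    """Split a raw GTP reply into (success, command_id, payload)."""
    text = raw.strip()
    match = re.match(r"([=?])(\d*)\s*(.*)", text, re.DOTALL)
    if match is None:
        raise ValueError("not a GTP response: %r" % raw)
    status, command_id, payload = match.groups()
    return (
        status == "=",                          # '=' marks success, '?' failure
        int(command_id) if command_id else None,  # id is optional in GTP
        payload,
    )
```

For instance, the reply `= D4` followed by a blank line would parse as a successful response with payload `D4` and no command id.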

Play in Sabaki

  1. In console:
which python

Add the result to the first line of main.py with a #! prefix.

  2. Add the path of main.py to Sabaki's Manage Engines dialog with the argument --mode=gtp
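
The shebang step can also be scripted. The sketch below is one possible way to do it, assuming the interpreter path is whatever `which python` printed; `with_shebang` is a hypothetical helper and does not exist in this repository.

```python
import sys


def with_shebang(source, interpreter):
    """Return `source` with '#!<interpreter>' as its first line."""
    shebang = "#!" + interpreter
    lines = source.splitlines()
    if lines and lines[0].startswith("#!"):
        lines[0] = shebang        # replace an existing shebang
    else:
        lines.insert(0, shebang)  # otherwise add one at the top
    return "\n".join(lines) + "\n"


if __name__ == "__main__":
    # sys.executable is the running interpreter; it usually, but not always,
    # matches the path reported by `which python`.
    print(with_shebang("import os\n", sys.executable), end="")
```

Since Sabaki launches main.py directly by its path, the file will most likely also need execute permission (for example `chmod +x main.py`).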

TODO:

  • AlphaGo Zero Architecture
  • Supervised Training
  • Self Play pipeline
  • Go Text Protocol
  • Sabaki Engine enabled
  • Tabula rasa (failed)
  • Distributed learning

Credits (in no particular order):

  • Brain Lee
  • Ritchie Ng
  • Samuel Graván
  • 森下 健
  • yuanfengpang

More Repositories

1

C51-DDPG

This is a TensorFlow implementation of DeepMind's A Distributional Perspective on Reinforcement Learning. (C51-DDPG)
Python
10
star
2

CapsNet-Gravitational-Lensing

Estimating parameters of strong gravitational lenses with Capsule networks
Python
8
star
3

Feedback-Alignment

Feedback alignment (FA) is a modification of backpropagation in which the next-layer weights become a fixed random matrix. [Lillicrap et al](https://www.nature.com/articles/ncomms13276) show that FA acts as a regularizer: the next-layer weights must learn to align within 90 degrees for training to be effective. The main obstacle for FA is proving general convergence under nonlinear dynamics.
Python
7
star
4

Matrix-CapsNet-EM-routing-tensorflow

This is a trial implementation of Hinton group's [MATRIX CAPSULES WITH EM ROUTING](https://openreview.net/pdf?id=HJWLfGWRb) in TensorFlow and Python. (For exchange and learning purposes only)
Python
6
star
5

CapsNet-python-tensorflow

This is a Python TensorFlow implementation of [Dynamic Routing Between Capsules](https://arxiv.org/pdf/1710.09829.pdf) (For exchange and learning purposes only)
Python
5
star
6

Emotivoice_TTS

Python
4
star
7

Vulkan

My Vulkan Renderer
C++
3
star
8

Engine2021

Custom game engine made in 2021, inspired by what I learnt at DigiPen Institute of Technology
C++
3
star
9

AIResearchVault

Python
2
star
10

neural-combinatorial-optimization-rl-tensorflow

Python
2
star
11

Galaxy_Zoo_Capsule

Uses a capsule network to detect whether a spiral galaxy, an elliptical galaxy, or both are present.
Jupyter Notebook
2
star
12

CS380_3D_Nav_UE4

C++
1
star
13

FPSCppTemplate-4.21

This project aims to create a KZ Jump like game in UE4! glhf!
C++
1
star
14

InterpreterProject

C++
1
star
15

Paper-I-read

This repo contains scientific papers I have read, kept as a reminder to myself. I hope it is helpful to you too.
1
star
16

HolodeckNavigationTask

C#
1
star
17

CA_MODs

C#
1
star
18

PyTorch-YOLOv3-Overlapping-Galaxy

Python
1
star