YuriyGuts/snake-ai-reinforcement

Stars
158
Rank 237,131 (Top 5 %)
Language
Python
License
MIT License
Created over 7 years ago
Updated almost 6 years ago

YuriyGuts/snake-ai-reinforcement

YuriyGuts

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

AI for Snake game trained from pixels using Deep Reinforcement Learning (DQN).

snake-ai-reinforcement

AI for Snake game trained from pixels using Deep Reinforcement Learning (DQN).

Contains the tools for training and observing the behavior of the agents, either in CLI or GUI mode.

Requirements

All modules require Python 3.6 or above. Note that support for Python 3.7 in TensorFlow is experimental at the time of writing, and requirements may need to be updated as new official versions get released.

Training on GPU is supported but disabled by default. If you have CUDA and would like to use a GPU, use the GPU version of TensorFlow by changing tensorflow to tensorflow-gpu in the requirements file.

To install all Python dependencies, run:

$ make deps

Pre-Trained Models

You can find a few pre-trained DQN agents on the Releases page. Pass the model file to the play.py front-end script (see play.py -h for help).

dqn-10x10-blank.model

An agent pre-trained on a blank 10x10 level (snakeai/levels/10x10-blank.json).
dqn-10x10-obstacles.model

An agent pre-trained on a 10x10 level with obstacles (snakeai/levels/10x10-obstacles.json).

Training a DQN Agent

To train an agent using the default configuration, run:

$ make train

The trained model will be checkpointed during the training and saved as dqn-final.model afterwards.

Run train.py with custom arguments to change the level or the duration of the training (see train.py -h for help).

Playback

The behavior of the agent can be tested either in batch CLI mode where the agent plays a set of episodes and outputs summary statistics, or in GUI mode where you can see each individual step and action.

To test the agent in batch CLI mode, run the following command and check the generated .csv file:

$ make play

To use the GUI mode, run:

$ make play-gui

To play on your own using the arrow keys (I know you want to), run:

$ make play-human

Running Unit Tests

$ make test

kaggle-quora-question-pairs

My solution to Kaggle Quora Question Pairs competition (Top 2%, Private LB log loss 0.13497).

Jupyter Notebook

midichlorian

A Visual Studio extension that allows you to write code and automate the IDE using MIDI musical instruments.

dechorder

Automatic chord recognition application powered by machine learning

syno-plex-update

Automatically check for Plex Media Server updates on Synology NAS and install them. Compatible with DSM 6 and DSM 7, including DSM 7.2.2+.

regex-builder

.NET library for human-readable declaration of regular expressions without having to remember the regex syntax. Looks similar to Expression Trees in .NET.

odsc-target-leakage-workshop

Workshop on Target Leakage in Machine Learning I taught at ODSC Europe 2018 (London) and ODSC East 2019, 2020 (Boston)

Jupyter Notebook

persistent-touch-id-sudo

Configures PAM on macOS via a Launch Daemon so that Touch ID for sudo is always available and persists across OS upgrades

unicode-virtual-keyboard

Windows utility that simplifies the input of Unicode characters by displaying a handy on-demand virtual keyboard with powerful character search functionality and global hotkey support.

thrones2vec

Using Word2Vec to explore semantic similarities between the entities of "A Song of Ice and Fire" ("Game of Thrones").

Jupyter Notebook

azure-cloud-ocr

A simple cloud OCR application that employs Windows Azure Web and Worker Roles, Blobs, Tables, Queues, and uses Google Tesseract for text recognition.

cartpole-q-learning

A cart pole balancing agent powered by Q-Learning.

lits-algorithms-course

Notes and handouts from the Algorithms course I taught at Lviv IT School.

pygoose

A Python package used as a utility tool belt for Kaggle competitions and other Data Science experiments.

dou-topic-modeling

Analyzing the topic structure of DOU.ua comments using Latent Dirichlet Allocation (LDA).

Jupyter Notebook

ansible-role-jupyter

An Ansible role to install and configure Jupyter for Python 3.

pythonscript-namebatch

Generates spells to summon Benedict Cumberbatch.

dunedynasty-macos

A fork of Dune Dynasty (http://dunedynasty.sourceforge.net/) that can be built and run on modern Macs, including Apple Silicon (M1)

datarobot-mlbench

Evaluation of the DataRobot platform on the mlbench benchmark [H. Zhang et al., 2017]

ucu-nlp-workshop

Supplementary resources for the NLP Summer Workshop I taught at UCU.

Jupyter Notebook

enex2csv

Convert Evernote ENEX files to CSV, optionally converting note content to Markdown

hammurabi

An online judge for algorithmic contests. Strict, but fair.

intel-8080-asm

A very simple Win32 assembler for Intel 8080 that produces COM binaries for CP/M. I built this during my 2nd university year as a replacement for the tool we had at our lab, which often failed to compile large programs and produced misleading error messages.

ucu-ai-checkers

Checkers game AI development tools for the CS301 AI class I teach at UCU.

winforms-auto-taborder-vsaddin

Visual Studio add-in that adds automatic TabOrder arrangement feature to Windows Forms designer

gdg-speech-classifier

A machine learning system that recognizes the word 'Google' in human speech (demo for my talk @ Lviv GDG meetup).

r-exercises

Programming exercises for R: http://www2.warwick.ac.uk/fac/sci/statistics/staff/academic-research/reed/rexercises.pdf

libcheckers

International checkers gameplay library for the CS301 AI course I teach at UCU.

ansible-role-anaconda

An Ansible role to install Anaconda on Linux, along with additional conda packages of your choice.

filesystem-monitor-service

Client-server application (WinForms client + NT Service + MS Access DB) for monitoring changes to a remote file system [university project, 2009]

streamlit-blackout-stats

Streamlit app for visualizing power outage statistics. Uses Google Sheets as the data source.