• Stars
    star
    135
  • Rank 267,597 (Top 6 %)
  • Language Jinja
  • Created almost 4 years ago
  • Updated over 3 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

TLeague Project Page

This is the project page for the following technical reports:

Lei Han∗, Jiechao Xiong∗, Peng Sun∗, Xinghai Sun, Meng Fang, Qingwei Guo, Qiaobo Chen, Tengfei Shi, Hongsheng Yu, Zhengyou Zhang. TStarBot-X: An Open-Sourced and Comprehensive Study for Efficient League Training in StarCraft II Full Game. arXiv preprint arXiv:2011.13729, 2020. (* Equal contribution, correspondence to the first three authors)

Peng Sun∗, Jiechao Xiong∗, Lei Han∗, Xinghai Sun, Shuxing Li, Jiawei Xu, Meng Fang, Zhengyou Zhang. TLeague: A Framework for Competitive Self-Play based Distributed Multi-Agent Reinforcement Learning. arXiv preprint arXiv:2011.12895, 2020. (* Equal contribution, correspondence to the first three authors)

Impatient reader for the StarCraft II AI TStarBot-X could see the TStarBot-X project page here.

Quick Start

Usage

Python>=3.6 is required. We've tested Python 3.6.5.

Minimal Working Example

To use the TLeague framework and run a minimal training, one needs to install the following basic packages:

  • TLeague: the main logic of Competitive SelfPlay MultiAgent Reinforcement Learning.
  • TPolicies: a lib for building Neural Net used in RL and IL.
  • Arena: a lib of environments and env-agent interfaces.

See the docs therein for how to install TLeague, TPolicies, Arena, respectively. Briefly, it amounts to git-cloning/downloading the repos and do the in-place pip installation. For examples,

git clone https://github.com/tencent-ailab/TLeague.git ~/TLeague
git clone https://github.com/tencent-ailab/TPolicies.git ~/TPolicies
git clone https://github.com/tencent-ailab/Arena.git ~/Arena
cd ~/TLeague && pip install -e . && cd ~
cd ~/TPolicies && pip install -e . && cd ~
cd ~/Arena && pip install -e . && cd ~
# manually install tensorflow 1.15.0 as required by TPolicies
pip install tensorflow==1.15.0

Then, try the example of training with the simple game pong-2p (an environment contained in Arena) as a sanity check. See the link here.

To run training for other environments, extra binaries and/or packages must be installed, as explained in the following.

StarCraft II Training

When installing the Arena package, one needs additionally install TImitate, which is a lib for SC2 observation and action, zstat extraction, replay parsing, etc. See also the link here.

Here are examples for both Reinforcement Learning (CSP-MARL) and Imitation Learning in a single machine.

TODO: pointer to the Docker Auto Build repo and say it's yet-another guide to installation from scratch.

TODO: texts for how to train with k8s

ViZDoom Training

When installing the Arena package, one needs additionally install ViZDoom (>=1.1.8), see the link here.

Here are examples of how to train ViZDoom in a single machine.

Refer also to the link here for how to (auto-)build the docker image, which is yet-another guide to installation from scratch.

For running training over a k8s cluster, see the link here.

Pommerman Training

When installing the Arena package, one needs additionally install Pommerman, see the link here.

Here are examples for how to train Pommerman in a single machine.

Refer also to the link here for how to (auto-)build the docker image, which is yet-another guide to installation from scratch.

For running training over a k8s cluster, see the link here.

Single Agent RL

TLeague also works for pure RL, which can be viewed as a special case of MARL where the number of agents equals to one. Here are examples for how to train gym Atari in a single machine.

Ensure the correct dependencies are installed:

pip install gym[atari]==0.12.1

Disclaimer

This is not an officially supported Tencent product. The code and data in this repository are for research purpose only. No representation or warranty whatsoever, expressed or implied, is made as to its accuracy, reliability or completeness. We assume no liability and are not responsible for any misuse or damage caused by the code and data. Your use of the code and data are subject to applicable laws and your use of them is at your own risk.

More Repositories

1

IP-Adapter

The image prompt adapter is designed to enable a pretrained text-to-image diffusion model to generate images with image prompt.
Jupyter Notebook
4,945
star
2

V-Express

V-Express aims to generate a talking head video under the control of a reference image, an audio, and a sequence of V-Kps images.
Python
2,170
star
3

persona-hub

Official repo for the paper "Scaling Synthetic Data Creation with 1,000,000,000 Personas"
Python
768
star
4

hifi3dface

Code and data for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".
Python
758
star
5

hok_env

Honor of Kings AI Open Environment of Tencent
Python
609
star
6

pika

a lightweight speech processing toolkit based on Pytorch and (Py)Kaldi
Python
338
star
7

grover

This is a Pytorch implementation of the paper: Self-Supervised Graph Transformer on Large-Scale Molecular Data
Python
322
star
8

bddm

BDDM: Bilateral Denoising Diffusion Models for Fast and High-Quality Speech Synthesis
Python
216
star
9

FRA-RIR

Python
167
star
10

DrugOOD

OOD Dataset Curator and Benchmark for AI-aided Drug Discovery
Python
149
star
11

PCDMs

Implementation code:Advancing Pose-Guided Image Synthesis with Progressive Conditional Diffusion Models
Jupyter Notebook
145
star
12

Frequency_Aug_VAE_MoESR

Latent-based SR using MoE and frequency augmented VAE decoder
Python
145
star
13

3m-asr

3M: Multi-loss, Multi-path and Multi-level Neural Networks for speech recognition
Python
117
star
14

TLeague

Python
79
star
15

RLogist

RLogist = RL (reinforcement learning) + Pathologist
Python
65
star
16

MDM

MDM
Python
43
star
17

Lodoss

Python
34
star
18

UltraDualPathCompression

A Pytorch-based implementation of the compression and decompression module in "Ultra Dual-Path Compression For Joint Echo Cancellation And Noise Suppression".
Jupyter Notebook
29
star
19

mini-hok

Mini HoK: a novel MARL benchmark based on the popular mobile game, Honor of Kings, to address limitations in existing environments such as complexity and accessibility.
Python
28
star
20

TriNet

TriNet: stabilizing self-supervised learning from complete or slow collapse on ASR.
Python
26
star
21

ICML21_OAXE

Python
23
star
22

season

[EMNLP 2022] Salience Allocation as Guidance for Abstractive Summarization
Python
22
star
23

hokoff

Python
20
star
24

hifi3dface_projpage

Project page for our paper "High-Fidelity 3D Digital Human Creation from RGB-D Selfies".
HTML
15
star
25

GrndPodcastSum

(ACL 2022) The source code for the paper "Towards Abstractive Grounded Summarization of Podcast Transcripts"
Python
13
star
26

OASum

13
star
27

EMNLP21_SemEq

This repo is the code release of EMNLP 2021 conference paper "Connect-the-Dots: Bridging Semantics between Words and Definitions via Aligning Word Sense Inventories".
Python
12
star
28

learning_singing_from_speech

Project page for our paper "DurIAN : DurIAN-SC: Duration Informed Attention Network based Singing Voice Conversion System".
10
star
29

valuationgame

Jupyter Notebook
9
star
30

Arena

Python
8
star
31

MetaLogic

Python
8
star
32

ZED

This is the repository for EMNLP 2022 paper "Efficient Zero-shot Event Extraction with Context-Definition Alignment"
Python
8
star
33

machine-translation

Open source on machine translation
7
star
34

TPolicies

Python
6
star
35

zebra-inference

Python
5
star
36

Interformer

Jupyter Notebook
5
star
37

FOLNet

This repository includes the code for First-Order Logic Network (FOLNet).
Python
4
star
38

TLeagueAutoBuild

Python
4
star
39

TImitate

Python
2
star
40

siam

2
star