• Stars
    star
    167
  • Rank 226,635 (Top 5 %)
  • Language
    Python
  • Created 10 months ago
  • Updated 5 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

PokéLLMon

This is the implementation for the paper "PokéLLMon: A Human-Parity Agent for Pokémon battles with Large Language Models."

PokemonBattle

The documentation will be fully updated within 3 days.

Requirements:

python >= 3.8
openai >= 1.7.2

Configuration

Configuring a Local Pokémon Showdown Server (Battle Engine)

  1. Install Node.js v10+.
  2. Clone the Pokémon Showdown repository and set it up:
git clone https://github.com/smogon/pokemon-showdown.git
cd pokemon-showdown
npm install
cp config/config-example.js config/config.js
node pokemon-showdown start --no-security
Enter "http://localhost:8000/" in your browsers.

Configuring OpenAI API

Get GPT-4 API from https://platform.openai.com/account/api-keys

Configuring Players

Register in your account on https://play.pokemonshowdown.com/ and get your password.

from poke_env import AccountConfiguration
# No authentication required for the local server
my_account_config = AccountConfiguration("your_username", "your_password")
player = Player(account_configuration=my_account_config)

# Authentication required for the official showdown server
from poke_env import AccountConfiguration, ShowdownServerConfiguration
my_account_config = AccountConfiguration("your_username", "your_password")
player = Player(account_configuration=my_account_config, server_configuration=ShowdownServerConfiguration)

Let's Play Pokémon Battles!!

Battle Against the Heuristic Bot Locally

python vs_bot.py # fill in your username and password for PokeLLMon

Battle Against a Human Player Locally

First, log into your other account manually on the local server, choosing "[Gen 8] Random Battle".

python vs_local_player.py # fill in your username and password for PokeLLMon

Battle Against Ladder Players on Pokémon Showdown

Open and log in: https://play.pokemonshowdown.com/

python vs_ladder_player.py # fill in your username and password for PokeLLMon, no need to set up local server.

Acknowledgement

The environment is implemented based on Poke Env.

More Repositories

1

awesome-LLM-game-agent-papers

A Survey on Large Language Model-Based Game Agents
202
star
2

DataPoisoning_FL

Code for Data Poisoning Attacks Against Federated Learning Systems
Python
158
star
3

TOG

Real-time object detection is one of the key applications of deep neural networks (DNNs) for real-world mission-critical systems. While DNN-powered object detection systems celebrate many life-enriching opportunities, they also open doors for misuse and abuse. This project presents a suite of adversarial objectness gradient attacks, coined as TOG, which can cause the state-of-the-art deep object detection networks to suffer from untargeted random attacks or even targeted attacks with three types of specificity: (1) object-vanishing, (2) object-fabrication, and (3) object-mislabeling. Apart from tailoring an adversarial perturbation for each input image, we further demonstrate TOG as a universal attack, which trains a single adversarial perturbation that can be generalized to effectively craft an unseen input with a negligible attack time cost. Also, we apply TOG as an adversarial patch attack, a form of physical attacks, showing its ability to optimize a visually confined patch filled with malicious patterns, deceiving well-trained object detectors to misbehave purposefully.
Jupyter Notebook
121
star
4

BERT4ETH

BERT4ETH: A Pre-trained Transformer for Ethereum Fraud Detection (WWW23)
Python
96
star
5

GPTLens

Large Language Model-Powered Smart Contract Vulnerability Detection: New Perspectives (TPS23)
Solidity
70
star
6

EllipticPlusPlus

Elliptic++ Dataset: A Graph Network of Bitcoin Blockchain Transactions and Wallet Addresses
Jupyter Notebook
64
star
7

CPL_attack

Jupyter Notebook
34
star
8

awesome_LLM-harmful-fine-tuning-papers

A survey on harmful fine-tuning issue for large language model
30
star
9

scale-fl

Code for ScaleFL
Python
28
star
10

AdaTrace

Utility-aware synthesis of differentially private and attack-resilient location traces
Java
21
star
11

LRBench

A learning rate recommending and benchmarking tool.
Python
19
star
12

CLDP

Secure and utility-aware data collection with condensed local differential privacy
16
star
13

FastSwap

Dynamic and Transparent Memory Sharing for Accelerating Big Data Analytics Workloads in Virtualized Cloud
C
16
star
14

Vaccine

This is the official code for the paper "Vaccine: Perturbation-aware Alignment for Large Language Models" (NeurIPS2024)
Shell
15
star
15

EnsembleBench

A holistic framework for promoting high diversity ensemble learning.
Python
13
star
16

XMemPod

A disaggregated memory orchestration system that virtualizes cluster wide memory to scale data intensive, large memory workloads in virtualized clouds
Batchfile
13
star
17

Fed-CDP

Gradient-Leakage Resilient Federated Learning
Python
13
star
18

DLEdge

This project accelerates deep learning models on the edge with the support of running ensemble learning for performance improvement. The current version accelerates YOLO object detectors with Intel Neural Compute Stick 2 using parallel programming.
Python
13
star
19

Lockdown

A backdoor defense for federated learning via isolated subspace training (NeurIPS2023)
Python
13
star
20

EENet

Code for Adaptive Deep Neural Network Inference Optimization with EENet
Python
11
star
21

GTDLBench

Benchmarking Deep Learning Frameworks
Python
10
star
22

EMO

Efficient Multi-Object Tracking for Edge devices
Python
10
star
23

DP-Ensemble

Diversity Optimized Ensemble
Python
8
star
24

STDLens

7
star
25

Lisa

This is the official code for the paper "Lazy Safety Alignment for Large Language Models against Harmful Fine-tuning" (NeurIPS2024)
Python
7
star
26

DP_modelpublishing

Python
7
star
27

GTAttackPod

Deep neural networks (DNNs) have demonstrated impressive performance on many challenging machine learning tasks. However, DNNs are vulnerable to adversarial inputs generated by adding maliciously crafted perturbations to the benign inputs. As a growing number of attacks have been reported to generate adversarial inputs of varying sophistication, the defense-attack arms race has been accelerated. This project collects state-of-the-art attack algorithms and organizes them with a easy-to-use interface.
Python
7
star
28

EVA

Fast Edge Video Analytics
C++
6
star
29

ZipZap

Python
6
star
30

Perception-Poisoning

Perception Poisoning Attacks in Federated Learning
Python
6
star
31

Booster

This is the official code for the paper "Booster: Tackling Harmful Fine-tuning for Large Language Models via Attenuating Harmful Perturbation".
Shell
5
star
32

DLForecast

Graph mining of the bitcoin transaction data for tasks such as transaction prediction.
Python
5
star
33

HQ-Ensemble

Hierarchical Ensemble Pruning
Python
4
star
34

Valet

Efficient Orchestration of Host and Remote Shared Memory for Memory Intensive Workloads
C
4
star
35

HeteRobust

Python
3
star
36

recap

Code for CVPR24 Paper - Resource-Efficient Transformer Pruning for Finetuning of Large Models
Python
3
star
37

MSJE

MSJE Pytorch implementation
Python
2
star
38

membership_vulnerability

Python
2
star
39

GTModelZoo

HTML
2
star
40

ModelCloak

Code for ICDM 2023 Model Cloaking against Gradient Leakage
Jupyter Notebook
2
star
41

MD-CNN

Network Code for Multi-Dimension Convolutional Neural Network for Bug Localization
Python
1
star
42

GRING-App-Face_Exp_recog

G-RING application : Face Expression Recognition Federated Learning
Go
1
star
43

FUSE

Robust Object Detection Fusion Against Deception
Python
1
star
44

GRING-App-Disease_Classification

G-RING application : Implementation of disease classification federated learning task on G-RING
Go
1
star
45

PFT

Python
1
star
46

XEnsemble-1.0

Code for the XEnsemble Robust Deep Learnning project
Python
1
star
47

gt-mobisim

Simulator for generating mobility traces and query traces for large numbers of mobile agents moving in a road network.
Java
1
star
48

LRBenchPlusPlus

Jupyter Notebook
1
star
49

GRING

Go
1
star
50

TripleBit

TripleBit, a fast and compact system for large scale RDF graph. This is a mirror of https://github.com/CGCL-codes/TripeBit.
C++
1
star
51

SEJE

SEJE Pytorch implementation
Python
1
star
52

llm-topla

Jupyter Notebook
1
star