moyix/fauxpilot

Stars
4,588
Rank 9,260 (Top 0.2 %)
Language
Python
Created over 2 years ago

moyix/fauxpilot

moyix

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

FauxPilot - an open-source GitHub Copilot server

FauxPilot

This is an attempt to build a locally hosted alternative to GitHub Copilot. It uses the SalesForce CodeGen models inside of NVIDIA's Triton Inference Server with the FasterTransformer backend.

Prerequisites

You'll need:

Docker
docker compose >= 1.28
An NVIDIA GPU with Compute Capability >= 6.0 and enough VRAM to run the model you want.
nvidia-docker
curl and zstd for downloading and unpacking the models.

Note that the VRAM requirements listed by setup.sh are total -- if you have multiple GPUs, you can split the model across them. So, if you have two NVIDIA RTX 3080 GPUs, you should be able to run the 6B model by putting half on each GPU.

Support and Warranty

lmao

Okay, fine, we now have some minimal information on the wiki and a discussion forum where you can ask questions. Still no formal support or warranty though!

Setup

This section describes how to install a Fauxpilot server and clients.

Setting up a FauxPilot Server

Run the setup script to choose a model to use. This will download the model from Huggingface/Moyix in GPT-J format and then convert it for use with FasterTransformer.

Please refer to How to set-up a FauxPilot server.

Client configuration for FauxPilot

We offer some ways to connect to FauxPilot Server. For example, you can create a client by how to open the Openai API, Copilot Plugin, REST API.

Please refer to How to set-up a client.

Terminology

API: Application Programming Interface
CC: Compute Capability
CUDA: Compute Unified Device Architecture
FT: Faster Transformer
JSON: JavaScript Object Notation
gRPC: Remote Procedure call by Google
GPT-J: A transformer model trained using Ben Wang's Mesh Transformer JAX
REST: REpresentational State Transfer

gpt-wpre

Whole-Program Reverse Engineering with GPT-3

pdbparse

Python code to parse Microsoft PDB files

creddump

Automatically exported from code.google.com/p/creddump

panda

Deprecated repo for PANDA 1.0 – see PANDA 2.0 repository

fpsmt_gpu

Solving floating point SMT constraints on a GPU

panda-malrec

A system to record malware using PANDA

wdepy

Decryption utility for PGP Whole Disk Encryption

elmfuzz

Evolving fuzzers with large language models

csaw23_nervcenter

Pwn+Crypto challenge for CSAW 2023 Finals

func_asm_pairgen

Horrifying scripts / infrastructure to extract info from a large amount of C/C++ code

2_ffast_2_furious

A more realistic demo of a buffer overflow cause by -ffast-math

irq_fuzzer

AsleepKeyboardDataset

mmgrep

Fast search for binary strings

hilbert_kcov

fbtools

Some python tools to hack on Fitbits

virtuoso

Automatically exported from code.google.com/p/virtuoso

polycoder_wrap

Wrapper to do text generation with VHellendoorn's PolyCoder model

codex_cli

Script to hook OpenAI's Codex up to a Linux VM and try to execute commands

panda_plugins_moyix

Repository for plugins that are useful to me but not generally applicable

pandalog_taint_parser

A fast, parallel parser for PANDA taint logs

ffdemo

Flush+Flush attack demo

vidcolortree

synthehol

A clone of Nick Fitzgerald's minisynth-rs

scripts

debbuild

Tools and scripts for rebuilding all of Debian with bear (I should have used rebuilderd :p)

AppSecAssignment1

ptrml

Using DNNs for dumb tasks: recognizing pointers

appsec_hw1

codeql_weird_minimal

Minimal example of weird CodeQL behavior

feckless-woof

bwlightning

cardinal

Cardinal Pill Testing on Linux

appsec_hw2

ipptests

Small tests to benchmark inter vs intra process communication.

heapmap