Max Woolf (@minimaxir)

Top repositories

1

big-list-of-naughty-strings

The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
Python
45,765
star
2

textgenrnn

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
Python
4,936
star
3

hacker-news-undocumented

Some of the hidden norms about Hacker News not otherwise covered in the Guidelines and the FAQ.
3,541
star
4

gpt-2-simple

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
Python
3,367
star
5

simpleaichat

Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
Python
3,329
star
6

facebook-page-post-scraper

Data scraper for Facebook Pages, and also code accompanying the blog post "How to Scrape Data From Facebook Page Posts for Statistical Analysis"
Python
2,100
star
7

person-blocker

Automatically "block" people in images (like Black Mirror) using a pretrained neural network.
Python
2,024
star
8

automl-gs

Provide an input CSV and a target field to predict, generate a model + code to run it.
Python
1,836
star
9

aitextgen

A robust Python tool for text-based AI training and generation using GPT-2.
Python
1,827
star
10

stylecloud

Python package + CLI to generate stylistic wordclouds, including gradients and icon shapes!
Python
817
star
11

gpt-3-experiments

Test prompts for OpenAI's GPT-3 API and the resulting AI-generated texts.
Python
710
star
12

video-to-gif-osx

A set of utilities that allow the user to easily convert video files to very-high-quality GIFs on OS X.
Shell
396
star
13

copy-syntax-highlight-osx

Copy Syntax Highlight for OS X is an OS X service which copies the selected text to the clipboard, with proper syntax highlighting for the given language.
380
star
14

gpt-2-cloud-run

Text-generation API via GPT-2 for Cloud Run
HTML
313
star
15

reactionrnn

Python module + R package to predict the reactions to a given text using a pretrained recurrent neural network.
Python
298
star
16

download-tweets-ai-text-gen

Python script to download public Tweets from a given Twitter account into a format suitable for AI text generation.
Python
219
star
17

tweet-generator

Train a neural network optimized for generating tweets based off of any number of Twitter users.
Python
218
star
18

char-embeddings

A repository containing 300D character embeddings derived from the GloVe 840B/300D dataset, plus code that uses these embeddings to train a deep learning model to generate Magic: The Gathering cards using Keras
Python
214
star
19

magic-the-gifening

A Twitter bot which tweets Magic: The Gathering cards with appropriate GIFs superimposed onto them.
Python
214
star
20

system-dashboard

Minimalist Win/OSX/Linux System Dashboard using Flask and Freeboard
HTML
199
star
21

imgmaker

Create high-quality images programmatically with easily-hackable templates.
Python
168
star
22

ctrl-gce

Set up the CTRL text-generating model on Google Compute Engine with just a few console commands.
Shell
154
star
23

ai-generated-pokemon-rudalle

Python script to preprocess images of all Pokémon to finetune ruDALL-E
Python
140
star
24

imgbeddings

Python package to generate image embeddings with CLIP without PyTorch/TensorFlow
Python
122
star
25

mtg-gpt-2-cloud-run

Code and UI for running a Magic card text generator API via GPT-2
HTML
119
star
26

get-all-hacker-news-submissions-comments

Simple Python scripts to download all Hacker News submissions and comments and store them in a PostgreSQL database.
Python
116
star
27

hacker-news-gpt-2

Dump of generated texts from GPT-2 trained on Hacker News titles
114
star
28

reddit-bigquery

Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily
R
110
star
29

facebook-ad-library-scraper

A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.
Python
108
star
30

optillusion-animation

Python code to submit rotated images to the Cloud Vision API + R code for visualizing it
Python
99
star
31

chatgpt_api_test

Demos utilizing the ChatGPT API
Jupyter Notebook
95
star
32

gpt-3-client

A client for OpenAI's GPT-3 API for ad hoc testing of prompts without using the web interface.
Python
90
star
33

stable-diffusion-negative-prompt

Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.
Jupyter Notebook
86
star
34

stylistic-word-clouds

Python scripts for creating stylistic word clouds
Python
85
star
35

gpt3-blog-title-optimizer

Python code for building a GPT-3 based technical blog post optimizer.
Jupyter Notebook
84
star
36

amazon-spark

R Code + R Notebook for analyzing millions of Amazon reviews using Apache Spark
HTML
83
star
37

twcloud

Python package + CLI to generate wordclouds of Twitter tweets.
Python
75
star
38

twitter-cloud-run

A (relatively) minimal configuration app to run Twitter bots on a schedule that can scale to unlimited bots.
Python
75
star
39

get-profile-data-of-repo-stargazers

This repository contains a script used to get the GitHub profile information of all the people who've starred a given GitHub repository
Python
68
star
40

deep-learning-cpu-gpu-benchmark

Repository to benchmark the performance of Cloud CPUs vs. Cloud GPUs on TensorFlow and Google Compute Engine.
HTML
67
star
41

gpt-j-6b-experiments

Test prompts for GPT-J-6B and the resulting AI-generated texts
55
star
42

icon-image

Python script to quickly generate a Font Awesome icon imposed on a background for steering AI image generation.
Python
53
star
43

hacker-news-download-all-stories

Download *ALL* the submissions from Hacker News
Python
52
star
44

ml-data-generator

Python script to generate fake datasets optimized for testing machine learning/deep learning workflows
Python
51
star
45

clickbait-cluster

Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly
HTML
47
star
46

keras-cntk-docker

Docker container for Keras + CNTK intended for nvidia-docker
Python
42
star
47

foursquare-venue-scraper

A Foursquare data scraper that gathers all venues within a specified geographic area.
Python
39
star
48

interactive-facebook-reactions

Jupyter notebook + Code for processing Facebook Reactions data and making Interactive Charts
HTML
38
star
49

youtube-video-scraper

Tools for scraping YouTube video metadata (mostly for training AI on video titles)
Python
35
star
50

nyc-taxi-notebook

R Code + Jupyter notebook for analyzing and visualizing NYC Taxi data
R
31
star
51

sdxl-experiments

Jupyter Notebooks for experimenting with Stable Diffusion XL 1.0
Jupyter Notebook
30
star
52

yelp-review-analysis

Repository containing a script showing how I processed and charted Yelp data.
R
29
star
53

subreddit-generator

Train a neural network optimized for generating Reddit subreddit posts
Python
28
star
54

predict-reddit-submission-success

Repository w/ Jupyter + R Notebooks for creating a model to predict the success of Reddit submissions with Keras.
HTML
28
star
55

tritonize

Convert images to a styled, minimal representation, quickly with NumPy
Python
28
star
56

langchain-problems

Demos of some issues with LangChain.
Jupyter Notebook
27
star
57

keras-cntk-benchmark

Code for Benchmarking CNTK performance on Keras vs. TensorFlow
Python
26
star
58

autotweet-from-googlesheet

A minimal proof-of-concept Python script to tweet human-curated Tweets on a schedule.
Python
26
star
59

frames-to-gif-osx

An application that allows the user to easily convert frames to very-high-quality GIFs on OS X.
26
star
60

minimaxir.github.io

Blog Posts and Theme for https://minimaxir.com
HTML
25
star
61

legaladvice-gpt2

Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles
24
star
62

ggplot-tutorial

Repository for ggplot2 tutorial
R
23
star
63

sf-arrests-when-where

R Code + Jupyter notebook for replicating analysis of when and where arrests in San Francisco occur.
R
21
star
64

pokemon-3d

Code + Visualizations processing and visualizing Pokémon data in 3D
HTML
20
star
65

reddit-gpt-2-cloud-run

Reddit title generator API based on GPT-2
HTML
20
star
66

facebook-keyword-regression-analysis

Regression Analysis for Facebook keywords.
R
20
star
67

chatgpt-structured-data

Demos of ChatGPT's function calling/structured data support.
Jupyter Notebook
20
star
68

stylecloud-examples

Examples of stylistic word clouds generated via the stylecloud Python package
Python
19
star
69

stack-overflow-survey

Code + Visualizations for processing 2016 Stack Overflow Survey Data
Jupyter Notebook
19
star
70

get-bars-from-foursquare

A quick pair of Python scripts to retrieve all bars within a given area, then retrieve metadata and process it.
Python
19
star
71

subreddit-related

Code and visualizations for related/similar subreddits
Jupyter Notebook
18
star
72

get-heart-rate-csv

A small Python script to get the heart rate data generated from an Apple Watch in a CSV form
Python
18
star
73

ai-generated-magic-cards

Tools for encoding Magic: The Gathering cards into a form suitable for AI text generation
Python
17
star
74

tensorflow-multiprocess-ray

Proof of concept on how to use TensorFlow for prediction tasks in a multiprocess setting.
Python
17
star
75

pokemon-ai

A text-generating AI to generate Pokémon names.
Python
17
star
76

reddit-comment-length

R code needed to reproduce the "Relationship between Reddit Comment Score and Comment Length for 1.66 Billion Comments" visualization
R
17
star
77

mtg-card-creator-api

Code for running a Magic card image generator API
Python
16
star
78

reddit-graph

Jupyter notebook + Code for reproducing Reddit Subreddit graphs
Jupyter Notebook
16
star
79

ncaa-basketball

R Code + R Notebook on how to process and visualize NCAA basketball data.
R
16
star
80

automl-gs-examples

Examples + Visualizations of datasets modeled using automl-gs
Python
15
star
81

chatgpt-tips-analysis

Jupyter Notebooks for testing the impact of tip incentives for ChatGPT
Jupyter Notebook
15
star
82

sfba-compensation

Jupyter notebook + Code for scraping AngelList data and making an interactive chart of SFBA salaries/equity
HTML
14
star
83

hacker-news-comment-analysis

Code used for analysis of Hacker News comments.
R
13
star
84

char-tsne-visualization

Visualizations of character embeddings from derived character vectors.
HTML
13
star
85

resetera-gpt-2

Scraper of ResetEra threads and posts to get them into a format suitable for feeding them into GPT-2.
Python
13
star
86

get-data-from-photos-from-instagram-tags

Processes data from images which are tagged with the specified Instagram tag.
Python
12
star
87

imdb-data-analysis

R Code + R Notebook on how to process and visualize the official IMDb datasets.
12
star
88

hn-heatmaps

Code and data necessary to reproduce heatmaps relating HN Submission time to submission score.
R
12
star
89

sf-crimes-covid

Spot checking the impact of SF shelter-in-place orders on crime reporting.
12
star
90

gpt-2-fanfiction

Experiments with generating GPT-2 fanfiction on specified topics.
11
star
91

notebooks

This GitHub Repository stores my R Notebooks, allowing GitHub Pages to serve the R Notebooks on my website
HTML
11
star
92

imgur-decline

R Code + R Notebook for analyzing the decline of Imgur on Reddit.
HTML
11
star
93

all-marvel-comics-characters

Creates a .csv of all Marvel Comics Characters + Statistics via the Marvel API
Python
10
star
94

movie-gender

Data and code for analyzing Movie Lead Gender.
Jupyter Notebook
10
star
95

online-class-charts

Code needed to reproduce data analysis and charts for MIT/Harvard Online Course Data
R
9
star
96

ggplot2-web

R Code + R Notebook on how to make high quality data visualizations on the web with ggplot2.
HTML
9
star
97

reddit-subreddit-keywords

Code + Jupyter notebook for analyzing and visualizing means and medians of keywords in the top Reddit Subreddits.
R
8
star
98

reddit-mean-score

Quick data visualization for Reddit Mean Submission Score by Subreddit
8
star
99

sf-arrests-predict

R Code + R Notebook for predicting arrest types in San Francisco.
HTML
8
star
100

breach-network

R Code + R Notebook for creating an interactive graph network of Have I Been Pwned data using R and Plotly.
HTML
8
star