Max Woolf (@minimaxir)

Top repositories

1

big-list-of-naughty-strings

The Big List of Naughty Strings is a list of strings which have a high probability of causing issues when used as user-input data.
Python
46,104
star
2

textgenrnn

Easily train your own text-generating neural network of any size and complexity on any text dataset with a few lines of code.
Python
4,941
star
3

hacker-news-undocumented

Some of the hidden norms about Hacker News not otherwise covered in the Guidelines and the FAQ.
3,616
star
4

simpleaichat

Python package for easily interfacing with chat apps, with robust features and minimal code complexity.
Python
3,463
star
5

gpt-2-simple

Python package to easily retrain OpenAI's GPT-2 text-generating model on new texts
Python
3,402
star
6

facebook-page-post-scraper

Data scraper for Facebook Pages, and also code accompanying the blog post How to Scrape Data From Facebook Page Posts for Statistical Analysis
Python
2,116
star
7

person-blocker

Automatically "block" people in images (like Black Mirror) using a pretrained neural network.
Python
2,022
star
8

automl-gs

Provide an input CSV and a target field to predict, generate a model + code to run it.
Python
1,845
star
9

aitextgen

A robust Python tool for text-based AI training and generation using GPT-2.
Python
1,831
star
10

stylecloud

Python package + CLI to generate stylistic wordclouds, including gradients and icon shapes!
Python
825
star
11

gpt-3-experiments

Test prompts for OpenAI's GPT-3 API and the resulting AI-generated texts.
Python
702
star
12

video-to-gif-osx

A set of utilities that allow the user to easily convert video files to very-high-quality GIFs on OS X.
Shell
395
star
13

copy-syntax-highlight-osx

Copy Syntax Highlight for OS X is an OS X service which copies the selected text to the clipboard, with proper syntax highlighting for the given language.
381
star
14

gpt-2-cloud-run

Text-generation API via GPT-2 for Cloud Run
HTML
313
star
15

reactionrnn

Python module + R package to predict the reactions to a given text using a pretrained recurrent neural network.
Python
299
star
16

gpt-2-keyword-generation

Method to encode text for GPT-2 to generate text based on provided keywords
Python
260
star
17

download-tweets-ai-text-gen

Python script to download public Tweets from a given Twitter account into a format suitable for AI text generation.
Python
220
star
18

tweet-generator

Train a neural network optimized for generating tweets based off of any number of Twitter users.
Python
218
star
19

char-embeddings

A repository containing 300D character embeddings derived from the GloVe 840B/300D dataset, and uses these embeddings to train a deep learning model to generate Magic: The Gathering cards using Keras
Python
214
star
20

magic-the-gifening

A Twitter bot which tweets Magic: the Gathering cards with appropriate GIFs superimposed onto them.
Python
212
star
21

system-dashboard

Minimalist Win/OSX/Linux System Dashboard using Flask and Freeboard
HTML
200
star
22

imgmaker

Create high-quality images programmatically with easily-hackable templates.
Python
175
star
23

ctrl-gce

Set up the CTRL text-generating model on Google Compute Engine with just a few console commands.
Shell
151
star
24

ai-generated-pokemon-rudalle

Python script to preprocess images of all Pokémon to finetune ruDALL-E
Python
138
star
25

imgbeddings

Python package to generate image embeddings with CLIP without PyTorch/TensorFlow
Python
134
star
26

mtg-gpt-2-cloud-run

Code and UI for running a Magic card text generator API via GPT-2
HTML
120
star
27

get-all-hacker-news-submissions-comments

Simple Python scripts to download all Hacker News submissions and comments and store them in a PostgreSQL database.
Python
119
star
28

hacker-news-gpt-2

Dump of generated texts from GPT-2 trained on Hacker News titles
117
star
29

facebook-ad-library-scraper

A Python scraper for the Facebook Ad Library, using the official Facebook Ad Library API.
Python
114
star
30

reddit-bigquery

Code + Jupyter notebook for analyzing and visualizing Reddit Data quickly and easily
R
112
star
31

optillusion-animation

Python code to submit rotated images to the Cloud Vision API + R code for visualizing it
Python
99
star
32

chatgpt_api_test

Demos utilizing the ChatGPT API
Jupyter Notebook
96
star
33

gpt-3-client

A client for OpenAI's GPT-3 API for ad hoc testing of prompt without using the web interface.
Python
90
star
34

stable-diffusion-negative-prompt

Jupyter Notebooks for experimenting with negative prompting with Stable Diffusion 2.0.
Jupyter Notebook
87
star
35

stylistic-word-clouds

Python scripts for creating stylistic word clouds
Python
85
star
36

gpt3-blog-title-optimizer

Python code for building a GPT-3 based technical blog post optimizer.
Jupyter Notebook
83
star
37

amazon-spark

R Code + R Notebook for analyzing millions of Amazon reviews using Apache Spark
HTML
83
star
38

twcloud

Python package + CLI to generate wordclouds of Twitter tweets.
Python
76
star
39

twitter-cloud-run

A (relatively) minimal configuration app to run Twitter bots on a schedule that can scale to unlimited bots.
Python
76
star
40

deep-learning-cpu-gpu-benchmark

Repository to benchmark the performance of Cloud CPUs vs. Cloud GPUs on TensorFlow and Google Compute Engine.
HTML
67
star
41

get-profile-data-of-repo-stargazers

This repository contains a script used to get the GitHub profile information of all the people who've Stared a given GitHub repository
Python
67
star
42

icon-image

Python script to quickly generate a Font Awesome icon imposed on a background for steering AI image generation.
Python
53
star
43

gpt-j-6b-experiments

Test prompts for GPT-J-6B and the resulting AI-generated texts
53
star
44

ml-data-generator

Python script to generate fake datasets optimized for testing machine learning/deep learning workflows
Python
51
star
45

hacker-news-download-all-stories

Download *ALL* the submissions from Hacker News
Python
51
star
46

clickbait-cluster

Code + Jupyter Notebooks for Visualizing Clusters of Clickbait Headlines Using Spark, Word2vec, and Plotly
HTML
47
star
47

keras-cntk-docker

Docker container for keras + cntk intended for nvidia-docker
Python
42
star
48

foursquare-venue-scraper

A Foursquare data scraper that gathers all venues within a specified geographic area.
Python
39
star
49

interactive-facebook-reactions

Jupyter notebook + Code for processing Facebook Reactions data and making Interactive Charts
HTML
38
star
50

youtube-video-scraper

Tools for scraping YouTube video metadata (mostly for training AI on video titles)
Python
38
star
51

nyc-taxi-notebook

R Code + Jupyter notebook for analyzing and visualizing NYC Taxi data
R
31
star
52

sdxl-experiments

Jupyter Notebooks for experimenting with Stable Diffusion XL 1.0
Jupyter Notebook
30
star
53

yelp-review-analysis

Repository containing script on how I processed and charted Yelp data.
R
29
star
54

langchain-problems

Demos of some issues with LangChain.
Jupyter Notebook
29
star
55

subreddit-generator

Train a neural network optimized for generating Reddit subreddit posts
Python
28
star
56

predict-reddit-submission-success

Repository w/ Jupyter + R Notebooks for creating a model to predict the success of Reddit submissions with Keras.
HTML
28
star
57

autotweet-from-googlesheet

A minimal proof-of-concept Python script to tweet human-curated Tweets on a schedule.
Python
27
star
58

tritonize

Convert images to a styled, minimal representation, quickly with NumPy
Python
27
star
59

keras-cntk-benchmark

Code for Benchmarking CNTK performance on Keras vs. TensorFlow
Python
26
star
60

frames-to-gif-osx

An application that allows the user to easily convert frames to very-high-quality GIFs on OS X.
26
star
61

minimaxir.github.io

Blog Posts and Theme for https://minimaxir.com
HTML
25
star
62

ggplot-tutorial

Repository for ggplot2 tutorial
R
24
star
63

legaladvice-gpt2

Dump of generated texts from GPT-2 trained on /r/legaladvice subreddit titles
23
star
64

chatgpt-structured-data

Demos of ChatGPT's function calling/structured data support.
Jupyter Notebook
22
star
65

sf-arrests-when-where

R Code + Jupyter notebook for replicating analysis of when and where arrests in San Francisco occur.
R
22
star
66

pokemon-3d

Code + Visualizations processing and visualizing Pokémon data in 3D
HTML
21
star
67

reddit-gpt-2-cloud-run

Reddit title generator API based on GPT-2
HTML
20
star
68

facebook-keyword-regression-analysis

Regression Analysis for Facebook keywords.
R
20
star
69

chatgpt-tips-analysis

Jupyter Notebooks for testing the impact of tip incentives for ChatGPT
Jupyter Notebook
20
star
70

stylecloud-examples

Examples of stylistic word clouds generated via the stylecloud Python package
Python
19
star
71

stack-overflow-survey

Code + Visualizations for processing 2016 Stack Overflow Survey Data
Jupyter Notebook
19
star
72

get-heart-rate-csv

A small Python script to get the heart rate data generated from an Apple Watch in a CSV form
Python
19
star
73

get-bars-from-foursquare

A quick pair of Python scripts to retrieve all bars within a given area, then retrieve metadata and process it.
Python
19
star
74

subreddit-related

Code and visualizations for related/similar subreddits
Jupyter Notebook
19
star
75

ai-generated-magic-cards

Tools for encoding Magic: The Gathering cards into a form suitable for AI text generation
Python
17
star
76

tensorflow-multiprocess-ray

Proof of concept on how to use TensorFlow for prediction tasks in a multiprocess setting.
Python
17
star
77

pokemon-ai

A text-generating AI to generate Pokémon names.
Python
17
star
78

reddit-comment-length

R code needed to reproduce Relationship between Reddit Comment Score and Comment Length for 1.66 Billion Comments visualization
R
17
star
79

mtg-card-creator-api

Code for running a Magic card image generator API
Python
16
star
80

automl-gs-examples

Examples + Visualizations of datasets modeled using automl-gs
Python
16
star
81

reddit-graph

Jupyter notebook + Code for reproducing Reddit Subreddit graphs
Jupyter Notebook
16
star
82

ncaa-basketball

R Code + R Notebook on how to process and visualize NCAA basketball data.
R
16
star
83

pokemon-embeddings

Jupyter Notebooks and an R Notebook for encoding Pokémon embeddings and creating data visualizations.
Jupyter Notebook
16
star
84

sfba-compensation

Jupyter notebook + Code for scraping AngelList data and making an interactive chart of SFBA salaries/equity
HTML
14
star
85

resetera-gpt-2

Scraper of ResetEra threads and posts to get them into a format suitable for feeding them into GPT-2.
Python
14
star
86

get-data-from-photos-from-instagram-tags

Processes data from images which are tagged with the specified Instagram tag.
Python
13
star
87

hacker-news-comment-analysis

Code used for analysis of Hacker News comments.
R
13
star
88

char-tsne-visualization

Visualizations of character embeddings from derived character vectors.
HTML
13
star
89

imdb-data-analysis

R Code + R Notebook on how to process and visualize the official IMDb datasets.
12
star
90

hn-heatmaps

Code and data necessary to reproduce heatmaps relating HN Submission time to submission score.
R
12
star
91

sf-crimes-covid

Spot checking impact of SF shelter-in-places on crime reporting.
12
star
92

imgur-decline

R Code + R Notebook for analyzing the decline of Imgur on Reddit.
HTML
11
star
93

gpt-2-fanfiction

Experiments with generating GPT-2 fanfiction on specified topics.
11
star
94

notebooks

This GitHub Repository stores my R Notebooks, allowing GitHub Pages to serve the R Notebooks on my website
HTML
11
star
95

all-marvel-comics-characters

Creates a .csv of all Marvel Comics Characters + Statistics via the Marvel API
Python
10
star
96

movie-gender

Data and code for analyzing Movie Lead Gender.
Jupyter Notebook
10
star
97

online-class-charts

Code needed to reproduce data analysis and charts for MIT/Harvard Online Course Data
R
9
star
98

ggplot2-web

R Code + R Notebook on how to make high quality data visualizations on the web with ggplot2.
HTML
9
star
99

reddit-subreddit-keywords

Code + Jupyter notebook for analyzing and visualizing means and medians of keywords in the top Reddit Subreddits.
R
8
star
100

reddit-mean-score

Quick data visualization for Reddit Mean Submission Score by Subreddit
8
star