Jason Michael Baumgartner (@pushshift)
  • Stars
    star
    1,586
  • Global Rank 19,407 (Top 0.7 %)
  • Followers 310
  • Following 5
  • Registered over 9 years ago
  • Most used languages
    Python
    90.0 %
    Go
    6.7 %
    Perl
    3.3 %
  • Location πŸ‡ΊπŸ‡Έ United States
  • Country Total Rank 6,014
  • Country Ranking
    Python
    995
    Perl
    2,405
    Go
    4,784

Top repositories

1

api

Pushshift API
Python
1,242
star
2

telegram

Pushshift Telegram Ingest
Python
83
star
3

tiktok

Module to access TikTok Private API
Python
51
star
4

reddit_sse_stream

A Server Side Event stream to deliver Reddit comments and submissions in near real-time to a client.
Python
47
star
5

zreader

Read compressed NDJSON .zst files easily
Python
30
star
6

rinzler

A high performance indexing and search system for managing big data
Go
17
star
7

Parallel-NDJSON-Reader

Parallel NDJSON Reader for Python
Python
14
star
8

Reddit-Bot-Detector

Script to extract highly probable bots for further analysis
Python
11
star
9

imdb_to_json

Fetch movie data from IMDB and output in JSON format.
Python
9
star
10

google_bigquery

8
star
11

tradier

Tradier API Examples
Python
8
star
12

trump_tweets

Code example and data for all available Trump Tweets
Python
7
star
13

token_manager

Code to handle multiple Twitter user access tokens when making requests
Python
6
star
14

ndjson_processor

High Speed multiprocessing ndjson processor
Python
5
star
15

python-zstandard-compression-test

Python script to test the zstandard module
Python
5
star
16

US_Election_Data

Code to grab election data from CNN's election data API
Python
4
star
17

gab_mastodon

Ingest scripts and Elasticsearch Mapping for Gab's new Mastodon Site
Python
4
star
18

Big-Data-Scripts

Miscellaneous Python and Perl Scripts for working with Big Data files that are new-line delimited JSON Objects
Python
4
star
19

tweet-id_components

Go program to extract tweet id components
Go
3
star
20

Docker-Reddit-Sphinxsearch

Docker container for sphinxsearch -- used for adding Reddit comments for search
Perl
3
star
21

ps_proxy_manager

Pushshift Proxy Manager
Python
2
star
22

ap_story_fetcher

Associated Press Story Fetcher
Python
2
star
23

officer_dot_com

Example code to start parsing data from the website officer.com
Python
2
star
24

realdonaldtrump

Archive of tweets from the @realdonaldtrump Twitter account
2
star
25

parse_wiki_tables

Simple Example to parse out data from Wikipedia tables using selectolax
Python
1
star
26

scrape_subreddit_categories

reddit
Python
1
star
27

binary_search

Example of a binary search implementation using real data (Reddit author info)
Python
1
star
28

meetup

Code for ingesting meetup.com streams (comments, photos, etc.)
Python
1
star
29

extract_json_from_html

This script will make it much easier to extract a JSON object from HTML (e.g. getting Tiktok data)
Python
1
star
30

archiver

Server to handle POST requests with raw data which then archives the data permanently
Python
1
star
31

discord-pushshift-bot

Discord Pushshift Bot
Python
1
star
32

browser_extension_parser

Parser module for Facebook observations returned from the browser extension
Python
1
star