tabletop
Tabletop.js gives spreadsheets legs
mapsheet
Easily create interactive maps from data in Google Spreadsheets. Supports Google Maps, Leaflet, Mapbox, MapQuest, etc.
selenium-github-actions
flatware
S3 caching of Google Spreadsheets for Tabletop.js
persine
Persine is an automated tool to study and reverse-engineer algorithmic recommendation systems.
fuzzy_pandas
Fuzzy matches and merging of datasets in pandas using csvmatch
kull
A tool to interactively select text regions of PDFs and images. Mostly for use with PDFQuery or tesseract (UZN/OCR zone files)
spotify-langchain-gpt
Building Spotify playlists based on vibes using LangChain and GPT
gs-spreadsheet-manager
A slightly more object-oriented, ORM-y, and altogether pleasant way of dealing with Google Spreadsheets when using Google Apps Script.
hexagon-grids-for-adobe-illustrator
A simple script to automatically build hexagon grids in Illustrator, for things like D3 hex cartogram visualizations
tesseract-uzn
Simplify using uzn files with tesseract for OCR
ai2html-walkthroughs
Code samples for ai2html walkthroughs
designingviz
A step-by-step design guide for data visualizations
storytelling-2015
NICAR20-geocoding-apis
Geocoding APIs repo for NICAR20 session
simplified-scrollama-scrollytelling
Non-d3, barebones version of scrollama.js scrollytelling examples
nicar23-huggingface
python-disease-modeling
A walkthrough of how SIR infectious disease modeling works, along with a do-it-yourself Python COVID model that you can use to simulate a lockdown.
data-studio-projects
autoscraper-changes
Tiny template that uses GitHub Actions and Python to automatically scrape the BBC's homepage every four hours.
clipped-and-georeferenced-images-from-google-earth-in-qgis-3
Georeference images from Google Earth in QGIS 3, then clip them so they aren't shaped all weird.
d3-topojson-data-guide
How to wrangle your data into a useable TopoJSON format for making D3 visualizations
ire24-scraping
Tutorials for IRE 2024 about using Playwright to scrape state-level license and violations data
NICAR20-intro-d3
Base for an introductory walkthrough of D3's features for NICAR 2020
interactives-class-content
abraji24-ai-audio-video
nicar23-building-ai-editors
Conceptual framework for building AI editors to help with journalism workflows
hardboiled
A node.js detective kit for investigating how web sites are built.
mediaparty-folktales
Notes and code for learning Hungarian folktales through semantic search and GPT: Media Party 2023 lightning talk
nicar24-beyond-chatgpt
2023-abraji-ai-workshop
Journalism in AI workshop for Abraji 2023
lede-notes
Notes from Summer 2015's Lede Python class
playfair-projects
Common repository of projects for Playfair
chatgpt-for-educators
Slides for a talk about the use and abuse of ChatGPT in the classroom
data-studio-projects-2022
Projects for Columbia J School's Data Studio 2022
nb2jekyll
Use nbconvert to convert IPython/Jupyter Notebooks into Jekyll-compatible Markdown + Front Matter
NICAR20-apis
page-templates
final_final_v2_truefinal
quarto-tutorial
Quarto framework tutorial for C+J/European Data Journalism Conf 2023
how-to-host-on-github-pages
lede-projects-2023
Projects for Lede Program 2023
data-studio-projects-2024
excel-tutorials
lede-2023-auto-scraper
data-studio-projects-2023
Projects for Columbia Journalism School's Data Studio course, Spring 2023
2023-ona-before-beyond-chatgpt
Slides and links for AI Tools from Before (and Beyond!) ChatGPT (ONA 2023)
lede-projects-2022
Projects for Lede Program 2022
walkthrough-github
this is a walkthrough about how to use git and github desktop
bad-air-cities
An auto-updating website of major cities' air quality index
dotenv-example
autoscraper-history
auto-scraper
an automatic scraper?????
automatic-scraper
sample-d3-project
2024-abraji-ai-docs
2023-journalismai
example-svelte-viz-page
lede-websites
Thousands of websites for various classes I've taught over the years, primarily for Columbia's Lede Program
jupyter-notebook-site-autobuild
Lil' baby static site builder that creates a website from Jupyter notebooks and markdown files
data-studio
japanese-dialect-quiz
Japanese dialect quiz built using Claude artifacts
flask-energy
2024-ds-dojo
words
pandas-question-factory
a question factory. for pandas.
foundations
Code for Lede Program 2017 Foundations class I guess????
slack-files
List your large slack files so you can search for and delete them
normal-ai
bottlejack
automatic-scraper-bbc
fanciframe
Let's embed iframes with grace and beauty
data-buddies
It's little datas!!!
track_url
A Sublime Text 2 plugin for following along with the ever-changing contents of a URL. I use it so students can follow along.
typing-of-the-dead
Code to extract phrases from the impossibly incredible Dreamcast game The Typing of the Dead
sample-story
charset-downloader
7-bit ASCII ought to be enough for Π°ΠΏΡΠ±ΠΎΠ΄Ρ
advanced-scraping-with-playwright
Notes for the Advanced Scraping with Playwright session in February 2024
demodemo
fancy-github-guide
A guide to a fancy github experience https://jonathansoma.com/fancy-github/
miscellaneous-data
Miscellaneous and most probably useless data sets that I've used for this or that
autoscraper-mailer
Scrapes on a schedule, emailing out whatever it finds.
tiny-spider
Convert spreadsheets into network-friendly formats, a.k.a. from π to πΈ, using π·
altair-reference
transliteration_project
2023-ona-trusting-ai
Slides and links for Trusting AI in the newsroom: Hallucinations, bias, security and labor (ONA 2023)
nyc-schools