• Stars
    star
    105
  • Rank 326,545 (Top 7 %)
  • Language
    JavaScript
  • License
    ISC License
  • Created over 3 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Collaborative data collection tool developed by the Associated Press

AP Harvester

Documentation Status

AP Harvester is an open source, collaborative data collection platform designed to help newsrooms gather structured data at the speed of news. We built it to lower the barriers in spinning up a new data collection project so that you can get to the story faster.

AP Harvester is schema-driven, meaning you define the structure of the dataset you want to collect and Harvester automatically renders a user-friendly form through which a team of reporters can enter data as they collect it. It's built to be flexible and transparent, allowing you to adapt as your data collection needs change.

AP Harvester uses Google Sheets as a data storage mechanism, meaning you can easily view and work with your data in a tool used by many newsrooms already. Starting a new data collection project with AP Harvester is as easy creating a new spreadsheet.

Deploy straight to Heroku

Ready to dive right in? Hit the button to deploy AP Harvester directly to Heroku. Take a look at the setup documentation for how to get started.

Deploy

Documentation

Credit

This project has been a labor of love for the AP Data Team and we can't wait to see what you do with it! If you do decide develop your own fork and take it in your own direction we would really appreciate it a shout-out to the Associated Press in your version of the tool.

Happy harvesting!

More Repositories

1

verify-dkim

Tool to verify DKIM signatures on an mbox of emails
Shell
88
star
2

geomancer

Open source tool to help journalists easily mash up data based on shared geography.
Python
59
star
3

datakit-core

Core library for the datakit CLI framework.
Python
53
star
4

cookiecutter-r-project

Basic cookiecutter template for R projects
Python
32
star
5

aptheme

R Themes in AP style
R
30
star
6

datakit-project

Project generator for use with the datakit framework.
Python
26
star
7

cookiecutter-python-project

Basic cookiecutter template for Python projects
Jupyter Notebook
17
star
8

national-caseload-data-ingest

Scripts to download the U.S. Department of Justice's National Caseload Data and load it into Amazon Athena for querying
Python
13
star
9

datakit-github

Datakit plugin to help manage Github integration on data projects.
Python
12
star
10

local-ai-brainerd-dispatch

The Public Safety Reporting System (PSRS) is a prototype web application that parses police blotters from unstructured PDF sources, applies editorial logic to the data to help journalists identify and select relevant incidents.
Python
10
star
11

local-ai-ksat

Clip2Story is a prototype web application that transcribes news video clips, summarizes transcripts using OpenAI, and feeds summaries as the first draft of a story into a CMS.
Python
9
star
12

apdatacheck

Data import and analysis validation library
R
8
star
13

apstyle

Install AP style materials for R work
R
8
star
14

bls_api

API wrapper for data from the U.S. Bureau of Labor Statistics
Ruby
7
star
15

datakit-dworld

Commands to manage project integration with data.world
Python
4
star
16

datakit-data

A datakit plugin to simplify using AWS S3 as a data store for data science projects.
Python
3
star
17

ipedals

R package to simplify access to IPEDS
R
2
star
18

geomancer-deploy

Scripts and configs for provisioning Ubuntu as a host for geomancer.
Shell
2
star
19

datakit-gitlab

DataKit plugin to help manage Gitlab integration on data projects.
Python
2
star
20

ailurus

Ruby client gem for newsroom data libraries running PANDA
Ruby
1
star
21

cookiecutter-datakit-plugin

Cookiecutter template to generate project skeleton for new datakit plugins.
Python
1
star
22

html-webpack-jsdom-prerender-plugin

Webpack plugin to prerender JS apps in a JSDOM context
JavaScript
1
star
23

cookiecutter-generic-project

Basic cookiecutter with just the AP style folder structure and readme
Python
1
star