There are no reviews yet. Be the first to send feedback to the community and the maintainers!
LookingGlass
Intuitive and configurable search interface for document archives.ICWATCH-Data
Resume data and scripts for managing itHarvester
Web crawling and document processing through a usable interface.TransparencyToolkit
Main repository for Transparency ToolkitLinkedInData
Scrapes all LinkedIn profiles including search terms.JSONToNetworkGraph
Generates network graphs from a JSON.generalscraper
Scrapes all pages on any site you specify for keywords.DocManager
Universal backend for indexing, storing, and querying documents.UtilityScripts
Scripts for managing scrapersIndeedScraper
Scraper for IndeedArchivePile
A read-only theme for publishing email archives using MailpileTransparency-Toolkit-Prototype
Analysis system for Transparency Toolkit.Twiddler
A user friendly tool for text processing, light NLP, and keyword extractionLinkedinCrawler
Crawls public LinkedIn profilesdataspec-sii
Dataspec for SIILinkedinParser
A parser for LinkedIn profilesCatalyst
Text mining framework.Surveillance-Research-Data
Raw data and scripts for Surveillance Research ArchiveTwitterCrawler
A crawler for TwitterIndeedCrawler
Crawler for the resume website IndeedNameToEmail
Gets a list of potential emails from a JSON with names.JSONToMap
Converts a JSON with locations into a map with points.Thumbtack
An open narrative mapping tool to corroborate narratives across multiple sources and formatsDesignAssets
A collection of branding, interfaces, and other visual resources!JSONToChoropleth
Generates choropleth maps from JSONs.theme-snowden
A theme for LookingGlass for Snowden doc searchFacebookCrawler
A crawler for Facebook data from public web and Graph APIEmailParser
A crawler for converting email files on disk to JSONParseFile
OCRs document and extracts metadataExtractPatterns
Extracts terms matching certain patterns. For finding new codewords and tracking mentions of known ones.TSJobCrawler
Collects listings for jobs that require security clearance.EntityExtractor
Extracts entities and terms matching certain patterns.IndeedParser
Parser for Indeed resumestransparencytoolkit.github.io
A styleguide site for Transparency ToolkitCrawlerManager
API for calling crawlersdataspec-LinkedinCrawl
A LookingGlass dataspec file for data scraped form LinkedIn.comPiplCollector
Request info from Pipl for all items in datasetDirCrawl
Runs block of code on every file in directoryPiplRequest
Request profiles from PiplJSONToChart
Converts JSONs to pretty chartsdataspec-IndeedCrawl
A LookingGlass dataspec file for data scraped from Indeed.comwlsearchscraper
Gets a list of results from the WikiLeaks search.Archiver
Archives URLstheme-pi
A theme for Privacy International collaborationsUploadConvert
Tools for converting documents uploaded to Transparency Toolkit to properly formatted JSONs.dataspec-GoogleCrawl
A dataspec for the Google crawlerdataspec-template
A starter template for LookingGlass json filesRequestManager
Manages scraper HTTP requestsfederalregisterscraper
Scraper for the Federal Registermonth-names
Names of months in multiple languagesdataspec-fbidhs
JSONCombiner
Combines JSONs.Test-Data
Test data for Transparency Toolkit developmentArchiveAdministrator
Archive administration system. Handles archive creation and user authentication.DocUpload
Upload application for documents in archiving service.dataspec-EmailCrawl
Dataspec for emailsIC-Company-Data
Intelligence contractorswordcloud
Changes word sizes in a document based on the number of times they occur.JSONCrossreference
Crossreferences JSONs and returns the matching data.OCRServer
OCR server for hosted archiving serviceansible-role-lookingglass
Automates deployment of LookingGlass instancesNetworkGraph
Neo4j network graph generator prototypeCountryConvert
Converts 2-char ISO country codes to 3-char codes.classification-sensation
Parse classification-related informationdataspec-LoadFiles
Dataspec for plain files loaded in via Harvester/DirCrawl.dataspec-snowden
A dataspec for Snowden documentsLove Open Source and this site? Check out how you can help us