  • Stars
    6
  • Rank 2,488,002 (Top 50 %)
  • Language
    Python
  • Created about 13 years ago
  • Updated over 12 years ago

Repository Details

A fork of Python decruft, a Python version of the Readability algorithm (http://www.minvolai.com/blog/decruft-arc90s-readability-in-python/), for bug fixing and tinkering.

More Repositories

1

fuzzytime

A Go package to parse human-readable date and time strings
Go
55
star
2

evilpixie

Pixel-oriented paint program, modelled on Deluxe Paint
C++
36
star
3

qs

Query string parser for the Bleve text indexing library
Go
29
star
4

journalisted

PHP
11
star
5

unsourced

Experimental site to make it easy to add sources to news articles
Python
10
star
6

unsourced-chrome

Chrome plugin for unsourced.org
JavaScript
6
star
7

arts

Package to help extract data from online news articles
Go
4
star
8

impy

C library for loading/saving images and animations
C
4
star
9

media_complaints_site

Website for tracking media complaints body cases (e.g. PCC)
Python
4
star
10

fakemail

Hacky little tool to generate bulk fake emails for testing.
Rust
3
star
11

zig

It's a game where you shoot stuff.
C++
3
star
12

churnalism-extensions

JavaScript
3
star
13

tameimap

Simple IMAP server for testing
Go
2
star
14

metareadability

Package to pick out headline, publication date, author(s) from news articles on the web.
Python
2
star
15

fuzzydate

Python date parsing package
Python
2
star
16

hnews_checker

Online sanity checker for hNews
PHP
2
star
17

badger

A simple in-memory document store for Go, with a search language.
Go
2
star
18

ukpr

Server which screen-scrapes UK press releases and serves them up as HTTP Server-Sent Events streams
Go
2
star
19

hnews_popup

A jQuery plugin to display hNews information on a page in a popup box
JavaScript
1
star
20

scrapeomat

Backend code for Steno, a set of tools for gathering and analysing online news articles.
Go
1
star
21

retain

A simple tool to find files matching a given set of timestamps
Python
1
star
22

glod

The new glod standard for static site generation
Go
1
star
23

drongo-forms

A PHP version of django forms
PHP
1
star
24

unsourced-firefox

Firefox extension for unsourced.org
JavaScript
1
star
25

steno

GUI frontend code for Steno, a set of tools for gathering and analysing online news articles.
Go
1
star
26

warc

Go
1
star
27

tb-notes

Shell
1
star