• Stars
    star
    53
  • Rank 552,529 (Top 11 %)
  • Language
    Java
  • License
    Apache License 2.0
  • Created about 12 years ago
  • Updated 12 months ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Suite of tools for detecting changes in web pages and their rendering

More Repositories

1

format-corpus

An openly-licensed corpus of small example files, covering a wide range of formats and creation tools.
Java
183
star
2

jhove

File validation and characterisation.
Java
169
star
3

fido

Format Identification for Digital Objects (FIDO) is a Python command-line tool to identify the file formats of digital objects. It is designed for simple integration into automated work-flows.
Python
145
star
4

jpylyzer

JP2 (JPEG 2000 Part 1) validator and properties extractor. Jpylyzer was specifically created to check that a JP2 file really conforms to the format's specifications. Additionally jpylyzer is able to extract technical characteristics.
Python
69
star
5

scape-xcorrsound

Suite of tools for automated quality assurance of audio migration processes.
C
42
star
6

scape

SCAlable Preservation Environments
Java
39
star
7

ViPER

Dutch Digital Heritage Network virtual research environment set up and provisioning
Shell
16
star
8

nanite

Nanite - a friendly swarm of format-identifying robots.
Java
15
star
9

bitwiser

Bitwise analysis tools
Java
14
star
10

matchbox

Image comparison QA tool for digital preservation workflows.
C++
14
star
11

jpylyzer-test-files

Test files for conformance testing and benchmarking Jpylyzer.
Shell
13
star
12

scout

SCOUT - A preservation watch system
Java
13
star
13

plato

The Preservation Planning Tool Plato
Java
10
star
14

flint

A modular and extendible file/format validation framework
Java
9
star
15

hawarp

HAdoop-based Web Archive Record Processing
Arc
7
star
16

fuzzy-expert-system

Python
6
star
17

scape-apis

SCAPE Project API specifications
6
star
18

policies

Machine readable preservation policy ontology for SCAPE automated planning.
CSS
5
star
19

scape-toolwrapper

SCAPE project for creating debian packages from command line tools.
Java
5
star
20

ToMaR

Wraps command line tasks for parallel execution as Hadoop map reduce jobs.
Java
4
star
21

libmagic-jna-wrapper

A Java/JNA wrapper for calling libmagic.
Java
4
star
22

scape-demo-sites

Web based demonstrators of SCAPE tools.
PHP
3
star
23

odf-validator

Open source Open Document Format (ODF) validation
Java
3
star
24

Tika-identification-Wrapper

Java wrapper for executing Tika format identification across GovDocs.
Java
3
star
25

scape-component-profiles

Holds the SCAPE component profile ontology and profile XML files.
CSS
2
star
26

jpwrappa

Simple Python wrapper for the command-line tool of Aware's JPEG 2000 SDK.
Python
2
star
27

Arc-unpacker

ARC File unpacker for the Hadoop File System.
Arc
2
star
28

video-batch

Java
2
star
29

fits-blackbox-testing

A simple tool for FITS back box testing
Java
2
star
30

par-wikidp.old

PAR registry endpoint for WikiDP/Wikidata
Python
2
star
31

scape-planmanagement-webapp

SCAPE Plan Management Webapp which provides a GUI for the end user
JavaScript
2
star
32

sheets-preservation-spec

OPF Spreadsheets Preservation Specification
2
star
33

pdfPolicyValidate

PDF policy-based validation demo
XSLT
2
star
34

finger-detection-tool

Java
2
star
35

scape-fcrepo4-planmanagement

Plan Management API implementation on top of fedora 4
Java
1
star
36

scape-simulator

The SCAPE simulation environment.
Java
1
star
37

crop-detection-tool

Python
1
star
38

preflightGovdocsSelected

Results of analysis of Govdocs Selected corpus with Apache Preflight and Schematron rules
Shell
1
star
39

carrus

JavaScript
1
star
40

verapdfa

Initial VeraPDFA repository, private for initial months of the design phase.
Java
1
star
41

scape-toolspecs

A home, and version control, for the SCAPE project's tool specifications for the tool-wrapper.
Shell
1
star
42

tabular-data-normaliser

Normalises data from different sources (CSV, XLS and PDF)
Java
1
star
43

fido-update-service

FIDO signature update REST services
Python
1
star
44

par-wikidp

PAR registry that implements a subset of the PAR API/model base on data in WikiData.
Python
1
star