• Stars
    star
    1
  • Language
  • Created almost 10 years ago
  • Updated almost 10 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

GitHub mirror of the Tesseract OCR tessdata repo

More Repositories

1

ruby-opencv

Versioned fork of the OpenCV gem for Ruby
C++
176
star
2

macduff

Macduff: the Macbeth ColorChecker finder
C++
104
star
3

loebolus

Making the public domain Loebs more easily downloadable. Data at https://github.com/ryanfb/loebolus-data
Haml
91
star
4

torch-warp

Fully automatic optical flow based image morphing implemented in Torch
C++
73
star
5

pcl-tools

Mesh processing utilities based on the Point Cloud Library
C++
57
star
6

academia-dl

Download PDFs from academia dot edu without logging in
Ruby
45
star
7

georefine

Geospatial processing for OpenRefine
Java
40
star
8

book-aligner

Automatic alignment of books between HathiTrust, Internet Archive, Google Books, etc.
CoffeeScript
32
star
9

iiif-dl

Command-line tile downloader/assembler for IIIF endpoints/manifests
Ruby
31
star
10

loebolus-data

Data for Loebolus
29
star
11

iOS-Ancient-Greek-Keyboard

Ancient Greek Keyboard for iOS 8+
Objective-C
25
star
12

iiif-universe

An IIIF Universe for IIIF catalogs
Ruby
24
star
13

ancientgreekocr-ocr-evaluation-tools

'ocr-evaluation-tools' from http://ancientgreekocr.org/. Tools to test OCR accuracy.
C
21
star
14

dzi-dl

Deep Zoom Image Downloader
Ruby
17
star
15

latinocr-lat

'lat' repository, forked from https://github.com/ryanfb/ancientgreekocr-grc. The final training process for lat.traineddata
Makefile
13
star
16

homebrew-alicevision

Homebrew tap for AliceVision and related formulae
Ruby
12
star
17

docker_scantailor

Docker for ScanTailor and ScanTailor Advanced
Makefile
11
star
18

loeb-copyright

Collaborative assessment of the copyright status of volumes in the Loeb Classical Library
Ruby
9
star
19

docker_3d-photo-inpainting

Dockerfile for 3d-photo-inpainting
Dockerfile
9
star
20

docker_visualsfm

Experimental Docker file for building a replicable VisualSFM environment
9
star
21

bl-dl

Script, Dockerfile, and Instructions for Downloading Manuscript Images from the British Library
Dockerfile
9
star
22

etc

blog, wiki, miscellany, et cetera
HTML
8
star
23

tesseract-ocr.langdata

GitHub mirror of the Tesseract OCR langdata repo
Shell
8
star
24

ancientgreekspeak

Transliterate Ancient Greek to Apple phonemes for text-to-speech synthesis
Ruby
8
star
25

tesseract_latinocr_docker

Docker for building lat.traineddata from scratch
7
star
26

pleiades-geojson

Scripts and data for GeoJSON from Pleiades CSV data dumps
Ruby
7
star
27

mac-wasd-polytonic-greek

A Mac Polytonic Greek keyboard for WASD 104/87-Key US ANSI Layout
7
star
28

ancientgreekocr-grc

'grc' repository from http://ancientgreekocr.org/. The final training process, hopefully soon to be part of the main Tesseract codebase.
Makefile
7
star
29

sketchfab-dl

Command-line downloader for downloadable Sketchfab models
Ruby
7
star
30

twitter-bookmarks-export

Export as many of your Twitter Bookmarks as possible to JSON
Ruby
7
star
31

tesslinesplit

Standalone Tesseract line segmentation
C++
6
star
32

picodriveds

PicoDriveDS - Nintendo DS port of PicoDrive
Assembly
6
star
33

latinocr-lattraining

Rules and tools to deterministically generate all prerequisites for the final training process. Adapted from https://github.com/ryanfb/ancientgreekocr-grctraining/
Makefile
6
star
34

skimage-orbs-faiss-example

Using Faiss to perform ORB feature descriptor matching with scikit-image
Python
5
star
35

pleiades-tgn

Script for generating Pleiades/TGN alignment
Ruby
5
star
36

e57tools

Tools for working with E57 format 3D data, based on libE57Format
C++
5
star
37

Cardo

A mirror of David J. Perry's Cardo font, a Bembo-inspired font which is useful for scholars
5
star
38

docker_fromthepage

Docker for running a FromThePage server instance
Dockerfile
5
star
39

rousefirstgreekcourse

Readings from W.H.D. Rouse's "A First Greek Course"
Shell
5
star
40

web-archive-triage

Scripts for triaging and submitting URLs for web archiving
Shell
5
star
41

scripts

Various dumb shell scripts for doing things (file conversion, etc.)
Ruby
4
star
42

docker_diffengine

Dockerfile for diffengine
Shell
4
star
43

deZoom

Remove the Zoom video conferencing app from your Mac and fix vulnerabilities
Shell
4
star
44

gnomvat

English translations from the Gnomologium Vaticanum
Shell
3
star
45

hocr-reader

Client-side JavaScript reader for GitHub repositories with hOCR output.
JavaScript
3
star
46

docker_m68k_gcc

GCC Cross-Compiler for M68k in an Ubuntu Docker container
Dockerfile
3
star
47

gist2pinboard

Add all your gist URLs as private bookmarks on pinboard
Ruby
3
star
48

xsugar

XSugar code for EpiDoc<->Leiden+ transformation
HTML
3
star
49

ryanfb.github.com

HTML
3
star
50

dmvisbot

Generate and post dmvis visualizations of random levels from the worlds of Doom
Python
3
star
51

google-fusion-tables-backup

Google Fusion Tables CSV backup script
Ruby
3
star
52

neural_tv

Code for the Neural TV Twitter Bot
Ruby
2
star
53

docker_blender

Dockerized Blender 2.82
Dockerfile
2
star
54

iiif-pastec

Scripts for using Pastec as an image-based search engine for IIIF image servers
Ruby
2
star
55

kraken-gaza-batrachomyomachia

Theodorus Gaza’s edition and paraphrase of the Batrachomyomachia
CSS
2
star
56

grist

single-serving web app for searching your own gists
Ruby
2
star
57

docker_ffmpeg-shadertoy

Dockerfile for FFmpeg with GLSL shadertoy filter support
Dockerfile
2
star
58

teem-parallel

Fork of teem (http://teem.sf.net) to add crude parallelization with OpenMP (and eventually MPI)
C
2
star
59

tesseract_ancientgreekocr_docker

Docker for building grc.traineddata from scratch
2
star
60

ijk

Isosurface Jeneration Kode with some changes
C
2
star
61

okc-leak-checker

HTML
2
star
62

photios-bibliotheca

English translations of selected entries from the "Bibliotheca" of Photios
Ruby
2
star
63

copydatrom

Incredibly simple DAT-based ROM renaming utility
Ruby
2
star
64

docker_nidaba

Docker environment for https://github.com/OpenPhilology/nidaba
Shell
2
star
65

latinocr-lattestfodder

Latin page scans and ground truth text for testing OCR accuracy.
2
star
66

github-friction

Check for common sources of contributor friction across your GitHub repositories.
JavaScript
2
star
67

ancientgreekocr-grctraining

'grctraining' repository from http://ancientgreekocr.org/. Rules and tools to deterministically generate all prerequisites for the final training process.
Makefile
2
star
68

latinocr

Website for Tesseract Latin OCR training data and tools
HTML
2
star
69

kraken-gaza-iliad

Edition-specific OCR training for Theodorus Gaza's Attic paraphrase of the Iliad
HTML
2
star
70

dnsimple-tld-checker

Ruby script(s) for checking all available DNSimple TLDs for domain availability, alongside their prices
Ruby
1
star
71

papers-BICS

Chapter for Digital Classicist BICS volume
TeX
1
star
72

papers-ICADL12

1
star
73

docker_tesseract_opencl

A Dockerfile for building and installing Tesseract 3.03 with OpenCL support enabled.
1
star
74

oldalice

an IRC client viewed in WebKit (mirror of deprecated repository)
JavaScript
1
star
75

gh-pages-test

GitHub pages templating test
1
star
76

pleiades-osm

Scripts for generating Pleiades/OpenStreetMap alignments
Ruby
1
star
77

docker_ocropus

Docker for OCRopus development/test environment
1
star
78

jekyll-theme-palaios

A simple, elegant, archaizing Jekyll theme for beautiful plain text
HTML
1
star
79

dlflickr

Flickr image/metadata download & archiving scripts
Ruby
1
star
80

itinerarium

Visualization experiment for ancient itineraries
CoffeeScript
1
star
81

cryptosystem.org

Ruby
1
star
82

cite-collection-editor

Client-side JavaScript editor for CITE collections stored in Google Fusion Tables
CoffeeScript
1
star
83

geonames-geojson

Scripts and data for GeoJSON from GeoNames data dumps
Ruby
1
star
84

kraken-voulgaris-aeneid

Edition-specific OCR training for Evgenios Voulgaris' Greek translation of the Aeneid
HTML
1
star
85

docker_outrun_sdk

Dockerized environment for the Out Run Arcade SDK
Eagle
1
star
86

pleiades-json-to-geojson

Script for converting Pleiades JSON dumps to a GeoJSON FeatureCollection
Ruby
1
star
87

rotatebake

RotateBake - a Mac app for baking EXIF rotation into images
Shell
1
star
88

pleiades-static-search

Static client-side search for Pleiades
CoffeeScript
1
star
89

tlgu

Utility for converting an input file in Thesaurus Linguae Graeca (TLG) / Packard Humanities Institute (PHI) representation to Unicode (UTF-8).
C
1
star
90

backpack-export-xslt

Stylesheets for converting your Backpack export into a complete usable backup
JavaScript
1
star
91

ferguson-video

Download all videos in edsu/ferguson-urls with youtube-dl
Makefile
1
star
92

instapaper-watch-later

A script to move YouTube videos in your Instapaper feed to a YouTube "Watch Later" style playlist.
Ruby
1
star
93

google-drive-normalizer

Script for normalizing content in Google Sheets to Unicode NFC
Ruby
1
star
94

ancientgreekocr-grctestfodder

'grctestfodder' repository from http://ancientgreekocr.org/. Ancient Greek page scans and ground truth text for testing OCR accuracy.
1
star
95

ancient-world-3d

Test repository for 3D data of the Ancient World
1
star