• Stars
    star
    21
  • Rank 1,084,038 (Top 22 %)
  • Language
    Go
  • License
    Other
  • Created almost 8 years ago
  • Updated over 1 year ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

dataset is a command line tool, Go package, shared library and Python package for working with JSON objects as collections

More Repositories

1

handprint

Apply different text recognition services to images of handwritten documents.
Python
169
star
2

datatools

A set of tools for working with JSON, CSV and Excel workbooks
Go
74
star
3

dibs

DIBS is an implementation of a basic controlled digital lending (CDL) system using IIIF to make scanned books available for time-limited viewing.
JavaScript
27
star
4

waystation

Automatically archive your repository's GitHub Pages in the Wayback Machine.
Makefile
23
star
5

sidewall

Sidewall is a Python library for interacting with the Dimensions search API.
Python
17
star
6

commonpy

Collection of common Python utility functions and classes used in other Caltech Library programs.
Python
15
star
7

bun

A Python package for a basic CLI and GUI user interface
Python
13
star
8

documentarist

Process Caltech Archives' digital documents and photos, and annotate each page or image with information about its contents
Python
12
star
9

baler

Bad link reporter for GitHub repositories
Makefile
11
star
10

caltechdata_api

Python library for using the CaltechDATA API
Python
10
star
11

pokapi

Simple Python object-oriented interface for getting records from FOLIO
Python
9
star
12

foliage

Foliage is the FOLIo chAnGe Editor, a tool to do bulk changes in FOLIO using the network API.
Python
9
star
13

caltechdata

The CaltechDATA InvenioRDM source code
HTML
8
star
14

iga

IGA is the InvenioRDM GitHub Archiver, a standalone program as well as a GitHub Action that lets you automatically archive GitHub software releases in an InvenioRDM repository.
Python
8
star
15

newt

Newt a microservice for integrating Postgres+PostgREST and Pandoc
HTML
7
star
16

cloud-init-examples

This repository includes an example of cloud-init YAML files for use with multipass VMs.
HTML
6
star
17

topi

Simple Python objects for getting data from TIND
Python
6
star
18

codemeta2cff

GitHub Action converting a codemeta file to CITATION.cff
6
star
19

template

Template repository for software projects by the Caltech Library
SCSS
5
star
20

cell-atlas

Cell atlas
TeX
5
star
21

ames

Automated Metadata Service
Python
5
star
22

coif

Cover image finder
Python
4
star
23

epxml_to_datacite

Transform Eprints XML to DataCite XML and mint DOIs in Eprints repositories
Python
4
star
24

boffo

Boffo is an add-on for Google Sheets written by the Caltech Library. It lets you select item barcodes in a spreadsheet and retrieve information about the item records from a FOLIO server.
JavaScript
4
star
25

urlup

Find the ultimate destination for URLs after following redirections.
Python
4
star
26

pairtree

A simple encoder/decoder for converting object identifiers into a Pair Tree Path (path)
Go
4
star
27

turf

Turf ("TIND.io URL Fixer") is a program to download records from the Caltech TIND.io database and check the URLs that may be embedded within the records.
Python
3
star
28

namaste

Go package and command line implementation of "NAMe AS TExt" metadata embedding for directories.
Go
3
star
29

tmplfn

tmplfn provides a set of common functions useful for working with Go's text/template and html/template packages
Go
3
star
30

collaborator_reports

Generate collaborator reports from data sources
Python
3
star
31

martian

Search Caltech's TIND.io database and download the resulting MARC records
Python
3
star
32

persistent_url_resolver

This repository contains a new version of Caltech Library's Persistent URL Resolver, based on AWS S3.
Python
3
star
33

splitit

Split range values produced by Caltech.tind.io in spreadsheet output
Python
3
star
34

convert_codemeta

Convert and validate codemeta files using crosswalk
Python
3
star
35

corsproxy

Simple CORS proxy server suitable for use as a system daemon on CentOS/RHEL systems
JavaScript
3
star
36

pubarchiver

Package up microPublication.org and other journals for archiving into Portico and PMC
Python
3
star
37

caltechauthors

The CaltechAUTHORS InvenioRDM source code
HTML
3
star
38

rss2

A Golang package for working with RSS 2
Go
3
star
39

cli

Building better command line interfaces for Caltech Library
Go
3
star
40

sidetrack

Simple debug tracing package for Python, with optimization support.
Python
3
star
41

eprints2bags

Download records and documents from an EPrints server and put them in BagIt format.
Python
3
star
42

eprints2archives

Send records from an EPrints server to the Internet Archive and other web archives
Python
3
star
43

eprinttools

EPrintTools is a Go package, command line utilities and a service for working with EPrints 3.3.x EPrint XML and REST API
Go
3
star
44

git-desktop

Modified version of the Software Carpentry git-novice lesson that uses GitHub Desktop
Python
3
star
45

py-cli-template

GitHub template project for non-web Python application projects. To use this, DO NOT CLONE OR FORK this repository; click on "Use this template". After it's used to create a new repo, this will run a GitHub Actions workflow to update files and directories, so give it a minute and refresh your browser to see the finished result.
Makefile
3
star
46

orcidtools

A command line tool and example Bash scripts for working with the ORCID v2.0 API
Go
2
star
47

popstar

Phone-Oriented Processing SofTware for ARchives
Makefile
2
star
48

coda_media

Web pages for CODA Movies
HTML
2
star
49

crossrefapi

This is a Go package fork working politely with the CrossRef API.
Go
2
star
50

libguine

Caltech Library customizations for LibGuides CMS
HTML
2
star
51

storage

A Go package wrapper for Caltech Library's remote storage options supporting CRUD operations
Go
2
star
52

lostit

Report lost items in the Caltech Library
JavaScript
2
star
53

2018-10-18-shell-arc

Python
2
star
54

install-golang

A Bash script that bootstraps and installs Go language in your home directory
Shell
2
star
55

RDMworkbook

bookdown files for "The Research Data Management Workbook"
TeX
2
star
56

acacia

Automated CaltechAUTHORS Catalog Ingest Agent
Python
2
star
57

inveniordm-migrate

Scripts to migrate content into Invenio RDM
Python
2
star
58

orcid-sciencv-workshop

HTML
2
star
59

liblog

experiment in tracking website content changes
HTML
2
star
60

caltechlibrary.github.io

Caltech Library's Digital Library Development sandbox.
HTML
2
star
61

unified-search-box

DEPRECIATED: A unified search box for Caltech Library.
HTML
2
star
62

rc

Rest Client wrapper around Golang's http/http packages targeting ease of implementation working with REST API
Go
2
star
63

andor

A light weight object repository concept
Python
2
star
64

2019-01-22-Python-Workshop

Workshop page for CPA Python Workshop
Jupyter Notebook
2
star
65

wsfn

Go package for standardize web service functionality across our library's go projects
Go
2
star
66

wos_reports

Scripts to generate reports from Web of Science
Python
2
star
67

command-line-for-librarians

MMWConf presentation Fall 2016
HTML
1
star
68

ocre

OCR evaluation utilities
Python
1
star
69

codemeta-pandoc-examples

This repository describes how to generate a CITATION.cff, about.md and installer.sh from a codemeta.json file using Pandoc.
HTML
1
star
70

irdmtools

A Go and Python package for working with InvenioRDM repositories.
Go
1
star
71

cold

Controlled Object Lists Daemon
TypeScript
1
star
72

refoliate

REstore FOLIo sAved insTance rEcords
1
star
73

vcard

A light weight VCard v4 package for harvesting local directory information
Go
1
star
74

2018-08-13-caltech

Python
1
star
75

2018-03-29-caltech

Caltech R Workshop
Python
1
star
76

irdm_harvester

Automatically harvest publications for an InvenioRDM repository
Python
1
star
77

metagenesys

Take the information from a Python setup.cfg file and generate a codemeta.json file
1
star
78

etdb-ipfs-mirror

Code involved in setting up an IPFS mirror of ETDB
Shell
1
star
79

dataset-instruction

Instructional content for the dataset package
HTML
1
star
80

caltechdata_map

Map interface for CaltechDATA files
HTML
1
star
81

library-shell-curl-and-api

An intermediate exploration of Bash, curl and working with content from web API
1
star
82

tccon-caltechdata

Scripts for uploading TCCON data to CaltechDATA
Python
1
star
83

htr-test-cases

Images of documents for testing HTR.
Python
1
star
84

R-Carpentry

Repository for Carpentry Lessons based on R
HTML
1
star
85

checkit

Look up barcodes in Caltech's TIND server
Python
1
star
86

html_footer

Universal footer code to be embedded in various systems for a consistent look.
HTML
1
star
87

doitools

A Go package for working with DOI
Go
1
star
88

archives_drupal

Caltech Archives on Drupal 8
CSS
1
star
89

caltechdata_plot

Demo interactive plotting tool that uses Bokeh server to produce an interactive plot by calling the caltechDATA (Invenio 3) API
Python
1
star
90

caltechdata_migrate

Assorted scripts for migrating content to CaltechDATA
Python
1
star
91

dibsiiif

Scripts to support the Caltech Library DIBS application
Python
1
star
92

pkgassets

(this has moved into the cli package) See https://github.com/caltechlibrary/cli
Go
1
star
93

searchtools

A Python3 package for working with Elasticsearch and LunrJS.
Python
1
star
94

distillery

WIP
Python
1
star
95

book-template

A Bookdown book template with Caltech customizations
JavaScript
1
star
96

ep-php

EPrints 3.3 REST API wrapper for PHP
PHP
1
star
97

adage

Authors Dimensional Analysis and General Exploration
1
star
98

dataciteapi

A Golang package and command line utility for working with the public DataCite API
Go
1
star
99

archives-hale-processing

Scripts and CSV files for processing the George Ellery Hale Papers for the Caltech Archives.
PHP
1
star
100

cait

Caltech Archives Integration Tools - A collection of tools utilities and services for integrating ArchivesSpace with other systems.
Go
1
star