@lsb

Top repositories

1

pq.js

Using embeddings compressed by Product Quantization, in Javascript
Python
30
star
2

sqlite-vector-search

Jupyter Notebook
28
star
3

gcs

Compressed Bloom Filters (Golomb-compressed sequences, with indices)
Haskell
17
star
4

zgrab-the-web

Feed the hundreds of millions of domains in the Common Crawl to zgrab
Shell
9
star
5

save-bindings

Making printf-debugging obsolete.
Ruby
7
star
6

embeddingdb.js

An embedding database for the browser
JavaScript
6
star
7

text-entropy-api

Web service (via Sinatra) to pull stats for a list of words from the text-entropy statistical models
Ruby
6
star
8

text-entropy

Aggregate n-gram frequency counts from GBooks via Hadoop, dump statistical model out to flat files
Shell
6
star
9

text-entropy-visualization

The JS that renders the data from the API from text-entropy
JavaScript
4
star
10

ugc-contributors

Analyzing contributors to Wikipedia's user-generated content
Ruby
3
star
11

diffusion-local-time

Diffusion Local Time, in Art Hack Day DETHRONE 2024
Python
3
star
12

sister-cities-map

Map of sister cities, via lsb/city-correlation-mapping and deck.gl
JavaScript
2
star
13

jslzjb-faster

Trying out some speed improvements for jslzjb
JavaScript
2
star
14

liberstatim

This was the original incarnation of NoDictionaries, written in Ruby 1.6 [sic]. It is included for historical reference, and contains many antipatterns.
TeX
2
star
15

turk-rest-api

QC, fair wages, and question form generation for Mechanical Turk
Ruby
1
star
16

code-kata-bayhac2013

Code Kata BayHac 2013
Haskell
1
star
17

passphrase-safety-ui

UI for http://www.leebutterman.com/passphrase-safety/
JavaScript
1
star
18

n-gram-weaving

Visualizing the weave of n-grams in text
Haskell
1
star
19

generative-image-semantic-search

Image semantic search in <10 lines
1
star
20

city-correlation-mapping

City correlation mapping, via Wikipedia revisions
Jupyter Notebook
1
star
21

visualizing-light-pollution

Visualizing light pollution, as topography (and as haze)
1
star
22

slow-nested-svgs

Browser benchmark of speed differences between RECTs in nested SVGs versus non-nested RECTs.
1
star
23

enwiki-nlp

Obsolete: churn through wiki markup in various ways
Ruby
1
star
24

stable-diffusion-raspi-clock

A clock on a Raspberry Pi, telling time via Stable Diffusion images
Python
1
star
25

eight-ways-of-looking-at-a-fivegram

WIP. Markov-chain-generated text, US/UK, 1700s/1800s/early 1900s/late 1900s
Ruby
1
star
26

1984-dissociated-text

Markov-chain-generated text, based on Orwell's 1984, in the browser.
1
star
27

bashblog

A Bash script that handles blog posting
Shell
1
star
28

human-numbers

Lists of numbers in American English
Shell
1
star
29

wikipedia-protected-classes

Fine tuning a text classification model to identify potentially protected classes of new articles
Jupyter Notebook
1
star
30

ndclj

ND.clj
Clojure
1
star
31

stable-diffusion-clock

Timepiece, imagery via Stable Diffusion
Jupyter Notebook
1
star