OpenSource Connections (@o19s)

Top repositories

1

elasticsearch-learning-to-rank

Plugin to integrate Learning to Rank (aka machine learning for better relevance) with Elasticsearch
Java
1,466
star
2

relevant-search-book

Code and Examples for Relevant Search
Jupyter Notebook
292
star
3

quepid

Improve your Elasticsearch, OpenSearch, Solr, Vectara, Algolia and Custom Search search quality.
Ruby
271
star
4

hello-ltr

Set of Jupyter notebooks demonstrating Learning to Rank integrated with Solr and Elasticsearch
Jupyter Notebook
156
star
5

elyzer

"Stop worrying about Elasticsearch analyzers", my therapist says
Python
153
star
6

splainer

Elasticsearch/Solr Sandbox for exploring explain information and tweaking
JavaScript
135
star
7

hello-nlp

A natural language search microservice
Python
94
star
8

awesome-search-relevance

Tools and other things for people who work on search relevance & information retrieval
81
star
9

Spyglass

Simple search results with Solr and EmberJS
JavaScript
58
star
10

solr-to-es

Migrate a Solr node to an Elasticsearch index.
Python
53
star
11

lucene-query-example

Educational Examle of a custom Lucene Query & Scorer
Java
48
star
12

solr_nginx

Starter Reverse Proxy Configuration for Solr
47
star
13

SemanticSearchInNumpy

XSLT
44
star
14

hangry

Vector search in Lucene based search attempting to use just the existing Lucene data structures (experimental)
Java
43
star
15

RankyMcRankFace

Hardened Fork of Ranklib learning to rank library
Java
43
star
16

trireme

Migration tool providing support for Apache Cassandra, DataStax Enterprise Cassandra, & DataStax Enterprise Solr.
Python
37
star
17

elastic-graph-recommender

Building recommenders with Elastic Graph!
JavaScript
37
star
18

elasticsearch-ltr-demo

This demo uses data from TheMovieDB (TMDB) to demonstrate using Ranklib learning to rank models with Elasticsearch.
HTML
36
star
19

lazy-semantic-indexing

Elasticsearch Latent Semantic Indexing experimentation
Python
33
star
20

pdf-discovery-demo

Demonstration of searching PDF document with Solr, Tika, and Tesseract
JavaScript
29
star
21

match-query-parser

Search a single field with different query time analyzers in Solr
Java
25
star
22

splainer-search

Angular JS Solr and Elasticsearch and OpenSearch Diagnostic Search Services
JavaScript
25
star
23

tmdb_dump

Dump TheMovieDB
Python
21
star
24

es-tmdb

Elasticsearch TMDB examples
Python
20
star
25

solr-tmdb

TheMovieDB in Solr
Python
19
star
26

skipchunk

Extracts a latent knowledge graph from text and index/query it in elasticsearch or solr
Python
19
star
27

StackExchangeSolrIndexing

AutoTaxonomyExtractionAndTagging
XML
18
star
28

cfn-solr

Cloud formation script for solr servers
Shell
16
star
29

solr_angular_demo

A little search widget for instant Solr search with angular
JavaScript
15
star
30

lucene-bm25f

BM25F demo with lucene using BlendedTermQuery and a custom similarity
Java
15
star
31

bearded-wookie

An experiment in visualizing your Solr index via term counts, document counts, and memory usage per field and data type.
CSS
15
star
32

search-metrics

Python functions for popular relevance metrics (ndcg, err, etc)
Python
14
star
33

elasticsearch-image-search

Stupid Experiments in Elasticsearch Image Search
Jupyter Notebook
14
star
34

solr-movielens-recommender

Movielens collaborative filtering with Solr streaming expression
Python
11
star
35

grand_central

Docker & Kubernetes deployment system for dynamic environments.
Java
11
star
36

agent_q

Headless agent for test driven relevancy with Quepid.com
Ruby
10
star
37

ltr-synth-judg

Experiments in creating synthetic training data for learning to rank
Python
9
star
38

payload-component

Solr component that surfaces payloads for matching terms
Java
9
star
39

goRank

click tracking for creating judgement lists for search-y stuff
Go
8
star
40

puppet-solr

Puppet module for installing solr with a stand alone jetty server
Shell
7
star
41

semantic-search-course

Semantic Search Course, Originally delivered at Code4Lib
Python
7
star
42

Sample-Spark-Project

Sample Spark project with Scala and SBT
Scala
7
star
43

solr_dump

Dump Solr docs to file; Write dumped docs to a Solr
Python
7
star
44

lucene_codec_hello_world

Starting point and instructions on developing a Lucene Codec
Java
7
star
45

SolrSwan

SolrSwan is a query parser and highlighter for Solr that accepts proximity and Boolean queries.
Java
6
star
46

solr-docker

Sample Dockerfiles for running Solr in a container
6
star
47

o19s-lambda

AWS Lambda Functions to make your life easier.
JavaScript
6
star
48

StackExchangeElasticSearch

Playing with ElasticSearch and the SciFi Stackexchange Dataset
Python
6
star
49

highlighting-pdf-viewer

A component (written in Vue) that supports highlighting of words in the PDF document.
Vue
6
star
50

elasticsearch-vagrant

An ubuntu 14.04 vagrant box running Elasticsearch
Shell
5
star
51

jackhanna

Simple CLI for Zookeeper
Java
5
star
52

keel

This gem provides a few easy to run rake tasks to deploy your Rails application to a Kubernetes cluster.
Ruby
5
star
53

bad-libs

📝 Automatically converts any book into a Mad-Libs style game of silliness using spaCy. Free Charles Dickens included!
Jupyter Notebook
4
star
54

elasticsearch-query-builder-example

Basic Elasticsearch Query Builder Plugin
Java
4
star
55

natural-language-search

Colaboratory notebooks for OSC's Natural Language Search training
Jupyter Notebook
4
star
56

opensearch-ubi

OpenSearch plugin for User Behavior Insights
Java
4
star
57

word2vec-experiments

Some experimentation with word2vec
Jupyter Notebook
3
star
58

trec-news-index

Index for the TREC Washington Post corpus
Jupyter Notebook
3
star
59

twittalytics

Twitter Analytics with Cassandra
Python
3
star
60

tlre-nlp

Materials for "Think Like A Relevance Engineer - NLP" Training
Jupyter Notebook
3
star
61

solr-monitor

Java
2
star
62

search-viz

Various experiments demonstrating pairing realtime visualizations with search results.
JavaScript
2
star
63

tm-import

Importing public domain Trademark XML from Google
Go
2
star
64

elasticsearch-heatmap

Java
2
star
65

o19s-blog-ltr

Using the Elasticsearch LTR demo w/ some hand-created judgments
Python
2
star
66

JodaTimeCodecs

A collection of Cassandra TypeCodecs for serializing and deserializing Joda Time objects.
Java
2
star
67

Spark-Cassandra-Demo

Demo code for loading data into Cassandra and Solr with Spark.
Java
2
star
68

trec-podcasts-index

Index Spotify's 100k podcasts dataset into Elasticsearch
Python
2
star
69

ispy_component

Relevance debugging component for Solr
Java
2
star
70

clustering-lowes-grouts

Code to support a blog post about extracting tags from Lowes.com for clustering unsanded grout search results
JavaScript
2
star
71

visualizing-signals

A Practical Introduction to Exploring and Visualizing E-Commerce Search Signal Data
Shell
2
star
72

solr-query-parser-demo

A "surround"-like and capitalization custom query parsers demo
Java
2
star
73

metric-plots

Plots for search metrics nDCG and ERR
JavaScript
1
star
74

jupyter-blogs

Drafts of Doug's Jupyter Notebook Blog Posts
Python
1
star
75

os-tmdb

TLRE OpenSearch
Python
1
star
76

quepid-jupyterlite

Jupyter notebooks to help with search relevancy measurements, optimized for Quepid.
Jupyter Notebook
1
star
77

ndoch-trademark-challenge

Applications built for the National Day of Civic Hacking's USPTO Trademarks Challenge
Ruby
1
star
78

movielens-judgments

experiments using movielens genome tags as an experimental ltr training set
Python
1
star
79

training_coms

R scripts to manage bulk training communications and certificate generation
R
1
star
80

jarjar

Joint Analysis Review of Judgements And Raters
Jupyter Notebook
1
star
81

puppet-modules

puppet modules for o19s
Puppet
1
star
82

thats-trackable

Running app for XC team.
Ruby
1
star
83

ggoodggraphics

The grammer of graphics is powerful and now in Python thanks for `plotnine`!
Jupyter Notebook
1
star