  • Stars
    1
  • Language
    Jupyter Notebook
  • License
    MIT License
  • Created 5 months ago
  • Updated 5 months ago

Repository Details

A repo for analyzing the CATMuS dataset.

More Repositories

1

ocr_python_textbook

Jupyter Notebook
205
star
2

freecodecamp_spacy

Jupyter Notebook
129
star
3

topic_modeling_textbook

Jupyter Notebook
105
star
4

streamlit-pandas

Python
85
star
5

spacyex

SpaCyEx allows the creation of spaCy Matcher patterns with RegEx-like syntax.
Python
57
star
6

ner_youtube

Python
54
star
7

LeetTopic

Python
53
star
8

python_for_dh

Jupyter Notebook
41
star
9

holocaust_ner_lessons

Jupyter Notebook
40
star
10

qwen2-vl-finetune-huggingface

This project is a collection of fine-tuning scripts to help researchers fine-tune Qwen 2 VL on HuggingFace datasets.
Python
38
star
11

hobbit-spacy

Jupyter Notebook
23
star
12

biospacy

Python
21
star
13

spacy_tutorials_3x

Jupyter Notebook
20
star
14

tap-2023-spacy-01

Jupyter Notebook
20
star
15

ww2-spacy

Python
17
star
16

date-spacy

Python
15
star
17

youtube-bertopic

Jupyter Notebook
14
star
18

youtube_booknlp

HTML
14
star
19

youtube-florence-table

Table detection with Florence.
Jupyter Notebook
13
star
20

spacy-chunks

An easy way to chunk spaCy docs.
Python
11
star
21

tap-2024-vector-databases

This is my 2024 course for TAP Institute on Vector Databases and Semantic Searching.
Jupyter Notebook
11
star
22

latin_ner_lesson

Python
11
star
23

streamlit_lessons_youtube

Python
9
star
24

youtube-txtai

Jupyter Notebook
9
star
25

youtube_text_classification

This repo is meant to work alongside my YouTube series on Text Classification.
Jupyter Notebook
9
star
26

vulgata-spacy

Python
9
star
27

weaviate-filter

A package for creating GraphQL filters for Weaviate.
Python
9
star
28

tap-2024-spacy-llms

This is the repository for my 2024 TAP Institute course on spaCy with LLMs.
Jupyter Notebook
8
star
29

bagpipes-spacy

Bagpipes spaCy is a collection of custom spaCy pipeline components designed to enhance text processing capabilities.
Python
8
star
30

keyword-spacy

Keyword spaCy is a spaCy pipeline component for extracting keywords from text using cosine similarity.
Jupyter Notebook
8
star
31

bulk-image-clustering

Python
7
star
32

intro-to-ml

Jupyter Notebook
7
star
33

instagram-analysis

Python
7
star
34

spacy_3_ner_tutorials

Jupyter Notebook
7
star
35

textbook_pandas

Jupyter Notebook
7
star
36

intermediate-python-for-dh

HTML
6
star
37

youtube-rembg

Jupyter Notebook
6
star
38

tap-2024-rag

Jupyter Notebook
6
star
39

ml-project-template

My template for machine learning projects
6
star
40

tap-2022-pandas

Jupyter Notebook
5
star
41

intro-nlp-tap-2022

Jupyter Notebook
5
star
42

fewshot-text

Jupyter Notebook
5
star
43

spacy_custom_vectors

Jupyter Notebook
4
star
44

leettopic-test

Jupyter Notebook
4
star
45

textbook_digital_humanities

Python
4
star
46

digital_alcuin_project

Jupyter Notebook
4
star
47

text-analysis-for-ancient-and-medieval-languages

Jupyter Notebook
4
star
48

youtube-shakespeare

Jupyter Notebook
4
star
49

number-spacy

Number spaCy is a custom spaCy pipeline component that enhances the identification of number entities in text and fetches the parsed numeric values using spaCy's token extensions.
Python
4
star
50

youtube-spacy-ml

Jupyter Notebook
4
star
51

youtube-streamlit-link-analysis

A quick repository for using the Streamlit link analysis component.
Python
3
star
52

neural_networks_for_dh

3
star
53

skweak

Jupyter Notebook
3
star
54

quiz-generator

Jupyter Notebook
3
star
55

youtube-bm25

Jupyter Notebook
3
star
56

wjbmattingly

3
star
57

youtube-streamlit-image-grid

Python
3
star
58

spacy_components

Jupyter Notebook
3
star
59

gliner-finetune

A package for generating synthetic data and fine-tuning a GLiNER model.
Jupyter Notebook
3
star
60

dap_app

HTML
3
star
61

florida

Simple tools to make my life easier.
Python
3
star
62

streamlit-openai-functions

Python
3
star
63

yale-lux-overlap

This project demonstrates how to connect multiple records in Yale's Lux search to a single record.
Python
2
star
64

vulgata-spacy-app

Python
2
star
65

cltk_tutorial

Files for a CLTK tutorial.
2
star
66

Patrologia-Latina

This repository is for functions and tools for handling Patrologia Latina (PL) texts.
Python
2
star
67

textbook_pdfs

Jupyter Notebook
2
star
68

ushmm_test_app

Python
2
star
69

cltk-textbook

Jupyter Notebook
2
star
70

latin_cltk_mwt

Python
2
star
71

text_class_models

Jupyter Notebook
2
star
72

text2xmlnolibs

This is the code for a simple video I made on how to convert text to XML in Python without libraries.
Python
2
star
73

Alcuin-Letters

This page hosts the Python functions developed by William Mattingly for quantifying and analyzing Alcuin's letter collections.
Python
2
star
74

grk_ang_ner_cltk

Python
2
star
75

Vulgate-Neural-Network

This is a sample of the code necessary to train a neural network capable of identifying Scripture in a text. I also include the functions for extracting that data from the text.
Python
2
star
76

open-medieval-bibliography

Open-source medieval bibliography.
Jupyter Notebook
2
star
77

ushmm_sent_embedding_app

Jupyter Notebook
1
star
78

medieval-htr

A demo for how to use TrOCR Medieval HTR models.
Jupyter Notebook
1
star
79

ushmm_text_pipeline

Python
1
star
80

Grimbot

In the grim future there are only dice, and math
Python
1
star
81

top2vec-demo

Jupyter Notebook
1
star
82

ushmm_ner_app

Python
1
star
83

bap_app

Jupyter Notebook
1
star
84

streamlit-110-demo

Python
1
star
85

florence-2-finetune

Fine-tuning Florence 2 on CATMuS.
Python
1
star
86

themedievalworld

HTML
1
star
87

youtube-feather

Jupyter Notebook
1
star
88

word_embedding_ushmm

Python
1
star
89

tap-2022-multilingual-ner

Jupyter Notebook
1
star
90

ushmm

A Python package for working with data at the United States Holocaust Memorial Museum.
HTML
1
star
91

bap_sent_embedding

HTML
1
star
92

christie

Jupyter Notebook
1
star
93

streamlit-textbook

1
star
94

setting_pyvis

Python
1
star
95

weaviate-vulgate

Latin Vulgate search engine.
Jupyter Notebook
1
star
96

demo-latincy

Jupyter Notebook
1
star
97

rebecca_text

Jupyter Notebook
1
star
98

tiktok_python

Python
1
star
99

spacyex-demo

Demo for the SpaCyEx library.
Python
1
star
100

youtube-clip-demo

Jupyter Notebook
1
star