• Stars: 1
• Language: Python
• Created over 1 year ago
• Updated over 1 year ago

Repository Details

Extract text, images and metadata from PDFs

More Repositories

1

example-pdf-search

Search PDFs using Jina, DocArray and Jina Hub
Python
53
star
2

easy_text_generator

Generate text from machine-learning models right in your browser
Python
19
star
3

neural-search-notebooks

Notebooks for docarray, Jina, Finetuner, and other products from Jina AI
Jupyter Notebook
11
star
4

executor-spacy-sentencizer

Jina Executor to sentencize Document text into chunks using spaCy Sentencizer
Python
10
star
5

ultrascope-doc

Documentation for the Ultrascope project
8
star
6

jina-shortest-program

Shortest viable Jina program that actually does something useful
Python
8
star
7

wikihouse

Scale models of WikiHouse
7
star
8

simple-jina-examples

Learn Jina one baby step at a time
Python
6
star
9

jina-meme-search-image-backend

Jina search engine to find memes using similar images
Python
5
star
10

example-chatbot

Chatbot using Jina, Jina Hub and DocArray with Streamlit frontend
Python
5
star
11

godel_escher_bach

Learning resources for the book Gödel Escher Bach
Jupyter Notebook
4
star
12

jina-wikipedia-sentences

Using Jina to search through sentences from English-language Wikipedia
Python
4
star
13

jina_celeb_twin

(Mis)use cutting edge neural search to find your celebrity lookalike!
Python
4
star
14

jina-streamlit-frontend

A simple front-end for the Jina neural search framework, written in Streamlit, that supports querying with image, text, or drawing on a canvas.
Python
3
star
15

example-anime-book-girls

Search Anime Book Girls using Jina AI's docarray
Python
3
star
16

jina-meme-search-frontend

Streamlit frontend for Jina meme search
Python
2
star
17

example-knowledge-base-search

Search knowledge bases using Jina
Python
2
star
18

semantle-docarray

Semantle in DocArray and GPT
Python
2
star
19

executor_tesseract_ocr

Executor to extract text from images
Python
2
star
20

example-knowledge-base-stackoverflow

Stack Overflow search engine using Jina
Python
2
star
21

quick-metrics-tool

Python
1
star
22

ml-datasets

Python
1
star
23

trekbot

Python
1
star
24

executor-text-cleaner

Clean up messy chunks of text
Python
1
star
25

transppt

Translate PowerPoint files from the command line
Python
1
star
26

executor-pdf-table-extractor

Extract PDF Tables
Python
1
star
27

trekbot_script-writer

A bot that writes (bad) Star Trek scripts
Jupyter Notebook
1
star
28

jina-doc-sample

1
star
29

alexcg1

Personal README
1
star
30

executor-html-stripper

Executor to strip HTML from doc.text in Jina
Python
1
star
31

mediawiki2book

Convert mediawiki pages into beautiful PDF books
Python
1
star
32

jina-2.0-playground

Playing with Jina 2.0
Python
1
star
33

executor-uri-downloader

Downloads a file from a URI and stores it in a blob
Python
1
star
34

executor-chunk-level-equalizer

Put all text chunks on doc.chunks level, not lower traversal levels
Python
1
star
35

executor_image_uri_to_blob

Jina Executor: Convert image URI to blob
Python
1
star
36

streamlit-dalle-flow

Python
1
star
37

example-clip-as-service

Run a CLIP-as-service frontend in your browser
Python
1
star
38

foodie-thinkgpt

Python
1
star
39

clip-as-service-server

CLIP-as-service server
Dockerfile
1
star
40

executor-pandoc-text-extractor

Extract text using Pandoc
Python
1
star
41

collar_stay

Basic 3D-printed collar stays for one of my shirts
OpenSCAD
1
star