• Stars: 103
• Rank: 333,046 (Top 7%)
• Language: JavaScript
• Created over 11 years ago
• Updated about 8 years ago

Repository Details

An exploration of extracting tables from a PDF to CSV using PDF.js

pdf-js-csv

How It Works:

Extracting tables

http://garysieling.com/blog/extracting-tables-from-pdfs-in-javascript-with-pdf-js
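As a rough illustration of the general idea (not the code from this repository or the linked post), the sketch below groups positioned text fragments into rows by their vertical position and writes them out as CSV. The `{ str, x, y }` item shape, the `groupIntoRows` and `toCsv` helper names, and the `yTolerance` parameter are all assumptions made for this example; mapping real PDF.js text items to that shape is assumed to happen elsewhere, and missing cells or column alignment are not handled.

```javascript
// Hypothetical input: one object per text fragment, { str, x, y },
// where y grows upward as in PDF coordinate space.

// Group fragments whose y values are within yTolerance into the same row.
function groupIntoRows(items, yTolerance = 2) {
  // Sort top-to-bottom first, then left-to-right.
  const sorted = [...items].sort((a, b) => b.y - a.y || a.x - b.x);
  const rows = [];
  for (const item of sorted) {
    const last = rows[rows.length - 1];
    if (last && Math.abs(last.y - item.y) <= yTolerance) {
      last.cells.push(item);
    } else {
      rows.push({ y: item.y, cells: [item] });
    }
  }
  // Order the cells of each row by x and keep only their text.
  return rows.map((row) =>
    row.cells.sort((a, b) => a.x - b.x).map((cell) => cell.str)
  );
}

// Serialize rows as CSV, quoting fields that contain commas, quotes, or newlines.
function toCsv(rows) {
  const escapeField = (value) =>
    /[",\n\r]/.test(value) ? '"' + value.replace(/"/g, '""') + '"' : value;
  return rows.map((row) => row.map(escapeField).join(",")).join("\n");
}

// Example with made-up fragments from a two-column table.
const exampleItems = [
  { str: "Qty", x: 120, y: 700 },
  { str: "Name", x: 10, y: 700 },
  { str: "3", x: 120, y: 681 },
  { str: "Widget, large", x: 10, y: 680.5 },
];
console.log(toCsv(groupIntoRows(exampleItems)));
// Name,Qty
// "Widget, large",3
```

Grouping by a y tolerance rather than exact equality is a deliberate choice here, since fragments on the same visual line often differ slightly in baseline.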

Loading files in PDF.js using PhantomJS

http://www.garysieling.com/blog/integrating-phantomjs-and-pdf-js-inter-process-communication

Quick Start

npm install pdf2csv --no-bin-links

wget https://github.com/garysieling/pdf-js-csv/raw/master/examples/tests.pdf --no-check-certificate

wget https://raw.github.com/garysieling/pdf-js-csv/master/main.js --no-check-certificate

node main tests.pdf output.csv

More Repositories

1. jquery-highlighttextarea (JavaScript, 156 stars)
2. solr-git: Convert git commit history to Solr index (Java, 73 stars)
3. video-crawler: Crawl websites for videos from YouTube, Vimeo, SoundCloud, etc. (Scala, 30 stars)
4. chrome-scraper: Chrome-based scraper (JavaScript, 22 stars)
5. wikipedia-categorization (17 stars)
6. adsense-scraper (JavaScript, 12 stars)
7. solrkit: UI components for Solr (TypeScript, 11 stars)
8. grep-js (JavaScript, 9 stars)
9. browser-map-reduce: Browser-based map-reduce (6 stars)
10. scala-k-means: k-means (Scala, 6 stars)
11. git-solr-talk: Talk on indexing Git history in Solr (JavaScript, 6 stars)
12. fft-scala (Scala, 5 stars)
13. postgres-immutable-data (JavaScript, 3 stars)
14. social_media_counts: A PHP script to retrieve counts from Twitter, Google+, HN, and Reddit for blog posts (PHP, 3 stars)
15. intelligence_events: interesting (JavaScript, 3 stars)
16. chords (R, 2 stars)
17. transcript-alignment: Map closed captions to transcripts with Smith-Waterman alignment (JavaScript, 2 stars)
18. db-connection-proxy: Proxy to send queries to multiple databases simultaneously (C#, 2 stars)
19. srt-to-text: Extract sentences from closed caption files (SRT, VTT) (JavaScript, 2 stars)
20. video-collection-app: Web UI for collecting stock videos for ML (JavaScript, 2 stars)
21. apache-zeppelin-talk-slides (JavaScript, 1 star)
22. word2vec-dbpedia-solr (Python, 1 star)
23. test-screenshots: Use screenshots for integration testing (JavaScript, 1 star)
24. deeplens-experiments: AWS DeepLens experiments (Python, 1 star)
25. line-following-robot (1 star)
26. scaladocs-jekyll (HTML, 1 star)
27. vimrc (Vim Script, 1 star)
28. postgres-auditing: Data analysis and correction tools for auditing a Postgres database (Shell, 1 star)
29. lambda-video-crawler: AWS Lambda video crawler (JavaScript, 1 star)
30. email-alerts: Generate weekly emails by querying a full-text (Solr) index (TypeScript, 1 star)
31. spanish-english (Python, 1 star)
32. WPVariationTester (1 star)
33. findlectures.com (1 star)
34. sql-for-support: Using SQL parsing to generate queries that satisfy customer support questions (Python, 1 star)
35. findlectures-nlp: FindLectures.com Word2Vec model construction (Python, 1 star)