Vik Paruchuri (@VikParuchuri)

Top repositories

1

marker

Convert PDF to markdown quickly with high accuracy
Python
9,051
star
2

surya

OCR, layout analysis, reading order, line detection in 90+ languages
Python
7,035
star
3

apartment-finder

A Slack bot that helps you find an apartment.
Python
1,058
star
4

zero_to_gpt

Go from no deep learning knowledge to implementing GPT.
Jupyter Notebook
813
star
5

texify

Math OCR model that outputs LaTeX and markdown
Python
523
star
6

textbook_quality

Generate textbook-quality synthetic LLM pretraining data
Python
449
star
7

libgen_to_txt

Convert all of libgen to high quality markdown
Python
223
star
8

pdftext

Extract structured text from pdfs quickly
Python
204
star
9

scribe

Simple speech recognition using your microphone.
Python
122
star
10

researcher

Concise answers to search queries using Google and GPT-3. Includes citations.
Python
70
star
11

scan

Score essays automatically with an easy web interface.
Python
40
star
12

evolve-music2

Evolve music automatically with python -- rewrite of evolve-music.
Python
40
star
13

classified

Score LLM pretraining data with classifiers
Python
35
star
14

evolve-music

Superseded by github.com/vikparuchuri/evolve-music2 -- use that instead.
C
25
star
15

simpsons-scripts

Find out how much the simpsons characters like each other with text and audio analysis.
Python
23
star
16

movide

The student-centric learning platform.
Python
18
star
17

snapcheck

Find out if your info was leaked.
Python
15
star
18

political-positions

Analyze politics.
Python
14
star
19

vikparuchuri.com

Code for vikparuchuri.com -- personal blog.
Ruby
13
star
20

boston-python-ml

Text scoring/classification presentation
JavaScript
9
star
21

percept

A modular machine learning framework that is easy to test and deploy.
Python
9
star
22

wp-deployment

Deploy wordpress with multisite to ec2 with ansible.
Python
7
star
23

spotify-export

Export albums from Spotify into Google Play Music.
Python
7
star
24

algorithms

Pure python implementations of various algorithms, including a matrix class.
Python
6
star
25

vikparuchuri-affirm

CSS
5
star
26

ds-webinar

How to learn data science webinar presentation
CSS
5
star
27

nyt-articles

Get articles from new york times API.
Python
5
star
28

triton_tutorial

Tutorials for Triton, a language for writing gpu kernels
Jupyter Notebook
4
star
29

pdf_to_md

Python
4
star
30

ml-math

Svelte
3
star
31

TulaLensSurvey

Android app that makes it easy to survey people.
Java
3
star
32

medicare-analysis

Analyze medicare data from the recent release.
CSS
3
star
33

sports-stats

Try to rethink sports statistics.
Python
3
star
34

bostonpython2015

Presentation for boston python 2015
CSS
2
star
35

dscontent-starter

2
star
36

Presentations

JavaScript
1
star
37

vik-blog

HTML
1
star
38

tulalens-survey-web

Web component of android survey app.
Ruby
1
star
39

nextml-talk

CSS
1
star
40

vj-wedding2

A site I made for a wedding.
JavaScript
1
star
41

matter

Chrome extension that highlights important passages.
JavaScript
1
star
42

vj-wedding

Placeholder site for a wedding (with countdown)
JavaScript
1
star
43

affirm-themes

Themes for affirm.io.
CSS
1
star
44

openphi

1
star