Eugene Yan (@eugeneyan)

Top repositories

1

applied-ml

๐Ÿ“š Papers & tech blogs by companies sharing their work on data science & machine learning in production.
24,324
star
2

open-llms

๐Ÿ“‹ A list of open LLMs available for commercial use.
10,867
star
3

ml-surveys

๐Ÿ“‹ Survey papers summarizing advances in deep learning, NLP, CV, graphs, reinforcement learning, recommendations, graphs, etc.
2,630
star
4

ml-design-docs

๐Ÿ“ Design doc template & examples for machine learning systems (requirements, methodology, implementation, etc.)
395
star
5

1-on-1s

๐ŸŒฑ 1-on-1 questions and resources from my time as a manager.
310
star
6

testing-ml

๐Ÿ” Minimal examples of machine learning tests for implementation, behaviour, and performance.
Python
199
star
7

obsidian-copilot

๐Ÿค– A prototype assistant for writing and thinking
Python
186
star
8

applyingml

๐Ÿ“Œ Papers, guides, and mentor interviews on applying machine learning for ApplyingML.comโ€”the ghost knowledge of machine learning.
JavaScript
160
star
9

papermill-mlflow

๐Ÿงช Simple data science experimentation & tracking with jupyter, papermill, and mlflow.
Jupyter Notebook
152
star
10

python-collab-template

๐Ÿ›  Python project template with unit tests, code coverage, linting, type checking, Makefile wrapper, and GitHub Actions.
Python
129
star
11

recsys-nlp-graph

๐Ÿ›’ Simple recommender with matrix factorization, graph, and NLP. Beating the regular collaborative filtering baseline.
Python
112
star
12

llm-paper-notes

Notes from the Latent Space paper club. Follow along or start your own!
73
star
13

fastapi-html

Sample repository demonstrating how to use FastAPI to serve HTML web apps.
Python
62
star
14

eugeneyan

Python
38
star
15

poc-docker-template

Simple template showing how to set up docker for reproducible data science with Jupyter notebooks.
Jupyter Notebook
21
star
16

text-to-image

Jupyter Notebook
13
star
17

nocode-ml

๐Ÿ˜ End-to-end machine learning; "no code" required!
12
star
18

discord-llm

Experimenting with LLMs to Research, Reflect, and Plan (LLM assistants, retrieval, and Discord integration)
Jupyter Notebook
11
star
19

learning-typescript

JavaScript
10
star
20

design-patterns

Java
7
star
21

deep-rl

Repository for deep reinforcement learning with OpenAI
Python
6
star
22

testing-pipelines

Python
6
star
23

kaggle_springleaf

Code for Kaggle Springleaf Email Prediction Challenge
Python
5
star
24

Computational-Thinking-and-Data-Science

edX: Introduction to Computational Thinking and Data Science (Oct 2014)
Python
5
star
25

ama

Ask Me Anything
4
star
26

Mining-Massive-Datasets

Coursera: Mining Massive Datasets (Sep 2014)
R
4
star
27

Time-Series-Analysis

Simple forecasting with Regression Model
R
3
star
28

raspberry-llm

Calling LLM APIs on a Raspberry Pi for lulz
Python
3
star
29

Statistical-Inference

This repository contains the lab assignments for the facilitation of John Hopkins University' Coursera MOOC on Statistical Inference.
R
3
star
30

kaggle_titanic

Code for Kaggle Titanic Challenge (and other learning)
HTML
3
star
31

Statistical-Learning

Stanford OpenX: Introduction to Statistical Learning
HTML
3
star
32

Data-Analysis-and-Statistical-Inference-Project

Coursera: Data Analysis & Statistical Inference Project (Feb 2014)
R
2
star
33

neural_networks_and_deep_learning

2
star
34

Twitter-SMA

Twitter Streaming and Analysis with Python and R
R
2
star
35

scratch

Jupyter Notebook
2
star
36

Getting-and-Cleaning-Data

Coursera: Getting and Cleaning Data (May 2014)
R
2
star
37

Computer-Science-and-Programming-In-Python

edX: Introduction to Computer Science and Programming in Python (July 2014)
Python
1
star
38

Misc

R
1
star
39

datagene

Jupyter Notebook
1
star
40

Interactive-Programming-in-Python

Coursera: Interactive Programming in Python (Apr 2014)
Python
1
star
41

R-Programming

Coursera: R Programming (May 2014)
R
1
star
42

Visualizations

Random Visualizations
R
1
star
43

json-to-utterances

Jupyter Notebook
1
star
44

DKSG-HOME

Sharing my R script used in the DKSG DataLearn for home
R
1
star
45

eugeneyan-comments

1
star
46

kaggle_otto

Code for Kaggle Otto Production Classification Challenge
R
1
star
47

Demand-Forecasting

Prototyping various forecasting techniques
R
1
star
48

Machine-Learning

Coursera: Machine Learning (Aug 2014)
MATLAB
1
star