• Stars
    star
    934
  • Rank 48,927 (Top 1.0 %)
  • Language
    Jupyter Notebook
  • License
    Apache License 2.0
  • Created over 5 years ago
  • Updated about 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

A smaller subset of 10 easily classified classes from Imagenet, and a little more French

Imagenette

🎶 Imagenette, gentille imagenette,

Imagenette, je te plumerai. 🎶

(Imagenette theme song thanks to Samuel Finlayson)


NB:

  • Versions of Imagenette and Imagewoof with noisy labels are now available as CSV files that come with the dataset.
  • The Imagenette and Imagewoof datasets changed on Dec 6th 2019. They now have a 70/30 train/valid split.
  • The old versions (which have a much smaller validation set) are still available with the same URLs, but the URLs below point to the new versions.
  • We've also added the Image网 dataset (see below for details). The leaderboards below been updated using the new datasets, using a strong. Can you beat it?...

The Datasets

Imagenette

Imagenette is a subset of 10 easily classified classes from Imagenet (tench, English springer, cassette player, chain saw, church, French horn, garbage truck, gas pump, golf ball, parachute).

'Imagenette' is pronounced just like 'Imagenet', except with a corny inauthentic French accent. If you've seen Peter Sellars in The Pink Panther, then think something like that. It's important to ham up the accent as much as possible, otherwise people might not be sure whether you're refering to "Imagenette" or "Imagenet". (Note to native French speakers: to avoid confusion, be sure to use a corny inauthentic American accent when saying "Imagenet". Think something like the philosophy restaurant skit from Monty Python's The Meaning of Life.)

The '320 px' and '160 px' versions have their shortest side resized to that size, with their aspect ratio maintained.

The dataset also comes with a CSV file with 1%, 5%, 25%, and 50% of the labels randomly changed to an incorrect label. More information about the noisy labels are provided in the "noisy_labels" folder. Leaderboards for 5% noise and 50% noise are maintained below.

Too easy for you? In that case, you might want to try Imagewoof.

Imagewoof

Imagewoof is a subset of 10 classes from Imagenet that aren't so easy to classify, since they're all dog breeds. The breeds are: Australian terrier, Border terrier, Samoyed, Beagle, Shih-Tzu, English foxhound, Rhodesian ridgeback, Dingo, Golden retriever, Old English sheepdog. (No we will not enter in to any discussion in to whether a dingo is in fact a dog. Any suggestions to the contrary are un-Australian. Thank you for your cooperation.)

The dataset also comes with a CSV file with 1%, 5%, 25%, and 50% of the labels randomly changed to an incorrect label. More information about the noisy labels are provided in the "noisy_labels" folder.

Imagewoof too easy for you too?!? Then get your hands on Image网.

Image网

Image网 is pronounced "Imagewang"; 网 means "net" in Chinese! Image网 contains Imagenette and Imagewoof combined, but with some twists that make it into a tricky semi-supervised unbalanced classification problem:

  • The validation set is the same as Imagewoof (i.e. 30% of Imagewoof images); there are no Imagenette images in the validation set (they're all in the training set)

  • Only 10% of Imagewoof images are in the training set!

  • The remaining are in the unsup ("unsupervised") directory, and you can not use their labels in training!

  • It's even hard to type and hard to say!

  • Full size download;

  • 320 px download;

  • 160 px download.

Why Imagenette?

I (Jeremy Howard, that is) mainly made Imagenette because I wanted a small vision dataset I could use to quickly see if my algorithm ideas might have a chance of working. They normally don't, but testing them on Imagenet takes a really long time for me to find that out, especially because I'm interested in algorithms that perform particularly well at the end of training.

But I think this can be a useful dataset for others as well.

Usage

If you are already using the fastai library, you can download and access these quickly with commands like:

path = untar_data(URLs.IMAGENETTE_160)

where path now stores the destination to ImageNette-160.

For researchers

  • Try to create a classifier that's as accurate as possible under various constraints (we'll keep leaderboards below, submit your PR with a link to your repo or gist!), such as:
    • Within a certain number of epochs: 5, 20, 40, 160
    • Within a certain budget on AWS or GCP (use spot or interruptible instances to save money): $0.05, $0.10, $0.25, $0.50, $1.00, $2.00
  • Experiment with other low resource problems like transfer learning from small datasets, using semi-supervised learning to help classify small datasets, etc
  • Test the impact of using different sized images, either separately, or together as part of training (i.e. progressive resizing)
  • Compare your algorithm on easy vs hard small datasets, which are otherwise very similar (Imagenette vs Imagewoof)
  • Ensure that you start from random weights - not from pretrained weights.

For students

  • Practice your modeling skills on a dataset that's very similar to Imagenet, but much less expensive to deal with
  • Do send me a PR with your other applications for this dataset!

Tips

  • Because there are only 10 categories, the usual "top 5 accuracy" isn't so interesting. So you should generally report top 1 accuracy when using Imagenette
  • The best approaches to 5 epoch training often don't scale well to more epochs
  • Data augmentation like mixup tends to only help for 80+ epochs

Leaderboard

Generally you'll see +/- 1% differences from run to run since it's quite a small validation set. So please only send in contributions that are higher than the reported accuracy >80% of the time. Here's the rules:

  • No inference time tricks, e.g. no: TTA, validation size > train size
  • Must start with random weights
  • Must be one of the size/#epoch combinations listed in the table
  • If you have the resources to do so, try to get an average of 5 runs, to get a stable comparison. Use the "# Runs" column to include this (note that train_imagenette.py provides a --runs flag to make this easy)
  • In the URL column include a link to a notebook, blog post, gist, or similar which explains what you did to get your result, and includes the code you used (or a link to it), including the exact commit, so that others can reproduce your result.

Imagenette Leaderboard

Size (px) Epochs URL Accuracy # Runs
128 5 fastai2 train_imagenette.py 2020-10 + MaxBlurPool + tuned hyperparams 87.43% 5, mean
128 20 fastai2 train_imagenette.py 2020-01 + MaxBlurPool 91.57% 5, mean
128 80 fastai2 train_imagenette.py 2020-01 93.55% 1
128 200 fastai2 train_imagenette.py 2020-01 94.24% 1
192 5 fastai2 train_imagenette.py 2020-01 + MaxBlurPool 86.76% 5, mean
192 20 fastai2 train_imagenette.py 2020-01 + MaxBlurPool 92.50% 5, mean
192 80 fastai2 train_imagenette.py 2020-01 94.50% 1
192 200 fastai2 train_imagenette.py 2020-01 95.03% 1
256 5 fastai2 train_imagenette.py 2020-01 + MaxBlurPool 86.85% 5, mean
256 20 fastai2 train_imagenette.py 2020-01 + MaxBlurPool 93.53% 5, mean
256 80 fastai2 train_imagenette.py 2020-01 94.90% 1
256 200 fastai2 train_imagenette.py 2020-01 95.11% 1

Imagenette w/Label Noise = 5%

Size (px) Epochs URL Accuracy # Runs
128 5 baseline 83.44% 1
128 20 baseline 89.53% 1
128 80 baseline 89.30% 1
128 200 baseline 90.04% 1
192 5 baseline 84.13% 1
192 20 baseline 90.65% 1
192 80 baseline 91.01% 1
192 200 baseline 91.08% 1
256 5 SESEMI 88.87% ± 0.67 5,mean±std
256 20 baseline 91.39% 1
256 80 SESEMI 92.95% ± 0.12 3,mean±std
256 200 SESEMI 93.96% ± 0.23 3,mean±std

Imagenette w/Label Noise = 50%

Size (px) Epochs URL Accuracy # Runs
128 5 baseline 66.60% 1
128 20 baseline 79.36% 1
128 80 baseline 50.80% 1
128 200 baseline 52.18% 1
192 5 baseline 67.54% 1
192 20 baseline 79.34% 1
192 80 baseline 52.51% 1
192 200 baseline 53.71% 1
256 5 SESEMI 76.72% ± 0.83 5,mean±std
256 20 baseline 79.21% 1
256 80 SESEMI 57.76% ± 0.39 3,mean±std
256 200 SESEMI 61.48% ± 0.33 3,mean±std

Imagewoof Leaderboard

Size (px) Epochs URL Accuracy # Runs
128 5 depthwise(x6) 76.61% 5, mean
128 20 depthwise(x4) 86.27% 5, mean
128 80 depthwise(x4) 87.83% 1
128 200 fastai2 train_imagenette.py 2020-01 87.20% 1
192 5 depthwise(x4) 81.15% 5, mean
192 20 depthwise(x4) 88.37% 5, mean
192 80 depthwise(x2) 90.30% 1
192 200 fastai2 train_imagenette.py 2020-01 89.54% 1
256 5 Resnet Trick + Mish + Sa + MaxBlurPool 78,84% 5, mean
256 20 Resnet Trick + Mish + Sa + MaxBlurPool 88,58% 5, mean
256 80 fastai2 train_imagenette.py 2020-01 90.48% 1
256 200 fastai2 train_imagenette.py 2020-01 90.38% 1

Image网 Leaderboard

Size (px) Epochs URL Accuracy # Runs
128 5 SwAV 72.94% 5,mean
128 20 SwAV 72.18% 3,mean
128 80 SwAV 69.53% 1
128 200 SwAV 66.04% 1
192 5 SwAV 77.07% 5,mean
192 20 SwAV 77.81% 3,mean
192 80 SwAV 74.9% 1
192 200 SwAV 71.77% 1
256 5 SwAV 79.56% 5,mean
256 20 SwAV 79.2% 3,mean
256 80 SESEMI 78.41% ± 0.39 5,mean±std
256 200 SESEMI 79.27% ± 0.20 3,mean±std

More Repositories

1

fastai

The fastai deep learning library
Jupyter Notebook
26,176
star
2

fastbook

The fastai book, published as Jupyter Notebooks
Jupyter Notebook
21,305
star
3

numerical-linear-algebra

Free online textbook of Jupyter notebooks for fast.ai Computational Linear Algebra course
Jupyter Notebook
10,134
star
4

courses

fast.ai Courses
Jupyter Notebook
5,571
star
5

course-v3

The 3rd edition of course.fast.ai
Jupyter Notebook
4,906
star
6

nbdev

Create delightful software with Jupyter Notebooks
Jupyter Notebook
4,829
star
7

fastpages

An easy to use blogging platform, with enhanced support for Jupyter Notebooks.
Jupyter Notebook
3,522
star
8

course-nlp

A Code-First Introduction to NLP course
Jupyter Notebook
3,414
star
9

course22

The fast.ai course notebooks
Jupyter Notebook
2,318
star
10

lm-hackers

Hackers' Guide to Language Models
Jupyter Notebook
1,756
star
11

fastprogress

Simple and flexible progress bar for Jupyter Notebook and console
Jupyter Notebook
1,083
star
12

fastcore

Python supercharged for the fastai library
Jupyter Notebook
961
star
13

course20

Deep Learning for Coders, 2020, the website
Jupyter Notebook
845
star
14

fastai2

Temporary home for fastai v2 while it's being developed
Jupyter Notebook
645
star
15

fastai_dev

fast.ai early development experiments
Jupyter Notebook
639
star
16

diffusion-nbs

Getting started with diffusion
Jupyter Notebook
606
star
17

ghapi

A delightful and complete interface to GitHub's amazing API
Jupyter Notebook
528
star
18

course22p2

course.fast.ai 2022 part 2
Jupyter Notebook
471
star
19

swiftai

Swift for TensorFlow's high-level API, modeled after fastai
Jupyter Notebook
457
star
20

course-v4

Please use fastbook's /clean folder instead of this
446
star
21

imagenet-fast

Python
302
star
22

nbdev_template

Template for nbdev projects
Python
290
star
23

timmdocs

Documentation for Ross Wightman's timm image model library
Jupyter Notebook
277
star
24

numerical-linear-algebra-v2

Jupyter Notebooks for Computational Linear Algebra course, taught summer 2018 in USF MSDS program
Jupyter Notebook
269
star
25

fast_template

A template for really easy blogging with GitHub Pages
HTML
250
star
26

fastdoc

Create publication-quality books from Jupyter notebooks
Jupyter Notebook
230
star
27

fastsetup

Setup all the things
Shell
219
star
28

fastdiffusion

Notes and plans for fastdiffusion course
Jupyter Notebook
202
star
29

fastai_old

OLD REPO - PLEASE USE fastai/fastai
Jupyter Notebook
179
star
30

docker-containers

Docker images for fastai
Shell
172
star
31

book_nbs

Notebooks for upcoming fastai book (draft / incomplete)
Jupyter Notebook
166
star
32

fastscript

A fast way to turn your python function into a script
CSS
164
star
33

word-embeddings-workshop

This contains materials for the word embeddings workshop
HTML
123
star
34

fastec2

AWS EC2 computer management for regular folks
Python
121
star
35

execnb

Execute a jupyter notebook, fast, without needing jupyter
Jupyter Notebook
111
star
36

fastai1

v1 of the fastai library. v2 is the current version. v1 is still supported for bug fixes, but will not receive new features.
Jupyter Notebook
98
star
37

tf-fit

Fit your tensorflow model using fastai and PyTorch
Python
91
star
38

fastgpu

A queue service for quickly developing scripts that use all your GPUs efficiently
Jupyter Notebook
81
star
39

fastsql1

A bit of extra usability for sqlalchemy v2.
Jupyter Notebook
77
star
40

nbdev-template

Python
74
star
41

book.fast.ai

Information for readers of the fastai book
Jupyter Notebook
67
star
42

fastdot

A simple wrapper over `pydot` and `graphviz` which fixes some sharp edges
Jupyter Notebook
63
star
43

fastkaggle

Kaggling for fast kagglers!
Jupyter Notebook
52
star
44

nbprocess-old

Process and export Jupyter Notebooks fast (Jupyter not required)
Jupyter Notebook
52
star
45

randomized-SVD

demos for PyBay talk: Using Randomness to make code faster
Jupyter Notebook
50
star
46

fastai3

Research repo for code that may or may not end up in fastai3
Jupyter Notebook
50
star
47

pypi_template

A minimal template for creating a pypi package
Python
49
star
48

fastbook2e

The fastai book, 2nd edition (in progress)
Jupyter Notebook
46
star
49

fastdownload

Easily download, verify, and extract archives
Jupyter Notebook
44
star
50

course22-web

Website for Practical Deep Learning for Coders 2022
Jupyter Notebook
43
star
51

fastforest

A forest that is fast
C++
41
star
52

dotfiles

Vim Script
41
star
53

nbdev-vscode

TypeScript
41
star
54

bear_voila

Demo bear classifier with fastai and Voila
Jupyter Notebook
40
star
55

workflows

Composite Actions workflows for use in fastai projects
Python
39
star
56

tinykernel

A minimal Python kernel so you can run Python in your Python
Python
39
star
57

wireguard-fast

Simplified installation of WireGuard server for Ubuntu
Shell
37
star
58

minai

A miniture AI training framework for PyTorch
Jupyter Notebook
35
star
59

tinypets

Minimal JS interface to an image classifer
HTML
34
star
60

fastai-nbstripout

much faster nbstripout and autotrust notebooks local git configuration
Python
33
star
61

fastai-docs

Documentation for fastai
HTML
32
star
62

pagegraph

Draw, modify, and execute graphs using a simple python dict interface
Jupyter Notebook
29
star
63

masks4all

Make your own mask. My mask protects you. Your mask protects me.
Jupyter Notebook
26
star
64

apl-study

fast.ai APL study group notes
Jupyter Notebook
25
star
65

fastconda

Get packages onto your conda channel faster
Jupyter Notebook
22
star
66

git-tools

Tools to make git easier to use and to avoid the learning curve
Python
20
star
67

tweetrel

Use GitHub Actions to send a tweet when you make a new release
Jupyter Notebook
18
star
68

fastcgi

A fastcgi handler for Python's `socketserver` classes
Jupyter Notebook
17
star
69

fastrelease

DEPRECATED--all functionality moved to nbdev
Jupyter Notebook
15
star
70

fastpy

An easy way to start a python programming environment using GitHub Codespaces.
15
star
71

paperspace-setup

Setup a paperspace instance for fastai
Shell
14
star
72

ethics.fast.ai

The fast.ai data ethics course
Jupyter Notebook
14
star
73

fastcups

HTML
14
star
74

jupytercon-2023

Materials for "Write, Document, Test and Distribute Python Packages With Jupyter & Quarto", JupyterCon 2023.
Jupyter Notebook
14
star
75

fastlinkcheck

Check local static links and online links fast and in parallel
Jupyter Notebook
13
star
76

nbdev-index

nbdev docs lookup for a few libraries and python itself
Python
12
star
77

dlcert2

Web site source for Practical Deep Learning for Coders (2018)
Jupyter Notebook
12
star
78

Str8Pix

find the optimal rotation to correct skewed photos
Jupyter Notebook
11
star
79

dlcert1

Deep learning certificate part 1
10
star
80

cards_deck

A minimal example of nbdev based on Allen Downey's Think Python 2nd Ed
Jupyter Notebook
10
star
81

docments

Document parameters using comments
10
star
82

nbdev2

Temporary repo for nbdev2
Jupyter Notebook
7
star
83

spacy_conda

conda packages for spacy and deps
7
star
84

hugo-mathjax

Hugo with goldmark-mathjax
Jupyter Notebook
6
star
85

fastwebhook

A simple GitHub webhook server
Jupyter Notebook
6
star
86

wandbfastai

Jupyter Notebook
5
star
87

nbeverything

5
star
88

nbdev-jekyll-theme

CSS
4
star
89

nbdev_cards

A sample of how to create a deck of cards lib using nbdev
Jupyter Notebook
4
star
90

nbdev-demo

4
star
91

getcaddy

Simple approach to getting the Caddy web server
Shell
4
star
92

nbdev-spaces-demo

A demo of how to create a Hugging Face Space with gradio within a nbdev project.
Jupyter Notebook
4
star
93

fastdispatch

Wrapper for plum dispatch to make it more compatible with fastcore's typedispatch
Jupyter Notebook
3
star
94

tanishq_blog

CSS
3
star
95

docs

3
star
96

ghtop-demo

A CLI tool for showing GitHub events, written with Rich
Jupyter Notebook
3
star
97

logos

Logos for fastai for use in websites, icons, etc.
2
star
98

jb-nbdev

Exploring how nbdev may integrate with JupyterBook
Jupyter Notebook
2
star
99

nbdev-hello-world

Hello world from nbdev
Python
2
star
100

conda-recipes

Recipes for non-fastai projects that fastai provides packages for.
Python
2
star