• Stars
    star
    197
  • Rank 196,552 (Top 4 %)
  • Language
    Python
  • License
    MIT License
  • Created over 5 years ago
  • Updated over 2 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

ImageNet-Sketch data set for evaluating model's ability in learning (out-of-domain) semantics at ImageNet scale

ImageNet-Sketch

ImageNet-Sketch

Description

ImageNet-Sketch data set consists of 50000 images, 50 images for each of the 1000 ImageNet classes. We construct the data set with Google Image queries "sketch of __", where __ is the standard class name. We only search within the "black and white" color scheme. We initially query 100 images for every class, and then manually clean the pulled images by deleting the irrelevant images and images that are for similar but different classes. For some classes, there are less than 50 images after manually cleaning, and then we augment the data set by flipping and rotating the images.

This github repository consists of the scripts we used to conduct query and clean images.

Download the Data

  • Hugging Face dataset
    • one can use the data with
      from datasets import load_dataset
      dataset = load_dataset("imagenet_sketch")  
      
    • Thanks to Nathan Raw for setting up
  • Links
  • Information
    • zip file is 7.8 GB
    • extracted files will be 8.4 GB

ImageNet-Sketch Leaderboard

Method Reference Backbone From Scratch* Top1-Acc. Top5-Acc.
Texture Debiased Augmentation Hermann et al. (NeurIPS 2020) ResNet50 Y 30.9% 51.4%
Anisotropic diffusion Mishra et al. ResNet50 Y 24.49% 41.81%
Random Convolutions Xu et al. AlexNet Y 18.09% 35.40%
RSC Huang et al. (ECCV 2020) AlexNet Y 16.12% 30.78%
PAR Wang et al. (NeurIPS 2019) AlexNet N 13.06% 26.27%
AlexNet Baseline AlexNet N/A 12.04% 24.80%

*This column indicates whether the model is trained from the scratch or built and fine-tuned based a pretrained backbone model.

Analysis

Reference

The data set is introduced together with the following paper in NeurIPS 2019, so if you find this data set helpful, please consider citing it:

Learning Robust Global Representations by Penalizing Local Predictive Power

@inproceedings{wang2019learning,
        title={Learning Robust Global Representations by Penalizing Local Predictive Power},
        author={Wang, Haohan and Ge, Songwei and Lipton, Zachary and Xing, Eric P},
        booktitle={Advances in Neural Information Processing Systems},
        pages={10506--10518},
        year={2019}
}

More Repositories

1

HFC

Implementation for the paper (CVPR Oral): High Frequency Component Helps Explain the Generalization of Convolutional Neural Networks
Python
238
star
2

Robustar

Interactive Toolbox for Robust Vision Classification
HTML
44
star
3

HEX

Example implementation for the paper: (ICLR Oral) Learning Robust Representations by Projecting Superficial Statistics Out
Python
27
star
4

SelectAdditiveLearning

implementation for the paper "Select-Additive Learning: Improving Cross-individual Generalization in Multimodal Sentiment Analysis"
Python
21
star
5

PAR_experiments

Learning Robust Global Representations by Penalizing Local Predictive Power (NeurIPS 2019))
Python
17
star
6

PAR

Implementation of Patch-wise Adversarial Regularization from "Learning Robust Global Representations by Penalizing Local Predictive Power"
Python
17
star
7

CF

Removing Confounding Factors Associated Weights in Deep Neural Networks Improves the Prediction Accuracy for Healthcare Applications
Python
15
star
8

HEX_experiments

For replication of the experiments in the paper Learning Robust Representations by Projecting Superficial Statistics Out
Python
13
star
9

LMM-Python

A python linear mixed model package model for GWAS
Python
12
star
10

thePrecisionLasso

implementation for Precision Lasso: accounting for correlations and linear dependencies in high-dimensional genomic data
Jupyter Notebook
12
star
11

BioImaging

Data collection software for the paper Using EEG to Improve Massive Open Online Courses Feedback Interaction
Java
9
star
12

DMM

Python
6
star
13

CMM

Coupled Mixed Model
Python
5
star
14

ADSubtyping

Jupyter Notebook
3
star
15

CS-LMM

Implementation for CS-LMM (Constrained Sparse multi-locus Linear Mixed Model)
Python
2
star
16

KMM

Kernel Mixed Model
Python
1
star
17

TaxiWaitingTimePrediction

This is a repository for big data project, predict the waiting time for a taxi in Beijing
Java
1
star
18

ICURCoderHelper

Python
1
star
19

MiceDataPreprocessing

Python
1
star
20

BrownClustering

This is a python implementation of Brown word clustering algorithm
Python
1
star
21

DeepGenerativeTransfer

Python
1
star
22

GenAMapWebsite

Gate website for GenAMap
CSS
1
star
23

sLMMn

Python
1
star
24

AlzheimerPreprocess

Python
1
star
25

EEG_Basic

Java
1
star
26

leetcode

My solutions to all the questions on leetcode till 2013
1
star
27

HaohanWang.github.io

github io
SCSS
1
star
28

ActiveLearningForP53

This is the code for our Active Learning Course Project
Java
1
star
29

ADMM

Python
1
star
30

PersonalWebsite

HTML
1
star
31

EmotionClassificationBatchEffect

Python
1
star
32

backProjection

Python
1
star
33

MultiClientSocket

Socket examples for cpp client, java client, php client to connect with cpp server
C++
1
star
34

ATDataPreprocessing

Python
1
star
35

graphicalLasso

solution for PGM 17 HW2 Q2, PGM20, HW3 Q3
Python
1
star