  • Language: Shell
  • License: MIT License

Repository Details

A hybrid dataset for document unwarping (Paper: https://www3.cs.stonybrook.edu/~cvl/projects/dewarpnet/storage/paper.pdf)

doc3D

Doc3D is the first 3D dataset focused on document unwarping with realistic paper warping and renderings.

It contains 100k images with the following ground-truths:

  • 3D Coordinates
  • Depth
  • UV
  • Backward Mapping
  • Albedo
  • Normals
  • Checkerboard

This repository contains all the bash scripts needed to download the dataset:

  • To download the dataset you need to obtain a username and password. Please fill out the Google Form to request one.
  • Set the assigned username and password in the download scripts, on the lines: local uname=**** local pass=****
  • To download the entire dataset at once (in the default directory $HOME/Downloads/doc3d), use the following command:
    • bash download_doc3d.sh
  • To download into a specific directory:
    • bash download_doc3d.sh <out_dir>
  • Individual bash scripts are provided to download a specific part of the data. The following command downloads all the image files into <out_dir>/doc3d/img/:
    • bash download_img.sh <out_dir>
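
After a download finishes, a quick sanity check is to count the files under the image directory. The sketch below assumes the layout described above (images under <out_dir>/doc3d/img/, with $HOME/Downloads as the default <out_dir>). The helper names doc3d_img_dir and count_doc3d_images are hypothetical and not part of the repository.

```shell
#!/usr/bin/env bash
# Hypothetical helpers, not part of the doc3D repository.

# Print the directory where the image files are expected to land.
# Assumption: <out_dir> defaults to $HOME/Downloads, as for download_doc3d.sh.
doc3d_img_dir() {
  local out_dir="${1:-$HOME/Downloads}"
  # Strip one trailing slash so the path has no "//" in it.
  printf '%s\n' "${out_dir%/}/doc3d/img"
}

# Print the number of regular files found below the image directory.
# Prints 0 when the directory does not exist yet.
count_doc3d_images() {
  local img_dir
  img_dir="$(doc3d_img_dir "$@")"
  if [ ! -d "$img_dir" ]; then
    echo 0
    return 0
  fi
  # tr removes the padding that some wc implementations add.
  find "$img_dir" -type f | wc -l | tr -d ' '
}
```

Calling count_doc3d_images with no argument checks the default location; passing the same <out_dir> that was given to the download script checks a custom one.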

Rendering code is available: you can use the scripts here to render your own version of doc3D.

Some Notes:

  • A download can be interrupted and resumed later; wget keeps track of the progress.
  • Already downloaded files will be skipped and partially downloaded files will be resumed.
  • The scripts are tested on Linux and Mac. On Windows, a bash shell [probably-useful] should work.
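
The resume behaviour in these notes matches what wget's --continue option provides. The following is a minimal sketch of a resumable, authenticated download, not a copy of the repository's scripts, which may call wget differently. The function name fetch_resumable, the URL and the credentials are placeholders, and the server is assumed to support HTTP authentication and partial transfers.

```shell
#!/usr/bin/env bash
# Hypothetical helper illustrating a resumable download with wget.
# Arguments: <url> <out_dir> <username> <password>
fetch_resumable() {
  local url="$1" out_dir="$2" uname="$3" pass="$4"
  mkdir -p "$out_dir" || return 1
  # --continue resumes a partially downloaded file instead of restarting it;
  # a file that is already complete is not downloaded again.
  wget --continue \
       --user="$uname" --password="$pass" \
       --directory-prefix="$out_dir" \
       "$url"
}

# Example call (placeholder URL and credentials):
# fetch_resumable "https://example.org/doc3d/part.zip" "$HOME/Downloads/doc3d/img" "my_user" "my_pass"
```

Re-running the same call after an interruption picks up where the previous transfer stopped, provided the partial file is still in the output directory.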

Visualize Data:

Run the demo.py file to display a random image and its corresponding ground-truths. demo.py takes the following flags:

  • --data_root : Path to the doc3d dataset.
  • --folder : Specific folder to load the image from.
  • --download_sample : Download some samples and run demo.py on them. Useful if you want to visualize the data before downloading the entire dataset.
  • --unwarp : Unwarp input image using the ground-truth backward mapping.
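
As an illustration of the flags above, here is a small wrapper sketch around demo.py. It only uses --data_root and --folder, since the list does not say whether --download_sample and --unwarp take a value. The wrapper name run_doc3d_demo, the interpreter name python, and the example folder value are assumptions; run it from the directory that contains demo.py.

```shell
#!/usr/bin/env bash
# Hypothetical wrapper, not part of the doc3D repository.
# Arguments: [data_root] [folder]
run_doc3d_demo() {
  # Assumption: the dataset sits in the default download directory.
  local data_root="${1:-$HOME/Downloads/doc3d}"
  local folder="${2:-}"
  if [ -n "$folder" ]; then
    # Restrict the random pick to one folder of the dataset.
    python demo.py --data_root "$data_root" --folder "$folder"
  else
    python demo.py --data_root "$data_root"
  fi
}

# Example calls (the folder value is a placeholder):
# run_doc3d_demo
# run_doc3d_demo "/data/doc3d" "1"
```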

Release Updates:

  • Sep 16, 2019: v0.5 (36K images, no depth map)
  • Sep 17, 2019: v0.5.1 (Depth maps for v0.5 images)
  • Sep 21, 2019: Rendering code is now available!
  • Sep 22, 2019: v0.9 (65K images, no albedos)
  • Mar 11, 2020: Please send an email to request the meshes (.obj)

Citation:

If you use the dataset, please consider citing our work:

@inproceedings{SagnikKeICCV2019,
  Author = {Sagnik Das* and Ke Ma* and Zhixin Shu and Dimitris Samaras and Roy Shilkrot},
  Booktitle = {Proceedings of International Conference on Computer Vision},
  Title = {DewarpNet: Single-Image Document Unwarping With Stacked 3D and 2D Regression Networks},
  Year = {2019}
}

Acknowledgement:
