• Stars
    star
    140
  • Rank 261,473 (Top 6 %)
  • Language
    Jupyter Notebook
  • Created about 6 years ago
  • Updated almost 5 years ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Predictive Filter Flow for fully/self-supervised learning on various vision tasks

Learning with Predictive Filter Flow

This github repository contains a series demonstrations of learning with Predictive Filter Flow (PFF) for various vision tasks. PFF is a framework not only supporting self/fully/un-supervised learning on images and videos, but also providing better interpretability that one is able to track every single pixel's movement and its kernels in constructing the output. Here is a list of specific applications (click the link to visit each webpage):

  1. image based application: deblur, denoising, defocus, super-resolution, day-night image tranlsation, etc;
  2. video based application: instance tracking, pose tracking, video transition shot detection, frame interpolation, long-range flow learning, style transfer, etc.
PFF mgPFF
splash figure splash figure
bibtex bibtex

Image Reconstruction with Predictive Filter Flow

For paper, slides and poster, please refer to our project page

We propose a simple, interpretable framework for solving a wide range of image reconstruction problems such as denoising and deconvolution. Given a corrupted input image, the model synthesizes a spatially varying linear filter which, when applied to the input image, reconstructs the desired output. The model parameters are learned using supervised or self-supervised training. We test this model on three tasks: non-uniform motion blur removal, lossy-compression artifact reduction and single image super resolution. We demonstrate that our model substantially outperforms state-of-the-art methods on all these tasks and is significantly faster than optimization-based approaches to deconvolution. Unlike models that directly predict output pixel values, the predicted filter flow is controllable and interpretable, which we demonstrate by visualizing the space of predicted filters for different tasks.

keywords: inverse problem, spatially-variant blind deconvolution, low-level vision, non-uniform motion blur removal, compression artifact reduction, single image super-resolution, filter flow, interpretable model, per-pixel twist, self-supervised learning, image distribution learning.

If you find anything provided here inspires you, please cite our arxiv paper (hig-resolution draft pdf, 44Mb):

splash figure


Multigrid Predictive Filter Flow

For paper, slides and poster, please refer to our project page

We introduce multigrid Predictive Filter Flow (mgPFF), a framework for unsupervised learning on videos. The mgPFF takes as input a pair of frames and outputs per-pixel filters to warp one frame to the other. Compared to optical flow used for warping frames, mgPFF is more powerful in modeling sub-pixel movement and dealing with corruption (e.g., motion blur). We develop a multigrid coarse-to-fine modeling strategy that avoids the requirement of learning large filters to capture large displacement. This allows us to train an extremely compact model (4.6MB) which operates in a progressive way over multiple resolutions with shared weights. We train mgPFF on unsupervised, free-form videos and show that mgPFF is able to not only estimate long-range flow for frame reconstruction and detect video shot transitions, but also readily amendable for video object segmentation and pose tracking, where it substantially outperforms the published state-of-the-art without bells and whistles. Moreover, owing to mgPFF's nature of per-pixel filter prediction, we have the unique opportunity to visualize how each pixel is evolving during solving these tasks, thus gaining better interpretability.

keywords: Unsupervised Learning, Multigrid Computing, Long-Range Flow, Video Segmentation, Instance Tracking, Pose Tracking, Video Shot/Transition Detection, Optical Flow, Filter Flow, Low-level Vision.

If you find anything provided here inspires you, please cite our arxiv paper

dog soccerball
splash figure

last update: 04/01/2019

Shu Kong

issues/questions addressed here: aimerykong A-t. g-m.a.-i-l d.o.t/ c.o--m

More Repositories

1

deepImageAestheticsAnalysis

ECCV2016 - fine-grained photo aesthetics rating with interpretability
MATLAB
296
star
2

Low-Rank-Bilinear-Pooling

CVPR2017 - an ultra-compact bilinear model for fine-grained classification
C++
149
star
3

Recurrent-Pixel-Embedding-for-Instance-Grouping

CVPR2018 - pixel embedding & grouping for structured prediction, e.g., instance segmentation
MATLAB
144
star
4

OpenGAN

ICCV2021 - training a post-hoc lightweight GAN-discriminator for open-set recognition
Jupyter Notebook
113
star
5

Pixel-Attentional-Gating

Pixel Attentional Gating for Parsimonious Per-Pixel Labeling
MATLAB
46
star
6

Recurrent-Scene-Parsing-with-Perspective-Understanding-in-the-loop

CVPR2018 - scene parsing network regulated by geometric prior
MATLAB
37
star
7

Dimensional-Emotion-Analysis-of-Facial-Expression

MATLAB
11
star
8

pollenDetClsSystem

Automated Recognition and Counting of Pollen Grain Species from Field Samples
Jupyter Notebook
8
star
9

deepPollen

PNAS 2020 - a pioneering work that greatly enhances plant ecological & evolutionary research using pollen data -- NSF News
MATLAB
4
star
10

wormSegCountSystem

MATLAB
2
star
11

simpleUI_wormAnnotation

a simple UI for annotating C. elegans in images
MATLAB
2
star
12

PatchMatchingForPollenIdentification

Selecting discriminative patches to match pollen grains for identification.
MATLAB
2
star
13

Identify-Fossil-Pollen-with-Modern-Reference

Match patches with modern pollen grains to identify fossilized grains.
MATLAB
2
star
14

modern_pollen_24wayCls

Joint detection, segmentation and classification with multiplicative architecture.
Jupyter Notebook
2
star
15

Submodular-Exemplar-Selection

The component of submodular exemplar selection for discriminative dictionary to identify pollen grains
MATLAB
2
star
16

MidFea_NSlayer

Learning Mid-Level Features and Modeling Neuron Selectivity for Image Classification
MATLAB
1
star
17

videoSeg

MATLAB
1
star