AI Image Signal Processing and Computational Photography: Bokeh Rendering, Reversed ISP Challenge, Model-Based Image Signal Processors via Learnable Dictionaries. Official repo for the NTIRE and AIM Challenges.

AI Image Signal Processing and Computational Photography

Deep learning for low-level computer vision and imaging


Marcos V. Conde, Radu Timofte

Computer Vision Lab, CAIDAS, University of Würzburg


Topics: This repository contains material for RAW image processing, RAW image reconstruction and synthesis, learned Image Signal Processing (ISP), image enhancement and restoration (denoising, deblurring), multi-lens bokeh effect rendering, and much more! 📷


Official repository for the following works:

  1. Efficient Multi-Lens Bokeh Effect Rendering and Transformation at CVPR NTIRE 2023.
  2. Perceptual Image Enhancement for Smartphone Real-Time Applications (LPIENet) at WACV 2023.
  3. Reversed Image Signal Processing and RAW Reconstruction. AIM 2022 Challenge Report at the ECCV 2022 AIM workshop.
  4. Model-Based Image Signal Processors via Learnable Dictionaries at AAAI 2022 (Oral).
  5. MAI 2022 Learned ISP Challenge: complete baseline solution.
  6. Citation and Acknowledgement | Contact for any inquiries.

News 🚀🚀

  • We will try to keep the repo updated on a monthly basis ✏️
  • [06/2023] Lens-to-lens bokeh effect transformation and NTIRE 2023 material coming soon.
  • [01/2023] LPIENet material is out.
  • [10/2022] Reversed ISP and RAW Reconstruction material presented at the AIM workshop, ECCV 2022, is now available!

Perceptual Image Enhancement for Smartphone Real-Time Applications (WACV '23)

This work was presented at the IEEE/CVF Winter Conference on Applications of Computer Vision (WACV) 2023.

Recent advances in camera designs and imaging pipelines allow us to capture high-quality images using smartphones. However, due to the small size and lens limitations of the smartphone cameras, we commonly find artifacts or degradation in the processed images e.g., noise, diffraction artifacts, blur, and HDR overexposure. We propose LPIENet, a lightweight network for perceptual image enhancement, with the focus on deploying it on smartphones.

The code is available at lpienet, including versions in PyTorch and TensorFlow. We also include the model conversion to TFLite, so you can generate the corresponding .tflite file and run the model using the AI Benchmark app on Android devices. In lpienet-tflite.ipynb you can find a complete tutorial on converting the model to TFLite.

Contributions

  • The model can process 2K resolution images in under 1 s on mid-level commercial smartphones.
  • We achieve competitive results compared with SOTA methods on relevant benchmarks for denoising, deblurring and HDR correction, for example the SIDD benchmark.
  • Compared with NAFNet, we reduce the number of MACs (or FLOPs) by a factor of 50.
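
To make the MACs comparison concrete, the sketch below counts multiply-accumulate operations for a single 2D convolution and converts them to FLOPs using the common convention of two FLOPs per MAC. The function names `conv2d_macs` and `macs_to_flops` and the layer sizes are illustrative assumptions; they are not taken from LPIENet or NAFNet.

```python
def conv2d_macs(h_out, w_out, c_in, c_out, kernel_size, groups=1):
    """Multiply-accumulates for one 2D convolution (bias ignored)."""
    # Each output element needs k * k * (c_in / groups) multiply-accumulates.
    macs_per_output = kernel_size * kernel_size * (c_in // groups)
    return h_out * w_out * c_out * macs_per_output


def macs_to_flops(macs):
    """Common convention: one MAC counts as two FLOPs (a multiply and an add)."""
    return 2 * macs


# Hypothetical layer: 3x3 conv, 16 -> 16 channels, on a 1080 x 2048 feature map.
standard = conv2d_macs(1080, 2048, 16, 16, 3)
# Depthwise variant of the same layer (groups == channels).
depthwise = conv2d_macs(1080, 2048, 16, 16, 3, groups=16)

print(f"standard:  {standard / 1e9:.2f} GMACs")
print(f"depthwise: {depthwise / 1e9:.2f} GMACs")
print(f"reduction: {standard / depthwise:.0f}x")
```

Because FLOPs are a fixed multiple of MACs under this convention, a reduction factor is the same whichever unit is reported.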

Abstract

Recent advances in camera designs and imaging pipelines allow us to capture high-quality images using smartphones. However, due to the small size and lens limitations of the smartphone cameras, we commonly find artifacts or degradation in the processed images. The most common unpleasant effects are noise artifacts, diffraction artifacts, blur, and HDR overexposure. Deep learning methods for image restoration can successfully remove these artifacts. However, most approaches are not suitable for real-time applications on mobile devices due to their heavy computation and memory requirements.

In this paper, we propose LPIENet, a lightweight network for perceptual image enhancement, with the focus on deploying it on smartphones. Our experiments show that, with much fewer parameters and operations, our model can deal with the mentioned artifacts and achieve competitive performance compared with state-of-the-art methods on standard benchmarks. Moreover, to prove the efficiency and reliability of our approach, we deployed the model directly on commercial smartphones and evaluated its performance. Our model can process 2K resolution images under 1 second in mid-level commercial smartphones.





Model-Based Image Signal Processors via Learnable Dictionaries (AAAI '22 Oral)

This work was presented at the 36th AAAI Conference on Artificial Intelligence, Spotlight (15%)

See the project website for the poster, presentation and more information.

We propose a hybrid model-based and data-driven approach for modelling ISPs using learnable dictionaries. We explore RAW image reconstruction and improve downstream tasks such as RAW image denoising via RAW data augmentation and synthesis.


If you have implementation questions or you need qualitative samples for comparison, please contact me. You can download the figure/illustration of our method in mbispld.



AIM 2022 Reversed ISP Challenge

This work was presented at the European Conference on Computer Vision (ECCV) 2022, AIM workshop.

Track 1 - S7 | Track 2 - P20


In this challenge, we look for solutions to recover RAW readings from the camera using only the corresponding RGB images processed by the in-camera ISP. Successful solutions should generate plausible RAW images, so that downstream tasks such as denoising, super-resolution or colour constancy can benefit from such synthetic data generation.
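
As a point of reference for what recovering RAW from RGB involves, here is a deliberately naive, hand-crafted sketch: it inverts the standard sRGB transfer curve and then samples an RGGB Bayer pattern. It ignores white balance, colour correction, tone mapping and noise, which are the parts a learned solution has to model. The function names and the RGGB layout are assumptions for illustration only; this is not the challenge baseline.

```python
def srgb_to_linear(v):
    """Invert the standard sRGB transfer function for a value in [0, 1]."""
    if v <= 0.04045:
        return v / 12.92
    return ((v + 0.055) / 1.055) ** 2.4


def naive_rgb_to_bayer(rgb):
    """Turn an H x W image of (r, g, b) tuples into an H x W RGGB mosaic."""
    # Channel index kept at each position of the 2x2 RGGB tile (assumed layout).
    pattern = ((0, 1), (1, 2))
    mosaic = []
    for y, row in enumerate(rgb):
        out_row = []
        for x, pixel in enumerate(row):
            channel = pattern[y % 2][x % 2]
            out_row.append(srgb_to_linear(pixel[channel]))
        mosaic.append(out_row)
    return mosaic


# Tiny 2x2 example image with values in [0, 1].
image = [
    [(1.0, 0.5, 0.0), (0.2, 0.4, 0.6)],
    [(0.0, 0.0, 0.0), (0.3, 0.3, 1.0)],
]
raw = naive_rgb_to_bayer(image)
print(raw)
```

Each output position keeps only one colour sample, so two thirds of the RGB information is discarded; a real reversed ISP must also undo the camera-specific colour and tone processing.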

Starter guide and code 🔥

  • aim-starter-code.ipynb - Simple dataloading and visualization of RGB-RAW pairs + other utils.
  • aim-baseline.ipynb - End-to-end guide to load the data, train a simple UNet model and make your first submission!

MAI 2022 Learned ISP Challenge

In mai22-learnedisp you can find an end-to-end baseline: data loading, training the top solution, and model conversion to TFLite. The model achieved 23.46 dB PSNR after training for a few hours. Here you can see a sample RAW input and the resultant RGB.
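
For readers unfamiliar with the metric, PSNR is derived from the mean squared error between the predicted and reference images: PSNR = 10 · log10(MAX² / MSE). The sketch below is a minimal version that assumes flat sequences of pixel values in [0, 1]; the helper name `psnr` is ours, and the challenge's official scoring script may differ in details such as value range or averaging.

```python
import math


def psnr(prediction, target, max_value=1.0):
    """Peak signal-to-noise ratio in dB between two equal-length sequences."""
    if len(prediction) != len(target):
        raise ValueError("inputs must have the same number of values")
    mse = sum((p - t) ** 2 for p, t in zip(prediction, target)) / len(target)
    if mse == 0:
        return math.inf  # identical inputs
    return 10.0 * math.log10(max_value ** 2 / mse)


# Toy example: every pixel is off by 0.1, so the MSE is about 0.01.
target = [0.2, 0.5, 0.8, 0.4]
prediction = [v + 0.1 for v in target]
print(f"{psnr(prediction, target):.2f} dB")
```

Under the same [0, 1] assumption, a score of 23.46 dB corresponds to an MSE of roughly 0.0045.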

We test the model on AI Benchmark. The average latency is 60 ms for an input RAW image of shape (544, 960, 4), generating an RGB image of shape (1088, 1920, 3), on a mid-level smartphone (45.4 AI-score) using the GPU delegate and FP16.
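
The input and output shapes are consistent with a packed Bayer input: a 1088 × 1920 single-channel mosaic rearranged into four half-resolution planes gives 544 × 960 × 4, and the RGB output is at the full 1088 × 1920 resolution. The sketch below shows that packing on nested lists. The RGGB ordering and the names `pack_bayer` and `shape` are assumptions; the actual data loader in mai22-learnedisp may differ.

```python
def pack_bayer(mosaic):
    """Rearrange an H x W Bayer mosaic into an (H/2) x (W/2) x 4 nested list.

    Each 2x2 tile becomes one 4-channel pixel, in row-major order within the tile.
    """
    height, width = len(mosaic), len(mosaic[0])
    if height % 2 or width % 2:
        raise ValueError("mosaic height and width must be even")
    packed = []
    for y in range(0, height, 2):
        row = []
        for x in range(0, width, 2):
            row.append([
                mosaic[y][x],          # top-left     (R in an RGGB layout)
                mosaic[y][x + 1],      # top-right    (G)
                mosaic[y + 1][x],      # bottom-left  (G)
                mosaic[y + 1][x + 1],  # bottom-right (B)
            ])
        packed.append(row)
    return packed


def shape(nested):
    """Return (rows, cols, channels) of a packed nested list."""
    return len(nested), len(nested[0]), len(nested[0][0])


# A 4x6 toy mosaic whose values encode their own position (10 * row + col).
toy = [[10 * y + x for x in range(6)] for y in range(4)]
packed = pack_bayer(toy)
print(shape(packed))
print(packed[0][0])
```

With an array library the same rearrangement is usually written as four strided slices stacked along a channel axis.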


Citation and Acknowledgement

@inproceedings{conde2022model,
  title={Model-Based Image Signal Processors via Learnable Dictionaries},
  author={Conde, Marcos V and McDonagh, Steven and Maggioni, Matteo and Leonardis, Ales and P{\'e}rez-Pellitero, Eduardo},
  booktitle={Proceedings of the AAAI Conference on Artificial Intelligence},
  volume={36},
  number={1},
  pages={481--489},
  year={2022}
}

@inproceedings{conde2022aim,
  title={{R}eversed {I}mage {S}ignal {P}rocessing and {RAW} {R}econstruction. {AIM} 2022 {C}hallenge {R}eport},
  author={Conde, Marcos V and Timofte, Radu and others},
  booktitle={Proceedings of the European Conference on Computer Vision Workshops (ECCVW)},
  year={2022}
}

Contact

Marcos Conde ([email protected]) is the contact person and a co-organizer of the NTIRE and AIM challenges.
