Awesome Edge Machine Learning

A curated list of awesome edge machine learning resources, including research papers, inference engines, challenges, books, meetups and others.

Papers

Applications

There is a countless number of possible edge machine learning applications. Here, we collect papers that describe specific solutions.

Automated machine learning (AutoML) is the process of automating the end-to-end process of applying machine learning to real-world problems.^Wikipedia AutoML is for example used to design new efficient neural architectures with a constraint on a computational budget (defined either as a number of FLOPS or as an inference time measured on real device) or a size of the architecture.

Efficient Architectures

Efficient architectures represent neural networks with small memory footprint and fast inference time when measured on edge devices.

Federated Learning

Federated Learning enables mobile phones to collaboratively learn a shared prediction model while keeping all the training data on device, decoupling the ability to do machine learning from the need to store the data in the cloud.^{Google AI blog: Federated Learning}

ML Algorithms For Edge

Standard machine learning algorithms are not always able to run on edge devices due to large computational requirements and space complexity. This section introduces optimized machine learning algorithms.

Network Pruning

Pruning is a common method to derive a compact network – after training, some structural portion of the parameters is removed, along with its associated computations.^{Importance Estimation for Neural Network Pruning}

Others

This section contains papers that are related to edge machine learning but are not part of any major group. These papers often deal with deployment issues (i.e. optimizing inference on target platform).

Quantization

Quantization is the process of reducing a precision (from 32 bit floating point into lower bit depth representations) of weights and/or activations in a neural network. The advantages of this method are reduced model size and faster model inference on hardware that support arithmetic operations in lower precision.

Datasets

Visual Wake Words Dataset

Visual Wake Words represents a common microcontroller vision use-case of identifying whether a person is present in the image or not, and provides a realistic benchmark for tiny vision models. Within a limited memory footprint of 250 KB, several state-of-the-art mobile models achieve accuracy of 85-90% on the Visual Wake Words dataset.

Inference Engines

List of machine learning inference engines and APIs that are optimized for execution and/or training on edge devices.

dabnn

Source code: https://github.com/JDAI-CV/dabnn
JDAI Computer Vision

MCU and MPU Software Packages

List of software packages for AI development on MCU and MPU

FP-AI-Sensing

STM32Cube function pack for ultra-low power IoT node with artificial intelligence (AI) application based on audio and motion sensing

FP-AI-VISION1

FP-AI-VISION1 is an STM32Cube function pack featuring examples of computer vision applications based on Convolutional Neural Network (CNN)

Processor SDK Linux for AM57x

TIDL software framework leverages a highly optimized neural network implementation on TI’s Sitara AM57x processors, making use of hardware acceleration on the device

X-LINUX-AI-CV

X-LINUX-AI-CV is an STM32 MPU OpenSTLinux Expansion Package that targets Artificial Intelligence for computer vision applications based on Convolutional Neural Network (CNN)

e-AI Checker

Based on the output result from the translator, the ROM/RAM mounting size and the inference execution processing time are calculated while referring to the information of the selected MCU/MPU

e-AI Translator

Tool for converting Caffe and TensorFlow models to MCU/MPU development environment

eIQ Auto deep learning (DL) toolkit

The NXP eIQ™ Auto deep learning (DL) toolkit enables developers to introduce DL algorithms into their applications and to continue satisfying automotive standards

eIQ ML Software Development Environment

The NXP® eIQ™ machine learning software development environment enables the use of ML algorithms on NXP MCUs, i.MX RT crossover MCUs, and i.MX family SoCs. eIQ software includes inference engines, neural network compilers and optimized libraries

eIQ™ Software for Arm® NN Inference Engine

eIQ™ for Arm® CMSIS-NN

eIQ™ for Glow Neural Network Compiler

eIQ™ for TensorFlow Lite

AI Chips

List of resources about AI Chips

AI Chip (ICs and IPs)

A list of ICs and IPs for AI, Machine Learning and Deep Learning

Books

List of books with focus on on-device (e.g., edge or mobile) machine learning.

TinyML: Machine Learning with TensorFlow on Arduino, and Ultra-Low Power Micro-Controllers

Authors: Pete Warden, Daniel Situnayake
Published: 2020

Machine Learning by Tutorials: Beginning machine learning for Apple and iOS

Author: Matthijs Hollemans
Published: 2019

Core ML Survival Guide

Author: Matthijs Hollemans
Published: 2018

Building Mobile Applications with TensorFlow

Author: Pete Warden
Published: 2017

Challenges

Low Power Recognition Challenge (LPIRC)

Competition with focus on the best vision solutions that can simultaneously achieve high accuracy in computer vision and energy efficiency. LPIRC is regularly held during computer vision conferences (CVPR, ICCV and others) since 2015 and the winners’ solutions have already improved 24 times in the ratio of accuracy divided by energy.

Other Resources

Awesome EMDL

Embedded and mobile deep learning research resources

Awesome Pruning

A curated list of neural network pruning resources

Efficient DNNs

Collection of recent methods on DNN compression and acceleration

Machine Think

Machine learning tutorials targeted for iOS devices

Pete Warden's blog

Contribute

Unlike other awesome list, we are storing data in YAML format and markdown files are generated with awesome.py script.

Every directory contains data.yaml which stores data we want to display and config.yaml which stores its metadata (e.g. way of sorting data). The way how data will be presented is defined in renderer.py.

LicenseBlock

To the extent possible under law, Bisonai has waived all copyright and related or neighboring rights to this work.

Reviews

Repository Details

Awesome Edge Machine Learning

Table of Contents

Papers

Datasets

Inference Engines

Arm Compute Library

Bender

Caffe 2

CoreML

Deeplearning4j

Embedded Learning Library

Feather CNN

MACE

MNN

MXNet

NCNN

Neural Networks API

Paddle Mobile

Qualcomm Neural Processing SDK for AI

Tengine

TensorFlow Lite