The BlazeFace face detector model implemented in PyTorch

BlazeFace in Python

BlazeFace is a fast, lightweight face detector from Google Research. For details, see the paper on arXiv.

A pretrained model is available as part of Google's MediaPipe framework.

Besides a bounding box, BlazeFace also predicts 6 keypoints for face landmarks (2x eyes, 2x ears, nose, mouth).

Because BlazeFace is designed for use on mobile devices, the pretrained model is in TFLite format. However, I wanted to use it from PyTorch and so I converted it.

NOTE: The MediaPipe model is slightly different from the model described in the BlazeFace paper. It uses depthwise convolutions with a 3x3 kernel, not 5x5. And it only uses "single" BlazeBlocks, not "double" ones.

The BlazeFace paper mentions that there are two versions of the model, one for the front-facing camera and one for the back-facing camera. This repo includes only the frontal camera model, as that is the only one I was able to find an official trained version for. The difference between the two models is the dataset they were trained on. As the paper says,

For the frontal camera model, only faces that occupy more than 20% of the image area were considered due to the intended use case (the threshold for the rear-facing camera model was 5%).

This means the included model will not be able to detect faces that are relatively small. It's really intended for selfies, not for general-purpose face detection.
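To get a feel for what the 20% figure means, the sketch below computes the fraction of the image that a normalized bounding box covers. The function name `box_area_fraction` is hypothetical and not part of this repo, and the threshold applies to the training data, so treat the result as a rough guide to whether a face is in the size range the model saw, not as a hard cutoff.

```python
def box_area_fraction(ymin: float, xmin: float, ymax: float, xmax: float) -> float:
    """Return the fraction of the image area covered by a normalized box.

    Coordinates are assumed to be normalized to the range 0..1, in the
    order ymin, xmin, ymax, xmax. Parts of the box that fall outside the
    image are clipped before the area is computed.
    """
    height = max(0.0, min(ymax, 1.0) - max(ymin, 0.0))
    width = max(0.0, min(xmax, 1.0) - max(xmin, 0.0))
    return height * width


# A box spanning the middle half of the image in both directions
# covers a quarter of the image area.
fraction = box_area_fraction(0.25, 0.25, 0.75, 0.75)
print(f"face covers {fraction:.0%} of the image")
```

A face whose box covers much less than a fifth of the image is smaller than anything the frontal model was trained on, so a missed detection in that case is expected.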

Inside this repo

Essential files:

  • blazeface.py: defines the BlazeFace class that does all the work

  • blazeface.pth: the weights for the trained model

  • anchors.npy: lookup table with anchor boxes

Notebooks:

  • Anchors.ipynb: creates anchor boxes and saves them as a binary file (anchors.npy)

  • Convert.ipynb: loads the weights from the TFLite model and converts them to PyTorch format (blazeface.pth)

  • Inference.ipynb: shows how to use the BlazeFace class to make face detections

Detections

Each face detection is a PyTorch Tensor consisting of 17 numbers:

  • The first 4 numbers describe the bounding box corners:

    • ymin, xmin, ymax, xmax
    • These are normalized coordinates (between 0 and 1).
  • The next 12 numbers are the x,y-coordinates of the 6 facial landmark keypoints:

    • right_eye_x, right_eye_y
    • left_eye_x, left_eye_y
    • nose_x, nose_y
    • mouth_x, mouth_y
    • right_ear_x, right_ear_y
    • left_ear_x, left_ear_y
    • Tip: these are labeled as seen from the perspective of the person, so their right is your left.
  • The final number is the confidence score that this detection really is a face.
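The layout above can be unpacked as in the following minimal sketch. The names `Face`, `KEYPOINT_NAMES`, and `unpack_detection` are hypothetical helpers, not part of `blazeface.py`. The sketch works on a plain sequence of 17 floats (for a PyTorch Tensor, `detection.tolist()` gives one), and it assumes the keypoints use the same normalized 0..1 scale as the bounding box.

```python
from typing import Dict, NamedTuple, Sequence, Tuple

# Order of the 6 keypoints, as listed in the detection layout.
KEYPOINT_NAMES = ("right_eye", "left_eye", "nose", "mouth", "right_ear", "left_ear")


class Face(NamedTuple):
    box: Tuple[float, float, float, float]      # ymin, xmin, ymax, xmax in pixels
    keypoints: Dict[str, Tuple[float, float]]   # name -> (x, y) in pixels
    score: float                                # confidence that this is a face


def unpack_detection(det: Sequence[float], width: int, height: int) -> Face:
    """Split one 17-number detection into box, keypoints, and score.

    Assumption: keypoints are normalized like the box, so both are
    scaled by the image size to get pixel coordinates.
    """
    if len(det) != 17:
        raise ValueError(f"expected 17 numbers, got {len(det)}")

    ymin, xmin, ymax, xmax = det[0:4]
    box = (ymin * height, xmin * width, ymax * height, xmax * width)

    keypoints = {}
    for i, name in enumerate(KEYPOINT_NAMES):
        x = det[4 + 2 * i]
        y = det[4 + 2 * i + 1]
        keypoints[name] = (x * width, y * height)

    return Face(box=box, keypoints=keypoints, score=det[16])


# Made-up detection values, for illustration only.
example = [0.25, 0.5, 0.75, 1.0] + [0.5, 0.25] * 6 + [0.9]
face = unpack_detection(example, width=128, height=128)
print(face.box, face.keypoints["nose"], face.score)
```

Passing the size of the original image as `width` and `height` instead of 128 maps the detection back onto the full-resolution picture, provided it was scaled down without cropping.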

Image credits

Included for testing are the following images:

  • 1face.png. Fei-Fei Li by ITU Pictures, CC BY 2.0

  • 3faces.png. Geoffrey Hinton, Yoshua Bengio, Yann LeCun. Found at AIBuilders

  • 4faces.png from Andrew Ng’s Facebook page / KDnuggets

These images were scaled down to 128x128 pixels as that is the expected input size of the model.
