• Stars
    star
    299
  • Rank 134,665 (Top 3 %)
  • Language Bikeshed
  • License
    Other
  • Created almost 8 years ago
  • Updated about 1 month ago

Reviews

There are no reviews yet. Be the first to send feedback to the community and the maintainers!

Repository Details

Detection of shapes (faces, QR codes) in images

Shape Detection API Specification 🌠πŸŽ₯

This is the repository for shape-detection-api, an experimental API for detecting Shapes (e.g. Faces, Barcodes, Text) in live or still images on the Web by using accelerated hardware/OS resources.

You're welcome to contribute! Let's make the Web rock our socks off!

Introduction πŸ“˜

Photos and images constitute the largest chunk of the Web, and many include recognisable features, such as human faces, text or QR codes. Detecting these features is computationally expensive, but would lead to interesting use cases e.g. face tagging or detection of high saliency areas. Users interacting with WebCams or other Video Capture Devices have become accustomed to camera-like features such as the ability to focus directly on human faces on the screen of their devices. This is particularly true in the case of mobile devices, where hardware manufacturers have long been supporting these features. Unfortunately, Web Apps do not yet have access to these hardware capabilities, which makes the use of computationally demanding libraries necessary.

Use cases πŸ“·

QR/barcode/text detection can be used for:

  • user identification/registration, e.g. for voting purposes;
  • eCommerce, e.g. Walmart Pay;
  • Augmented Reality overlay, e.g. here;
  • Driving online-to-offline engagement, fighting fakes etc.

Face detection can be used for:

  • producing fun effects, e.g. Snapchat Lenses;
  • giving hints to encoders or auto focus routines;
  • user name tagging;
  • enhance accesibility by e.g. making objects appear larger as the user gets closer like HeadTrackr;
  • speeding up Face Recognition by indicating the areas of the image where faces are present.

Current Related Efforts and Workarounds πŸ”§

Some Web Apps -gasp- run Detection in Javascript. A performance comparison of some such libraries can be found here (note that this performance evaluation does not include e.g. WebCam image acquisition and/or canvas interactions).

Samsung Browser has a private API (click to unfold "Overview for Android", then search for "QR code reader").

TODO: compare a few JS/native libraries in terms of size and performance. A performance and detection comparison of some popular JS QR code scanners can be found here. zxingjs2 has a list of some additional JS libraries.

Android Native Apps usually integrate ZXing (which amounts to adding ~560KB when counting core.jar, android-core.jar and android-integration.jar)).

OCR reader in Javascript are north of 1MB of size ()

Potential for misuse πŸ’Έ

Face Detection is an expensive operation due to the algorithmic complexity. Many requests, or demanding systems like a live stream feed with a certain frame rate, could slow down the whole system or greatly increase power consumption.

Platform specific implementation notes πŸ’»

Overview

What platforms support what detector?

Encoder Mac Android Win10 Linux ChromeOs
Face sw hw/sw sw ✘ ✘
QR/Barcode sw sw ✘ ✘ ✘
Text sw sw sw ✘ ✘

Android

Android provides both a stand alone software face detector and a interface to the hardware ones.

API uses... Release notes
FaceDetector Software based using the Neven face detector API Level 1, 2008
Vision.Face Software based Google Play services 7.2, Aug 2015
Camera2 Hardware API Level 21/Lollipop, 2014
Camera.Face (old) Hardware API Level 14/Ice Cream Sandwich, 2011

The availability of the actual hardware detection depends on the actual chip; according to the market share in 1H 2016 Qualcomm, MediaTek, Samsung and HiSilicon are the largest individual OEMs and they all have support for Face Detection (all the top-10 phones are covered as well):

Barcode/QR and Text detection is available via Google Play Services barcode and text, respectively.

Mac OS X / iOS

Mac OS X/iOS provides CIDetector and Vision Framework for Face, QR, Text and Rectangle detection in software or hardware.

API uses... Release notes
Vision Framework, Mac OS X Software and Hardware OS X v10.13, 2017
Vision Framework, iOS Software and Hardware IOS X v11.0, 2017
CIDetector, Mac OS X Software OS X v10.7, 2011
CIDetector, iOS Software iOS v5.0, 2011
AVFoundation Hardware iOS 6.0, 2012

Apple has supported Face Detection in hardware since the Apple A5 processor introduced in 2011.

Windows

Windows 10 has a FaceDetector class and support for Text Detection OCR.

Rendered URL πŸ“‘

The rendered version of this site can be found in https://wicg.github.io/shape-detection-api (if that's not alive for some reason try the rawgit rendering).

Examples and demos

https://wicg.github.io/shape-detection-api/#examples

Notes on bikeshedding 🚴

To compile, run:

curl https://api.csswg.org/bikeshed/ -F [email protected] -F force=1 > index.html

if the produced file has a strange size (i.e. zero), then something went terribly wrong; run instead

curl https://api.csswg.org/bikeshed/ -F [email protected] -F output=err

and try to figure out why bikeshed did not like the .bs :'(

More Repositories

1

webcomponents

Web Components specifications
HTML
4,306
star
2

import-maps

How to control the behavior of JavaScript imports
JavaScript
2,636
star
3

virtual-scroller

1,997
star
4

focus-visible

Polyfill for `:focus-visible`
JavaScript
1,606
star
5

webusb

Connecting hardware to the web.
Bikeshed
1,287
star
6

webpackage

Web packaging format
Go
1,216
star
7

EventListenerOptions

An extension to the DOM event pattern to allow authors to disable support for preventDefault
JavaScript
1,166
star
8

portals

A proposal for enabling seamless navigations between sites or pages
HTML
945
star
9

floc

This proposal has been replaced by the Topics API.
Makefile
933
star
10

inert

Polyfill for the inert attribute and property.
JavaScript
914
star
11

scheduling-apis

APIs for scheduling and controlling prioritized tasks.
HTML
896
star
12

view-transitions

789
star
13

file-system-access

Expose the file system on the user’s device, so Web apps can interoperate with the user’s native applications.
Bikeshed
641
star
14

background-sync

A design and spec for ServiceWorker-based background synchronization
HTML
638
star
15

scroll-to-text-fragment

Proposal to allow specifying a text snippet in a URL fragment
HTML
577
star
16

ua-client-hints

Wouldn't it be nice if `User-Agent` was a (set of) client hints?
Bikeshed
575
star
17

aom

Accessibility Object Model
HTML
553
star
18

kv-storage

[On hold] A proposal for an async key/value storage API for the web
550
star
19

observable

Observable API proposal
Bikeshed
515
star
20

turtledove

TURTLEDOVE
Bikeshed
505
star
21

navigation-api

The new navigation API provides a new interface for navigations and session history, with a focus on single-page application navigations.
Makefile
474
star
22

webmonetization

Proposed Web Monetization standard
HTML
439
star
23

trust-token-api

Trust Token API
Bikeshed
412
star
24

attribution-reporting-api

Attribution Reporting API
Bikeshed
338
star
25

direct-sockets

Direct Sockets API for the web platform
HTML
304
star
26

display-locking

A repository for the Display Locking spec
HTML
294
star
27

background-fetch

API proposal for background downloading/uploading
Shell
279
star
28

resize-observer

This repository is no longer active. ResizeObserver has moved out of WICG into
HTML
256
star
29

first-party-sets

Bikeshed
255
star
30

serial

Serial ports API for the platform.
HTML
254
star
31

priority-hints

A browser API to enable developers signal the priorities of the resources they need to download.
Bikeshed
228
star
32

dbsc

HTML
227
star
33

is-input-pending

HTML
222
star
34

sanitizer-api

Bikeshed
213
star
35

proposals

A home for well-formed proposed incubations for the web platform. All proposals welcome.
209
star
36

spatial-navigation

Directional focus navigation with arrow keys
JavaScript
199
star
37

js-self-profiling

Proposal for a programmable JS profiling API for collecting JS profiles from real end-user environments
HTML
196
star
38

cq-usecases

Use cases and requirements for standardizing element queries.
HTML
185
star
39

interventions

A place for browsers and web developers to collaborate on user agent interventions.
178
star
40

visual-viewport

A proposal to add explicit APIs to the Web for querying and setting the visual viewport
HTML
174
star
41

frame-timing

Frame Timing API
HTML
170
star
42

layout-instability

A proposal for a Layout Instability specification
Makefile
157
star
43

page-lifecycle

Lifecycle API to support system initiated discarding and freezing
HTML
153
star
44

isolated-web-apps

Repository for explainers and other documents related to the Isolated Web Apps proposal.
Bikeshed
146
star
45

speech-api

Web Speech API
Bikeshed
144
star
46

cookie-store

Asynchronous access to cookies from JavaScript
Bikeshed
141
star
47

nav-speculation

Proposal to enable privacy-enhanced preloading
HTML
141
star
48

construct-stylesheets

API for constructing CSS stylesheet objects
Bikeshed
137
star
49

webhid

Web API for accessing Human Interface Devices (HID)
HTML
135
star
50

color-api

A proposal and draft spec for a Color object for the Web Platform, loosely influenced by the Color.js work. Heavily WIP, if you landed here randomly, please move along.
HTML
124
star
51

devtools-protocol

DevTools Protocol
JavaScript
120
star
52

fenced-frame

Proposal for a strong boundary between a page and its embedded content
Bikeshed
118
star
53

sms-one-time-codes

A way to format SMS messages for use with browser autofill features such as HTML’s autocomplete=one-time-code.
Makefile
109
star
54

bundle-preloading

Bundles of multiple resources, to improve loading JS and the Web.
HTML
103
star
55

netinfo

HTML
95
star
56

intrinsicsize-attribute

Proposal to add an intrinsicsize attribute to media elements
94
star
57

window-controls-overlay

HTML
94
star
58

container-queries

HTML
92
star
59

animation-worklet

🚫 Old repository for AnimationWorklet specification ➑️ New repository: https://github.com/w3c/css-houdini-drafts
Makefile
92
star
60

manifest-incubations

Before install prompt API for installing web applications
HTML
90
star
61

async-append

A way to create DOM and add it to the document without blocking the main thread.
HTML
87
star
62

privacy-preserving-ads

Privacy-Preserving Ads
86
star
63

indexed-db-observers

Prototyping and discussion around indexeddb observers.
WebIDL
83
star
64

shared-storage

Explainer for proposed web platform Shared Storage API
Bikeshed
82
star
65

compression

Standard text for CompressionStream and DecompressionStream API
HTML
81
star
66

file-handling

API for web applications to handle files
81
star
67

compression-dictionary-transport

80
star
68

canvas-color-space

Proposed web platform feature to add color management, wide gamut and high bit-depth support to the <canvas> element.
78
star
69

canvas-formatted-text

HTML
77
star
70

local-font-access

Web API for enumerating fonts on the local system
Bikeshed
75
star
71

performance-measure-memory

performance.measureMemory API
HTML
73
star
72

starter-kit

A simple starter kit for incubations
JavaScript
72
star
73

handwriting-recognition

Handwriting Recognition Web API Proposal
Makefile
72
star
74

css-parser-api

This is the repo where the CSS Houdini parser API will be worked on
HTML
72
star
75

ContentPerformancePolicy

A set of policies that a site guarantees to adhere to, browsers enforce, and embedders can count on.
HTML
72
star
76

web-app-launch

Web App Launch Handler
HTML
72
star
77

pwa-url-handler

71
star
78

eyedropper-api

HTML
70
star
79

idle-detection

A proposal for an idle detection and notification API for the web
Bikeshed
67
star
80

close-watcher

A web API proposal for watching for close requests (e.g. Esc, Android back button, ...)
Makefile
67
star
81

storage-foundation-api-explainer

Explainer showcasing a new web storage API, NativeIO
65
star
82

video-editing

64
star
83

uuid

UUID V4
63
star
84

client-hints-infrastructure

Specification for the Client Hints infrastructure - privacy preserving proactive content negotiation
Bikeshed
61
star
85

sparrow

59
star
86

element-timing

A proposal for an Element Timing specification.
Bikeshed
59
star
87

local-peer-to-peer

↔️ Proposal for local communication between browsers without the aid of a server.
Bikeshed
53
star
88

digital-credentials

Digital Credentials, like driver's licenses
HTML
53
star
89

video-rvfc

video.requestVideoFrameCallback() incubation
HTML
53
star
90

time-to-interactive

Repository for hosting TTI specification and discussions around it.
52
star
91

digital-goods

Makefile
49
star
92

private-network-access

HTML
49
star
93

raw-clipboard-access

An explainer for the Raw Clipboard Access feature
45
star
94

document-picture-in-picture

Bikeshed
45
star
95

admin

πŸ‘‹ Ask your questions here! πŸ‘‹
HTML
42
star
96

soft-navigations

Heuristics to detect Single Page Apps soft navigations
Bikeshed
42
star
97

pending-beacon

A better beaconing API
Bikeshed
40
star
98

webcrypto-secure-curves

Proposal for the addition of Curve25519 and Curve448 to the Web Cryptography API
HTML
40
star
99

entries-api

Spec defining browser support for file/directory upload by drag-and-drop
Bikeshed
40
star
100

transfer-size

38
star