The 'Do not draw a penis?' Dataset
In 2018 Google open-sourced the Quick, Draw! dataset. โThe world's largest doodling datasetโ. The set consists of 345 categories and over 15 million drawings. For obvious reasons the dataset was missing a few specific categories that people seem to enjoy drawing. This made us at Moniker think about the moral reality big tech companies are imposing on our global community and that most people willingly accept this. Therefore we decided to publish an appendix to the Google Quickdraw dataset.
So far we have collected 25,000 doodles formatted the same way as Google's dataset. We are happy to announce you can download them here. We have collected the first 10,000 doodles using Amazon's Mechanical Turk, which were drudgingly audited by the staff here at Moniker.
In June of 2019 we released the Do Not Draw a Penis project to collect inappropriate doodles from people who are not willing to stay within the moral guidelines set by our social network providers. It has helped us to collect another 250,000 doodles of which we have marked 15,000 suitable for this appendix.
Dataset's provided
Similar to Google's QuickDraw dataset, we offer the data in the following forms. More information on how to interpret this data can be found here.
- Raw files (
.ndjson
) - Simplified drawings files (
.ndjson
)
Relevant Locations
Technologies
Data collection:
- Amazon's Mechanical Turk
- Do not draw
Credits
Concept & development by Moniker Luna Maurer & Roel Wouters
Commissioners
Mozilla, Brett Gaylor HKW, Daniel Neugebauer
Technical Development
Moniker, Tjerk Woudsma, Thomas Boland, Jae Perris
License
This data made available by Moniker under the Creative Commons Attribution 4.0 International license.