Fashionpedia Dataset
Fashionpedia is a new dataset which consists of two parts: (1) an ontology built by fashion experts containing 27 main apparel categories, 19 apparel parts, 294 fine-grained attributes and their relationships; (2) a dataset with everyday and celebrity event fashion images annotated with segmentation masks and their associated per-mask fine-grained attributes, built upon the Fashionpedia ontology.
Find out more information about Fashionpedia at below links:
- Fashionpedia python API for reading, visualizing annotations, and result evaluation
- Fashionpedia project page
Download
CVDF hosts the images and annotations in the Fashionpedia dataset.
Images
Annotations
Detection: apparel object instance segmentation with localized attributes prediction:
Global attributes prediction:
- attributes_train2020
- attributes_val2020
- test_images_info2020: same as detection task
Annotation format
We follow the annotation format of the COCO dataset with additonal fields, such as attributes. The annotations are stored in the JSON format and are organized as follows:
Detection task (instances_attributes)
{
"info": info,
"categories": [category],
"attributes": [attribute],
"images": [image],
"annotations": [annotation],
"licenses": [license]
}
info{
"year" : int,
"version" : str,
"description" : str,
"contributor" : str,
"url" : str,
"date_created" : datetime,
}
category{
"id" : int,
"name" : str,
"supercategory" : str, # parent of this label
"level": int, # levels in the taxonomy
"taxonomy_id": string,
}
attribute{
"id" : int,
"name" : str,
"supercategory" : str, # parent of this label
"level": int, # levels in the taxonomy
"taxonomy_id": string,
}
image{
"id" : int,
"width" : int,
"height" : int,
"file_name" : str,
"license" : int,
"time_captured": string,
"original_url": string,
"isstatic": int, 0: the original_url is not a static url,
"kaggle_id": str,
}
annotation{
"id" : int,
"image_id" : int,
"category_id" : int,
"attribute_ids": [int],
"segmentation" : [polygon] or [rle]
"bbox" : [x,y,width,height], # int
"area" : int
"iscrowd": int (1 or 0)
}
polygon: [x1, y1, x2, y2, ...], where x, y are the coordinates of vertices, int
rle: {"size", (height, widht), "counts": str}
license{
"id" : int,
"name" : str,
"url" : str
}
Global attribute prediction task (attributes)
{
"info": info,
"attributes": [attribute],
"images": [image],
"annotations": [annotation],
"licenses": [license]
}
annotation{
"image_id" : int,
"attribute_ids": [int],
}
# other fields follow the same format as detection task