Creating a data-set for a classification or segmentation task. If an annotation file is present, the annotations are also prepared. The data-set is created based on an image-set.
Image-sets are collected images to build a data-set from, stored in the imagesets
folder.
The imagesets
folder contains the following folder structure:
- imagesets/[image_set_type]/[image_set_name]
Inside the [image_set_name]
folder are the following files / folders
test/
: test images (benchmark)trainval/
: training and validation images for cross validationcategories.txt
: all categories (classes) the image-set contains
Data-sets are stored in the datasets
base folder.
The datasets
folder contains the following folder structure:
- datasets/[data_set_type]/[data_set_name]
where
[data_set_type]
is the same as the corresponding[image_set_type]
and[data_set_name]
is the same as the corresponding[image_set_name]
.
Inside the [data_set_name]
folder are the following files / folders
test/
: test set (benchmark)train/
: training setval/
: validation setcategories.txt
: all categories (classes) the data-set contains
Segmentation data-set can be created from a segmentation image-set. All images are validated against the annotations, if they contain at least one annotation and that the annotation category belongs to one of the given categories. The annotations have to be in VIA v1 json format. Rectangle annotations are converted into polygon annotations for unique segment generation.