Skip to main content

Dataset Adapters

Dataset adapters provide calibration and validation data to the compiler and accuracy measurement tools. Each adapter class corresponds to a task category and a dataset format.

See Deploy Custom Weights for how adapters are used in pipeline YAML files.


ObjDataAdapter

For object detection models. Supports COCO 2014/2017 and custom datasets in YOLO/COCO JSON formats.

Definition: $AXELERA_FRAMEWORK/ax_datasets/objdataadapter.py

FieldTypeDescription
data_dir_namestringDataset directory name, relative to the data root (default: data/)
label_typestringLabel format: YOLOv8, COCO JSON, COCO2017, COCO2014
ultralytics_data_yamlstringPath to an Ultralytics data.yaml, relative to data_dir_name. Auto-generates cal/val/labels. Cannot be used with cal_data, val_data, or labels.
cal_datastringCalibration data: directory with images or text file listing image paths
val_datastringValidation data: directory with images or text file listing image paths
labelsstringLabels file (YAML or .names), relative to data_dir_name
repr_imgs_dir_pathstringAbsolute path to a directory of representative calibration images. Alternative to cal_data.
download_yearstringCOCO dataset year: "2014" or "2017" (for built-in COCO support)
formatstringCOCO class format: default COCO-80, or "coco91" / "coco91-with-bg"
output_formatstringBounding box format: "xyxy" (default), "xywh", "ltwh"
is_label_image_same_dirboolTrue if images and labels are in the same directory (default: False)
[val|cal]_img_dir_namestringOverride image directory for val or cal, relative to data_dir_name

KptDataAdapter

Subclass of ObjDataAdapter for YOLO keypoint detection models. Uses COCO 2017 pose dataset.

Inherits all ObjDataAdapter fields. No additional fields.

SegDataAdapter

Subclass of ObjDataAdapter for YOLO instance segmentation models. Uses COCO 2017 segmentation dataset.

Inherits all ObjDataAdapter fields, plus:

FieldTypeDefaultDescription
is_mask_overlapboolTrueWhether masks are overlapped during evaluation
eval_with_letterboxboolTrueWhether to use letterbox resize during mask evaluation
mask_sizetuple(160, 160)Mask dimensions (height, width) during evaluation

TorchvisionDataAdapter

For classification models based on torchvision. Supports ImageNet-style datasets and standard torchvision datasets.

Definition: $AXELERA_FRAMEWORK/ax_datasets/torchvision.py

FieldTypeDefaultDescription
dataset_namestring"ImageFolder"torchvision dataset class to use

Beyond dataset_name, fields map to the corresponding torchvision dataset class arguments. Supported dataset classes:

ImageFolder

For custom datasets organized as root/class_name/image.jpg.

FieldTypeDefault
splitstring"train"
[val|cal]_datastring (required)
[val|cal]_index_pklstringNone
is_one_indexedboolFalse

ImageNet

FieldTypeDefault
splitstring"train"

MNIST / CIFAR10

FieldTypeDefault
trainboolTrue
downloadboolTrue

VOCDetection

FieldTypeDefault
yearstring"2011"
image_setstring"train"
downloadboolFalse

LFWPairs / LFWPeople

FieldTypeDefault
image_setstring"funneled"
downloadboolTrue
splitstring"test"

Caltech101

FieldTypeDefault
downloadboolFalse

See also