不知道是不是参数的意义没有理解对,我是直接照搬了mmdet的Albu的配置, 注释的部分加上就会报错,因为基本是照搬的可能有些我不知道的问题()
albu_train_transforms = [
dict(type='ShiftScaleRotate', shift_limit=0.0625,
scale_limit=0.15, rotate_limit=15, p=0.4),
dict(type='RandomBrightnessContrast', brightness_limit=0.2,
contrast_limit=0.2, p=0.5),
dict(
type="OneOf",
transforms=[
dict(type="GaussianBlur", p=1.0, blur_limit=7),
dict(type="MedianBlur", p=1.0, blur_limit=7),
],
p=0.4,
),
]
cfg.train_pipeline = [
dict(type='LoadImageFromFile'),
dict(type='LoadAnnotations', reduce_zero_label=True),
dict(type='Resize', img_scale=(2048, 512), ratio_range=(0.5, 2.0)),
dict(type='RandomCrop', crop_size=cfg.crop_size, cat_max_ratio=0.75),
dict(
type='Albu',
transforms=albu_train_transforms,
# bbox_params=dict(
# type='BboxParams',
# format='pascal_voc',
# label_fields=['gt_labels'],
# min_visibility=0.0,
# filter_lost_elements=True),
keymap=dict(img='image', gt_semantic_seg='semantic_seg'),
update_pad_shape=False,
skip_img_without_anno=True),
dict(type='RandomFlip', prob=0.5),
dict(type='PhotoMetricDistortion'),
dict(type='Normalize', **cfg.img_norm_cfg),
dict(type='Pad', size=cfg.crop_size, pad_val=0, seg_pad_val=255),
dict(type='DefaultFormatBundle'),
dict(type='Collect', keys=['img', 'gt_semantic_seg']),
]
mmseg是可以使用Albu的增强的,建议题主先就你的配置show出来