Overview

Data augmentation is a popular technique used to improve generalization performance. These strategies include rotation, scaling, flipping, cropping, and brightness or color adjustments.

It's a very reliable way to increase the size of small datasets and to encourage robust knowledge representations.

Different datasets and benchmarks have different standard transformations. The most common is mean standard normalization. These values are calculated over the training set. CIFAR projects tend to include random crop and random horizontal flip.

Let's use a single image to demonstrate some transforms.

Torchvision has a ton of these kinds of augmentations available.

AutoAugment implements a combination of random augmentations that were found to be the most optimal for different datasets. Torchvision includes policies for ImageNet, CIFAR10, and SVHN.