A strategy employed in machine learning to enhance the size and quality of training datasets, thereby improving the performance and generalizability of models. It involves creating modified copies of existing data or generating new data points.
For a computer vision model trained on images of dogs, data augmentation could involve transforming the images to simulate different lighting conditions, angles, or breeds.