RT info:eu-repo/semantics/article T1 Diffeomorphic transforms for data augmentation of highly variable shape and texture objects A1 Vállez Enano, Noelia A1 Bueno García, María Gloria A1 Déniz Suárez, Óscar A1 Blanco Lanza, Saúl A2 Ecologia K1 Biotecnología K1 Ecología. Medio ambiente K1 Data augmentation K1 Diffeomorphism transform K1 Algae classification K1 Taxon life cycle K1 Pollen classification K1 Glomeruli classification K1 2417.07 Algología (Ficología) AB [EN] Background and objective: Training a deep convolutional neural network (CNN) for automatic image classification requires a large database with images of labeled samples. However, in some applications such as biology and medicine only a few experts can correctly categorize each sample. Experts are able to identify small changes in shape and texture which go unnoticed by untrained people, as well as distinguish between objects in the same class that present drastically different shapes and textures. This means that currently available databases are too small and not suitable to train deep learning models from scratch. To deal with this problem, data augmentation techniques are commonly used to increase the dataset size. However, typical data augmentation methods introduce artifacts or apply distortions to the original image, which instead of creating new realistic samples, obtain basic spatial variations of the original ones.Methods: We propose a novel data augmentation procedure which generates new realistic samples, by combining two samples that belong to the same class. Although the idea behind the method described in this paper is to mimic the variations that diatoms experience in different stages of their life cycle, it has also been demonstrated in glomeruli and pollen identification problems. This new data augmentation procedure is based on morphing and image registration methods that perform diffeomorphic transformations.Results: The proposed technique achieves an increase in accuracy over existing techniques of 0.47%, 1.47%, and 0.23% for diatom, glomeruli and pollen problems respectively.Conclusions: For the Diatom dataset, the method is able to simulate the shape changes in different diatom life cycle stages, and thus, images generated resemble newly acquired samples with intermediate shapes. In fact, the other methods compared obtained worse results than those which were not using data augmentation. For the Glomeruli dataset, the method is able to add new samples with different shapes and degrees of sclerosis (through different textures). This is the case where our proposed DA method is more beneficial, when objects highly differ in both shape and texture. Finally, for the Pollen dataset, since there are only small variations between samples in a few classes and this dataset has other features such as noise which are likely to benefit other existing DA techniques, the method still shows an improvement of the results PB Elsevier SN 0169-2607 LK https://hdl.handle.net/10612/18885 UL https://hdl.handle.net/10612/18885 NO Vallez, N., Bueno, G., Deniz, O., & Blanco, S. (2022). Diffeomorphic transforms for data augmentation of highly variable shape and texture objects. Computer Methods and Programs in Biomedicine, 219, Article e106775. https://doi.org/10.1016/J.CMPB.2022.106775 DS BULERIA. Repositorio Institucional de la Universidad de León RD 30-jun-2024