Sample Augmentation
Brief Description
Generates synthetic samples from a sample data file.
Tags
Learning
Long Description
The application takes a sample data file as generated by the SampleExtraction application and generates synthetic samples to increase the number of available samples.
Parameters
Input samples (in): Vector data file containing samples (OGR format)
Output samples (out): Output vector data file storing new samples(OGR format).
Field Name (field): Name of the field carrying the class name in the input vectors.
Layer Index (layer): Layer index to read in the input vector file.
Label of the class to be augmented (label): Label of the class of the input file for which new samples will be generated.
Number of generated samples (samples): Number of synthetic samples that will be generated.
Field names for excluded features. (exclude): List of field names in the input vector data that will not be generated in the output file.
Augmentation strategy (strategy):
Replicate input samples (replicate): The new samples are generated by replicating input samples which are randomly selected with replacement.
Jitter input samples (jitter): The new samples are generated by adding gaussian noise to input samples which are randomly selected with replacement.
Factor for dividing the standard deviation of each feature (strategy.jitter.stdfactor): The noise added to the input samples will have the standard deviation of the input features divided by the value of this parameter.
Smote input samples (smote): The new samples are generated by using the SMOTE algorithm (http://dx.doi.org/10.1613/jair.953) on input samples which are randomly selected with replacement.
Number of nearest neighbors. (strategy.smote.neighbors): Number of nearest neighbors to be used in the SMOTE algorithm
set user defined seed (seed): Set specific seed. with integer value.
Load otb application from xml file (inxml): Load otb application from xml file
Save otb application to xml file (outxml): Save otb application to xml file
Limitations
None
Authors
OTB-Team
See also
Example of use
in: samples.sqlite
field: class
label: 3
samples: 100
out: augmented_samples.sqlite
exclude: OGC_FID name class originfid
strategy: smote
strategy.smote.neighbors: 5
otbcli_SampleAugmentation -in samples.sqlite -field class -label 3 -samples 100 -out augmented_samples.sqlite -exclude OGC_FID name class originfid -strategy smote -strategy.smote.neighbors 5