MultiImageSamplingRate¶
Compute sampling rate for an input set of images.
Description¶
The application computes sampling rates for a set of input images. Before calling this application, each pair of image and training vectors has to be analysed with the application PolygonClassStatistics. The statistics file is then used to compute the sampling rates for each class in each image. Several types of sampling are implemented. Each one is a combination of a mono-image strategy and a multi-image mode. The mono-image strategies are:
- smallest (default): select the same number of sample in each class so that the smallest one is fully sampled.
- constant: select the same number of samples N in each class (with N below or equal to the size of the smallest class).
- byclass: set the required number for each class manually, with an input CSV file (first column is class name, second one is the required samples number).
The multi-image modes (mim) are proportional, equal and custom. The custom mode lets the user choose the distribution of samples among the images. The different behaviours are described below. Ti(c) and Ni(c) refers respectively to the total number and needed number of samples in image i for class c. Let’s call L the total number of images.
strategy = all
- Same behaviour for all modes: take all samples
strategy = constant: let’s call M the global number of samples required per class. For each image i and each class c:
- if mim = proportional, then Ni( c ) = M * Ti( c ) / sum_k( Tk(c) )
- if mim = equal , then Ni( c ) = M / L
- if mim = custom , then Ni( c ) = Mi where Mi is the custom requested number of samples for image i
strategy = byClass : let’s call M(c) the global number of samples for class c). For each image i and each class c:
- if mim = proportional, then Ni( c ) = M(c) * Ti( c ) / sum_k( Tk(c) )
- if mim = equal , then Ni( c ) = M(c) / L
- if mim = custom , then Ni( c ) = Mi(c) where Mi(c) is the custom requested number of samples for image i and class c
strategy = percent : For each image i and each class c:
- if mim = proportional, then Ni( c ) = p * Ti( c ) where p is the global percentage of samples
- if mim = equal , then Ni( c ) = p * sum_k(Tk(c)]/L where p is the global percentage of samples
- if mim = custom , then Ni( c ) = p(i) * Ti(c) where p(i) is the percentage of samples for image i. c
strategy = total : For each image i and each class c:
- if mim = proportional, then Ni( c ) = total * (sum_k(Ti(k))/sum_kl(Tl(k))) * (Ti(c)/sum_k(Ti(k))) where total is the total number of samples specified.
- if mim = equal , then Ni( c ) = (total / L) * (Ti(c)/sum_k(Ti(k))) where total is the total number of samples specified.
- if mim = custom , then Ni( c ) = total(i) * (Ti(c)/sum_k(Ti(k))) where total(i) is the total number of samples specified for image i.
strategy = smallest class
- if mim = proportional, then the smallest class size (computed globally) is used for the strategy constant+proportional.
- if mim = equal , then the smallest class size (computed globally) is used for the strategy constant+equal.
- if mim = custom , then the smallest class is computed and used for each image separately.
This application has several output images and supports “multi-writing”. Instead of computing and writing each image independently, the streamed image blocks are written in a synchronous way for each output. The output images will be computed strip by strip, using the available RAM to compute the strip size, and a user defined streaming mode can be specified using the streaming extended filenames (type, mode and value). Note that multi-writing can be disabled using the multi-write extended filename option: &multiwrite=false, in this case the output images will be written one by one. Note that multi-writing is not supported for MPI writers.
Parameters¶
Input statistics -il filename1 filename2...
Mandatory
List of statistics files for each input image.
Output sampling rates -out filename [dtype]
Mandatory
Output filename storing sampling rates (CSV format with class name, required samples, total samples, and rate). The given filename will be used with a suffix to indicate the corresponding input index (for instance: rates.csv will give rates_1.csv, rates_2.csv, …).
Sampling strategy -strategy [byclass|constant|smallest|percent|total|all]
Default value: smallest
- Set samples count for each class
Set samples count for each class - Set the same samples counts for all classes
Set the same samples counts for all classes - Set same number of samples for all classes, with the smallest class fully sampled
Set same number of samples for all classes, with the smallest class fully sampled - Use a percentage of the samples available for each class
Use a percentage of the samples available for each class - Set the total number of samples to generate, and use class proportions.
Set the total number of samples to generate, and use class proportions. - Use all samples
Use all samples
Set samples count for each class options¶
Number of samples by class -strategy.byclass.in filename1 filename2...
Mandatory
Number of samples by class (CSV format with class name in 1st column and required samples in the 2nd).In the case of the custom multi-image mode, several inputs may be given for each image.
Set the same samples counts for all classes options¶
Number of samples for all classes -strategy.constant.nb string
Mandatory
Number of samples for all classes.In the case of the custom multi-image mode, several values can be given for each image.
Use a percentage of the samples available for each class options¶
The percentage(s) to use -strategy.percent.p string
Mandatory
The percentage(s) to use In the case of the custom multi-image mode, several values can be given for each image.
Set the total number of samples to generate, and use class proportions. options¶
The number of samples to generate -strategy.total.v string
Mandatory
The number of samples to generateIn the case of the custom multi-image mode, several values can be given for each image.
Multi-Image Mode -mim [proportional|equal|custom]
Default value: proportional
- Proportional
Split the required number of samples proportionally - equal
Equal split of the required number of samples - Custom
Split the number of samples based on the user’s choice.
Examples¶
From the command-line:
otbcli_MultiImageSamplingRate -il stats_1.xml stats_2.xml -out rates.csv -strategy smallest -mim proportional
From Python:
import otbApplication
app = otbApplication.Registry.CreateApplication("MultiImageSamplingRate")
app.SetParameterString("out", "rates.csv")
app.SetParameterString("strategy","smallest")
app.SetParameterString("mim","proportional")
app.ExecuteAndWriteOutput()